AI & ML interests
Machine learning, RLHF
Organizations
weqweasdas/zephyr-7b-dpo-full • Text Generation • 7B • Updated • 9
weqweasdas/zephyr-7b-gemma-dpo • Updated
weqweasdas/zephyr-7b-sft-full • Updated
weqweasdas/zephyr-7b-dpo-qlora • Updated
weqweasdas/gpt2-cpt-dutch • Text Generation • 0.1B • Updated • 9
weqweasdas/zephyr-7b-gemma-sft • Updated
weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6_weight085 • Text Generation • 7B • Updated • 6
weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6 • Text Generation • 7B • Updated • 4
weqweasdas/raft_baseline_zephyr_packing_model6 • Text Generation • 7B • Updated • 5
weqweasdas/raft_baseline_openchat_llama13b_model1 • Text Generation • 7B • Updated • 15
weqweasdas/raft_zephyr_baseline_model1 • Text Generation • 7B • Updated • 11
weqweasdas/raft_baseline_openchat_30k_n32 • Text Generation • 7B • Updated • 4
weqweasdas/raft_openchat_baseline_model1_09 • Text Generation • 7B • Updated • 4
weqweasdas/raft_openchat_5e7_baseline_model1 • Text Generation • 7B • Updated • 5
weqweasdas/ratio_09_c12_model1_lr_2e6_2epoch • Text Generation • 7B • Updated • 4
weqweasdas/ratio_095_c52_model1_lr_2e6_2epoch • Text Generation • 7B • Updated • 7
weqweasdas/rsf_plus_mistral7b_ratio_09_5kbz_model1 • Text Generation • 7B • Updated • 5
Text Classification • 7B • Updated • 945 • 24
weqweasdas/RM-Gemma-2B-Mixture2 • Text Classification • 3B • Updated • 7
weqweasdas/RM-Gemma-2B-Mixture2-Safety30K • Text Classification • 3B • Updated • 6 • 1
Text Classification • 9B • Updated • 30 • 8
Text Classification • 3B • Updated • 454 • 25
weqweasdas/hh_rlhf_rm_open_llama_3b • Text Classification • Updated • 131 • 17