·
AI & ML interests
Machine learning, RLHF
Organizations
weqweasdas/zephyr-7b-dpo-full
Text Generation
• 7B • Updated • 5
weqweasdas/zephyr-7b-gemma-dpo
Updated
weqweasdas/zephyr-7b-sft-full
Updated
weqweasdas/zephyr-7b-dpo-qlora
Updated
weqweasdas/gpt2-cpt-dutch
Text Generation
• 0.1B • Updated • 7
weqweasdas/zephyr-7b-gemma-sft
Updated
weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6_weight085
Text Generation
• 7B • Updated weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6
Text Generation
• 7B • Updated weqweasdas/raft_baseline_zephyr_packing_model6
Text Generation
• 7B • Updated • 1
weqweasdas/raft_baseline_openchat_llama13b_model1
Text Generation
• 7B • Updated • 2
weqweasdas/raft_zephyr_baseline_model1
Text Generation
• 7B • Updated • 1
weqweasdas/raft_baseline_openchat_30k_n32
Text Generation
• 7B • Updated • 3
weqweasdas/raft_openchat_baseline_model1_09
Text Generation
• 7B • Updated • 1
weqweasdas/raft_openchat_5e7_baseline_model1
Text Generation
• 7B • Updated • 2
weqweasdas/ratio_09_c12_model1_lr_2e6_2epoch
Text Generation
• 7B • Updated • 2
weqweasdas/ratio_095_c52_model1_lr_2e6_2epoch
Text Generation
• 7B • Updated • 2
weqweasdas/rsf_plus_mistral7b_ratio_09_5kbz_model1
Text Generation
• 7B • Updated • 1
Text Classification
• 7B • Updated • 2.95k
• 25
weqweasdas/RM-Gemma-2B-Mixture2
Text Classification
• 3B • Updated • 1
weqweasdas/RM-Gemma-2B-Mixture2-Safety30K
Text Classification
• 3B • Updated • 11
• 1
Text Classification
• 9B • Updated • 11
• 8
Text Classification
• 3B • Updated • 202
• 25
weqweasdas/hh_rlhf_rm_open_llama_3b
Text Classification
• Updated • 191
• 17