alexshengzhili/llama3_8b_dpo_0908_preference_4_conference_shuffled_2023 • Text Generation • 8B • Updated Sep 11, 2024 • 7
alexshengzhili/mistral_3_0908_preference_4_conference_shuffled_2023_sft • Text Generation • 7B • Updated Sep 10, 2024 • 8
alexshengzhili/dpo_0908_preference_4_conference_shuffled_2023_checkpoint_30 • Text Generation • 7B • Updated Sep 10, 2024 • 6
alexshengzhili/phi3-dpo_0908_preference_4_conference_shuffled_2023 • Text Generation • 4B • Updated Sep 9, 2024 • 9
alexshengzhili/llama3.1-8b-lora_dpo_0907_preference_iclr2023 • Text Generation • 8B • Updated Sep 8, 2024 • 6
alexshengzhili/llava-lora-dpo-1227lrvtail2000_from_sft-self-sampled-beta-0.5-lr-5e-5-avg-False-epoch-3 • Updated Dec 29, 2023 • 2
alexshengzhili/llava-v1.5-13b-lora-1227-COH-lrv0-3230llava0-5879_interleaved.json • Updated Dec 29, 2023 • 22
alexshengzhili/llava-lora-dpo-1227lrvtail2000_sft-self-sampled-beta-0.5-lr-5e-6-avg-False-epoch-3 • Updated Dec 28, 2023
alexshengzhili/llava-lora-dpo-1227lrvtail2000_sft-self-sampled-beta-0.5-lr-5e-6-avg-False-epoch-2 • Updated Dec 28, 2023
alexshengzhili/llava-lora-dpo-1227lrvtail2000_sft-self-sampled-beta-0.5-lr-5e-5-avg-False-epoch-3 • Updated Dec 28, 2023
alexshengzhili/llava-lora-dpo-1227lrvtail2000_sft-self-sampled-beta-0.5-lr-5e-5-avg-False-epoch-2 • Updated Dec 28, 2023