AmberYifan/Llama-3-8B-Instruct-wildfeedback-RPO-iterDPO-iter1 Text Generation • 0.0B • Updated 10 days ago • 11
AmberYifan/Llama-3-8B-Instruct-wildfeedback-RPO-iterDPO-iter1 Text Generation • 0.0B • Updated 10 days ago • 11
AmberYifan/Llama-3-8B-Instruct-wildfeedback-RPO-DRIFT-iter1 Text Generation • 0.0B • Updated 10 days ago • 13
AmberYifan/Llama-3-8B-Instruct-wildfeedback-RPO-DRIFT-iter1 Text Generation • 0.0B • Updated 10 days ago • 13
AmberYifan/qwen3-4b-thinking-full-pretrain-mix-low-tweet-1m-en-gpt-sft Text Generation • 4B • Updated 11 days ago • 12
AmberYifan/qwen3-4b-thinking-full-pretrain-mix-low-tweet-1m-en-gpt-sft Text Generation • 4B • Updated 11 days ago • 12
AmberYifan/qwen3-4b-thinking-full-pretrain-mix-mid-tweet-1m-en-gpt-sft Text Generation • 4B • Updated 11 days ago • 8
AmberYifan/qwen3-4b-thinking-full-pretrain-mix-mid-tweet-1m-en-gpt-sft Text Generation • 4B • Updated 11 days ago • 8
AmberYifan/qwen3-4b-thinking-full-pretrain-mix-high-tweet-1m-en-gpt-sft Text Generation • 4B • Updated 11 days ago • 9
AmberYifan/qwen3-4b-thinking-full-pretrain-mix-high-tweet-1m-en-gpt-sft Text Generation • 4B • Updated 11 days ago • 9
AmberYifan/qwen3-4b-thinking-full-pretrain-junk-tweet-1m-en-gpt-sft Text Generation • 4B • Updated 11 days ago • 13
AmberYifan/qwen3-4b-thinking-full-pretrain-junk-tweet-1m-en-gpt-sft Text Generation • 4B • Updated 11 days ago • 13
AmberYifan/qwen3-4b-thinking-full-pretrain-control-tweet-1m-en-gpt-sft Text Generation • 4B • Updated 11 days ago • 13
AmberYifan/qwen3-4b-thinking-full-pretrain-control-tweet-1m-en-gpt-sft Text Generation • 4B • Updated 11 days ago • 13
AmberYifan/qwen3-4b-thinking-full-pretrain-mix-low-tweet-1m-en-gpt Text Generation • 4B • Updated 11 days ago • 28
AmberYifan/qwen3-4b-thinking-full-pretrain-mix-low-tweet-1m-en-gpt Text Generation • 4B • Updated 11 days ago • 28