The best collection of RLXF models, including RLHF, RLAIF, etc.
lil
Amu
AI & ML interests
None yet
Recent Activity
new activity 14 days ago
Amu/t1-1.5B: Adding `safetensors` variant of this model
liked a Space 14 days ago
safetensors/convert
updated a model 14 days ago
Amu/t1-1.5B
Organizations
None yet
Collections 3
models 14
Amu/t1-1.5B
Updated
• 50 • 1
Amu/supertiny-llama3-0.25B-v0.1
Text Generation • Updated
• 156 • 5
Amu/dpo-qlora-Qwen1.5-0.5B-Chat-xtuner
Text Generation • Updated
• 54
Amu/orpo-phi2
Text Generation • Updated
• 10
Amu/orpo-lora-phi2
Text Generation • Updated
• 167
Amu/spin-phi2
Text Generation • Updated
• 83 • 9
Amu/r-zephyr-7b-beta-qlora
Updated
Amu/dpo-phi2
Text Generation • Updated
• 201 • 2
Amu/zen-moe
Text Generation • Updated
• 18
Amu/zen
Text Generation • Updated
• 21
datasets
None public yet