umangkaushik

ubermenchh

AI & ML interests

None yet

Recent Activity

updated a model about 8 hours ago
ubermenchh/Qwen2.5-0.5B-openr1-math
published a model about 8 hours ago
ubermenchh/Qwen2.5-0.5B-openr1-math
upvoted a collection 3 days ago
🧠 Reasoning datasets
View all activity

Organizations

Social Post Explorers's profile picture Hugging Face Discord Community's profile picture

ubermenchh's activity

upvoted an article 14 days ago
view article
Article

The N Implementation Details of RLHF with PPO

39
New activity in ubermenchh/SmolLM2-DPO 20 days ago

details pls

1
#1 opened 20 days ago by
archit11