aman prakash

MLap

AI & ML interests

None yet

Recent Activity

Organizations

None yet

MLap's activity

reacted to clem's post with 🔥 about 19 hours ago
view post
Post
2514
I was chatting with @peakji , one of the cofounders of Manu AI, who told me he was on Hugging Face (very cool!).

He shared an interesting insight which is that agentic capabilities might be more of an alignment problem rather than a foundational capability issue. Similar to the difference between GPT-3 and InstructGPT, some open-source foundation models are simply trained to 'answer everything in one response regardless of the complexity of the question' - after all, that's the user preference in chatbot use cases. Just a bit of post-training on agentic trajectories can make an immediate and dramatic difference.

As a thank you to the community, he shared 100 invite code first-come first serve, just use “HUGGINGFACE” to get access!
·
upvoted an article about 1 month ago
view article
Article

Open-source DeepResearch – Freeing our search agents

1.14k
upvoted 3 articles about 1 month ago
view article
Article

We now support VLMs in smolagents!

91
view article
Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

146
view article
Article

SmolVLM - small yet mighty Vision Language Model

212
upvoted an article about 2 months ago
view article
Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

By ariG23498
14