Mohammed Hamdy

mmhamdy

AI & ML interests

TechBio | AI4Sci | NLP | Reinforcement Learning

Recent Activity

Organizations

Massive Text Embedding Benchmark's profile picture Blog-explorers's profile picture Hugging Face for Computer Vision's profile picture ASAS AI's profile picture ZeroGPU Explorers's profile picture Social Post Explorers's profile picture Cohere Labs Community's profile picture M4-ai's profile picture LLMem's profile picture Hugging Face Discord Community's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture MOTH Lab's profile picture

mmhamdy's activity

reacted to AdinaY's post with πŸ‘ about 5 hours ago
view post
Post
2434
RoboBrain 2.0πŸ”₯ OPEN embedded brain model by BAAIBeijing

BAAI/RoboBrain2.0-7B

✨ 7B - Apache 2.0 / 32B coming soon
✨ Supports multiple images, long videos, and high-resolution visuals
✨ Spatial + temporal reasoning
✨ Real-time memory & scene graphs
upvoted an article 6 days ago
view article
Article

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

By nvidia and 2 others β€’
β€’ 19
upvoted an article 13 days ago
upvoted an article about 2 months ago
view article
Article

Tiny Agents: a MCP-powered agent in 50 lines of code

By julien-c β€’
β€’ 267
posted an update 2 months ago
view post
Post
1636
What inspired the Transformer architecture in the "Attention Is All You Need" paper? And how were various ideas combined to create this groundbreaking model?

In this lengthy article, I explore the story and the origins of some of the ideas introduced in the paper. We'll explore everything from the fundamental attention mechanism that lies at its heart to the surprisingly simple explanation for its name, Transformer.

πŸ’‘ Examples of ideas explored in the article:

βœ… What was the inspiration for the attention mechanism?
βœ… How did we go from attention to self-attention?
βœ… Did the team have any other names in mind for the model?

and more...

I aim to tell the story of Transformers as I would have wanted to read it, and hopefully, one that appeals to others interested in the details of this fascinating idea. This narrative draws from video interviews, lectures, articles, tweets/Xs, and some digging into the literature. I have done my best to be accurate, but errors are possible. If you find inaccuracies or have any additions, please do reach out, and I will gladly make the necessary updates.

Read the article: https://huggingface.co/blog/mmhamdy/pandemonium-the-transformers-story
published an article 2 months ago
published an article 3 months ago
view article
Article

Osirian AI: A Call For The Resurrection And Reuse Of Deep Learning Models.

By mmhamdy β€’
upvoted an article 3 months ago
view article
Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

By saurabhdash and 3 others β€’
β€’ 75