Hafedh Hichri's picture

Hafedh Hichri

not-lain

AI & ML interests

custom AI models with HF integration, HuggingFace fellow ๐Ÿค—

Recent Activity

Organizations

Stanford AI's profile picture AI FILMS's profile picture OpenGVLab's profile picture MusicAI's profile picture BigScience Biomedical Datasets's profile picture OpenVINO Toolkit's profile picture Hugging Face Fellows's profile picture Gradio-Blocks-Party's profile picture scikit-learn's profile picture DeepGHS's profile picture Open-Source AI Meetup's profile picture lora concepts library's profile picture The introspector project's profile picture Arabic Machine Learning 's profile picture Literally Me FRFR Research Society's profile picture East China Normal University's profile picture Kornia AI's profile picture Tune a video concepts library's profile picture Keras Dreambooth Event's profile picture AI Zero to Hero's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture The Waifu Research Department's profile picture AI Indonesia Community's profile picture M.O.F.U.'s profile picture ShoukanLabs's profile picture Blog-explorers's profile picture Arabic Clip's profile picture BangumiBase's profile picture CyberHarem's profile picture Touhou AI Experimental Group (MOFU)'s profile picture Tensor Diffusion's profile picture OpenOrca's profile picture huggingPartyParis's profile picture Multi๐Ÿค–Transformers's profile picture Team Tonic's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture LocalLLaMA's profile picture BrainPulse's profile picture MLX Community's profile picture INNOVA AI's profile picture Narra's profile picture Social Post Explorers's profile picture Cohere Labs Community's profile picture Tunisia.AI's profile picture M4-ai's profile picture Dev Mode Explorers's profile picture Chinese LLMs on Hugging Face's profile picture Paris AI Running Club's profile picture AI4Health's profile picture Stable Diffusion Community (Unofficial, Non-profit)'s profile picture Hugging Face for Legal's profile picture Hugging Face Discord Community's profile picture phxia's profile picture Arilio's profile picture Data Tonic (Alignment Lab)'s profile picture Juritech's profile picture Nerdy Face's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture Donut Earthers ๐Ÿฉ's profile picture None yet's profile picture Hugging Face Agents Course's profile picture Bitsandbytes Community's profile picture

not-lain's activity

reacted to danielhanchen's post with ๐Ÿš€โค๏ธ๐Ÿ”ฅ๐Ÿค— 6 days ago
reacted to as-cle-bert's post with ๐Ÿ”ฅ 7 days ago
view post
Post
2837
Llama-4 is out and I couldn't resist but to cook something with it... So I came up with ๐‹๐ฅ๐š๐ฆ๐š๐‘๐ž๐ฌ๐ž๐š๐ซ๐œ๐ก๐ž๐ซ (https://llamaresearcher.com), your deep-research AI companion!๐Ÿ”Ž

The workflow behind ๐—Ÿ๐—น๐—ฎ๐—บ๐—ฎ๐—ฅ๐—ฒ๐˜€๐—ฒ๐—ฎ๐—ฟ๐—ฐ๐—ต๐—ฒ๐—ฟ is simple:
๐Ÿ’ฌ You submit a query
๐Ÿ›ก๏ธ Your query is evaluated by Llama 3 guard model, which deems it safe or unsafe
๐Ÿง  If your query is safe, it is routed to the Researcher Agent
โš™๏ธ The Researcher Agent expands the query into three sub-queries, with which to search the web
๐ŸŒ The web is searched for each of the sub-queries
๐Ÿ“Š The retrieved information is evaluated for relevancy against your original query
โœ๏ธ The Researcher Agent produces an essay based on the information it gathered, paying attention to referencing its sources

The agent itself is also built with easy-to-use and intuitive blocks:
๐Ÿฆ™ LlamaIndex provides the agentic architecture and the integrations with the language models
โšกGroq makes Llama-4 available with its lightning-fast inference
๐Ÿ”Ž Linkup allows the agent to deep-search the web and provides sourced answers
๐Ÿ’ช FastAPI does the heavy loading with wrapping everything within an elegant API interface
โฑ๏ธ Redis is used for API rate limiting
๐ŸŽจ Gradio creates a simple but powerful user interface

Special mention also to Lovable, which helped me build the first draft of the landing page for LlamaResearcher!๐Ÿ’–

If you're curious and you want to try LlamaResearcher, you can - completely for free and without subscription - for 30 days from now โžก๏ธ https://llamaresearcher.com
And if you're like me, and you like getting your hands in code and build stuff on your own machine, I have good news: this is all open-source, fully reproducible locally and Docker-ready๐Ÿ‹
Just go to the GitHub repo: https://github.com/AstraBert/llama-4-researcher and don't forget to star it, if you find it useful!โญ

As always, have fun and feel free to leave your feedbackโœจ
  • 2 replies
ยท
upvoted an article 8 days ago
view article
Article

Mixture of Experts Explained

โ€ข 554
upvoted an article 9 days ago
view article
Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

โ€ข 140
reacted to jsulz's post with ๐Ÿ”ฅ 10 days ago
view post
Post
3580
Huge week for xet-team as Llama 4 is the first major model on Hugging Face uploaded with Xet providing the backing! Every byte downloaded comes through our infrastructure.

Using Xet on Hugging Face is the fastest way to download and iterate on open source models and we've proved it with Llama 4 giving a boost of ~25% across all models.

We expect builders on the Hub to see even more improvements, helping power innovation across the community.

With the models on our infrastructure, we can peer in and see how well our dedupe performs across the Llama 4 family. On average, we're seeing ~25% dedupe, providing huge savings to the community who iterate on these state-of-the-art models. The attached image shows a few selected models and how they perform on Xet.

Thanks to the meta-llama team for launching on Xet!
reacted to hesamation's post with โค๏ธ 12 days ago
view post
Post
2681
What, How, Where, and How Well? This paper reviews test-time scaling methods and all you need to know about them:
> parallel, sequential, hybrid, internal scaling
> how to scale (SFT, RL, search, verification)
> metrics and evals of test-time scaling

๐Ÿ”—paper: What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models (2503.24235)

If you want to learn what inference-time compute scaling is @rasbt has a great blog post on that:
https://magazine.sebastianraschka.com/p/state-of-llm-reasoning-and-inference-scaling