Llama-4 is out and I couldn't resist cooking something with it... So I came up with LlamaResearcher (https://llamaresearcher.com), your deep-research AI companion!
The workflow behind LlamaResearcher is simple:
- You submit a query
- Your query is evaluated by a Llama 3 guard model, which deems it safe or unsafe
- If your query is safe, it is routed to the Researcher Agent
- The Researcher Agent expands the query into three sub-queries to search the web with
- The web is searched for each of the sub-queries
- The retrieved information is evaluated for relevance against your original query
- The Researcher Agent produces an essay based on the information it gathered, taking care to reference its sources
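The pipeline above can be sketched, very roughly, in plain Python. All function bodies here are illustrative stand-ins, not the project's actual code: the real app uses a Llama guard model for safety, Llama-4 for the agent, and a web-search service for retrieval.

```python
def guard_is_safe(query: str) -> bool:
    """Stand-in for the guard-model safety check."""
    return "attack" not in query.lower()

def expand_query(query: str) -> list[str]:
    """Stand-in for the agent expanding a query into three sub-queries."""
    return [f"{query} overview", f"{query} recent developments", f"{query} criticisms"]

def search_web(sub_query: str) -> list[str]:
    """Stand-in for a sourced web search."""
    return [f"result for: {sub_query}"]

def is_relevant(snippet: str, original_query: str) -> bool:
    """Stand-in for the relevance check against the original query."""
    return original_query.split()[0].lower() in snippet.lower()

def research(query: str) -> str:
    """Run the full sketch pipeline: guard -> expand -> search -> filter -> compose."""
    if not guard_is_safe(query):
        return "Query rejected by the safety check."
    snippets = [s for sq in expand_query(query) for s in search_web(sq)]
    relevant = [s for s in snippets if is_relevant(s, query)]
    # The real agent writes a referenced essay; here we just list the sources.
    return "Essay based on sources:\n" + "\n".join(f"- {s}" for s in relevant)

print(research("quantum computing"))
```

The key design point is that each stage is a separate, swappable block: you can change the guard, the search backend, or the composer without touching the rest of the chain.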
The agent itself is also built with easy-to-use, intuitive blocks:
- LlamaIndex provides the agentic architecture and the integrations with the language models
- Groq serves Llama-4 with lightning-fast inference
- Linkup lets the agent deep-search the web and returns sourced answers
- FastAPI does the heavy lifting of wrapping everything in an elegant API interface
- Redis handles API rate limiting
- Gradio provides a simple but powerful user interface
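To give a feel for the rate-limiting block, here is a minimal fixed-window limiter in plain Python. This is an illustrative sketch, not the app's code: the real setup uses Redis so that counts are shared across API workers, while this in-memory version only works within a single process.

```python
import time

class FixedWindowLimiter:
    """Allow at most `max_requests` per client per time window (in-memory sketch)."""

    def __init__(self, max_requests: int, window_seconds: int):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._counts = {}  # (client_id, window index) -> request count

    def allow(self, client_id: str, now=None) -> bool:
        """Return True if this request fits within the client's current window."""
        now = time.time() if now is None else now
        key = (client_id, int(now // self.window_seconds))
        self._counts[key] = self._counts.get(key, 0) + 1
        return self._counts[key] <= self.max_requests

limiter = FixedWindowLimiter(max_requests=3, window_seconds=60)
print([limiter.allow("client-1", now=0.0) for _ in range(4)])  # [True, True, True, False]
```

With Redis, the same pattern is typically a single `INCR` on a key like `rate:{client}:{window}` with an expiry, which makes the counter atomic across processes.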
Special mention also to Lovable, which helped me build the first draft of the landing page for LlamaResearcher!
If you're curious and want to try LlamaResearcher, you can, completely free and without a subscription, for 30 days from now: https://llamaresearcher.com
And if you're like me and like getting your hands on the code and building stuff on your own machine, I have good news: it's all open source, fully reproducible locally, and Docker-ready. Just head to the GitHub repo (https://github.com/AstraBert/llama-4-researcher) and don't forget to star it if you find it useful!
As always, have fun and feel free to leave your feedback!
Huge week for the xet-team: Llama 4 is the first major model on Hugging Face uploaded with Xet as the backing storage! Every byte downloaded comes through our infrastructure.
Using Xet on Hugging Face is the fastest way to download and iterate on open-source models, and we've proved it with Llama 4, which saw a boost of ~25% across all its models.
We expect builders on the Hub to see even more improvements, helping power innovation across the community.
With the models on our infrastructure, we can peer in and see how well our dedupe performs across the Llama 4 family. On average, we're seeing ~25% dedupe, providing huge savings for everyone who iterates on these state-of-the-art models. The attached image shows a few selected models and how they perform on Xet.
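To illustrate what "~25% dedupe" means, here is a toy chunk-level deduplication example. This is purely illustrative: it hashes fixed-size chunks, whereas a real storage backend like Xet operates on much larger binary model files and more sophisticated chunking.

```python
import hashlib

def chunk_hashes(data: bytes, chunk_size: int = 4) -> list[str]:
    """Split data into fixed-size chunks and hash each one."""
    return [hashlib.sha256(data[i:i + chunk_size]).hexdigest()
            for i in range(0, len(data), chunk_size)]

def dedupe_ratio(old: bytes, new: bytes, chunk_size: int = 4) -> float:
    """Fraction of the new file's chunks already stored for the old file."""
    old_set = set(chunk_hashes(old, chunk_size))
    new_hashes = chunk_hashes(new, chunk_size)
    shared = sum(1 for h in new_hashes if h in old_set)
    return shared / len(new_hashes)

# A "fine-tune" that changes one of four chunks dedupes 75%:
base = b"AAAABBBBCCCCDDDD"
tuned = b"AAAABBBBCCCCXXXX"
print(dedupe_ratio(base, tuned))  # 0.75
```

Only the chunks that actually changed need to be uploaded or stored again, which is where the savings for iterating on model families come from.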
Thanks to the meta-llama team for launching on Xet!
What, How, Where, and How Well? This paper reviews test-time scaling methods and all you need to know about them:
> what to scale: parallel, sequential, hybrid, and internal scaling
> how to scale: SFT, RL, search, and verification
> metrics and evals of test-time scaling
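As a concrete flavor of the parallel family, here is a minimal best-of-N selection sketch: sample N candidate answers, score each with a verifier, keep the best. The candidate list and the verifier below are toy stand-ins, not anything from the paper.

```python
def verifier_score(question: int, answer: int) -> float:
    """Toy verifier: prefers answers closer to 2 * question."""
    return -abs(answer - 2 * question)

def best_of_n(question: int, candidates: list[int]) -> int:
    """Parallel test-time scaling: pick the candidate the verifier scores highest."""
    return max(candidates, key=lambda a: verifier_score(question, a))

# Four "samples" for the question 21; the verifier selects 42.
print(best_of_n(21, [40, 41, 42, 44]))  # 42
```

Sequential methods would instead refine one answer step by step, and hybrid methods combine both; the verifier here is the "verification" axis of the how-to-scale taxonomy.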