Hugging Face Smol Cluster

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

thomwolf authored a paper 18 days ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

lvwerra authored a paper 18 days ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

guipenedo authored a paper 18 days ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

View all activity

thomwolf

authored a paper 18 days ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 19 days ago • 61

lvwerra

authored a paper 18 days ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 19 days ago • 61

guipenedo

authored a paper 18 days ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 19 days ago • 61

clem

posted an update 25 days ago

Post

2582

We got a visitor to the office today!

pollen-robotics ,

lerobot ,

unitreerobotics meetings!

1 reply

loubnabnl

authored a paper about 1 month ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5 • 42

guipenedo

authored a paper about 1 month ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5 • 42

thomwolf

authored a paper about 1 month ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 113

clem

posted an update about 2 months ago

Post

6845

Today, we're unveiling two new open-source AI robots! HopeJR for $3,000 & Reachy Mini for $300 🤖🤖🤖

Let's go open-source AI robotics!

7 replies

clem

posted an update about 2 months ago

Post

3455

It's just become easier to share your apps on the biggest AI app store (aka HF spaces) for unlimited storage, more visibility and community interactions.

Just pick a React, Svelte, or Vue template when you create your space or add app_build_command: npm run build in your README's YAML and app_file: build/index.html in your README's YAML block.

Or follow this link: https://huggingface.co/new-space?sdk=static

Let's build!

1 reply

clem

posted an update about 2 months ago

Post

3828

Playing with Veo3 this morning. Share your prompt if you want me to create videos for you (bonus point if they funnily reference HF/open-source). These videos are "a cat on the moon rapping "I love Hugging Face""!

26 replies

clefourrier

posted an update about 2 months ago

Post

953

Always surprised that so few people actually read the FineTasks blog, on
✨how to select training evals with the highest signal✨

If you're serious about training models without wasting compute on shitty runs, you absolutely should read it!!

An high signal eval actually tells you precisely, during training, how wel & what your model is learning, allowing you to discard the bad runs/bad samplings/...!

The blog covers in depth prompt choice, metrics, dataset, across languages/capabilities, and my fave section is "which properties should evals have"👌
(to know on your use case how to select the best evals for you)

Blog: HuggingFaceFW/blogpost-fine-tasks

2 replies

loubnabnl

posted an update about 2 months ago

Post

3209

SmolVLM is now available on PocketPal — you can run it offline on your smartphone to interpret the world around you. 🌍📱

And check out this real-time camera demo by @ngxson , powered by llama.cpp:
https://github.com/ngxson/smolvlm-realtime-webcam
https://x.com/pocketpal_ai

3 replies

clem

posted an update 2 months ago

Post

3168

Very cool to see

pytorch contributing on Hugging Face. Time to follow them to see what they're cooking!

2 replies

lewtun

authored a paper 2 months ago

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

Paper • 2504.11354 • Published Apr 15 • 6

clem

posted an update 2 months ago

Post

3977

nvidia dominating the top trending open datasets these days!

http://hf.co/datasets

clem

posted an update 2 months ago

Post

4081

What are you using to evaluate models or AI systems? So far we're building lighteval & leaderboards on the hub but still feels early & a lot more to build. What would be useful to you?

6 replies

clem

posted an update 2 months ago

Post

1670

LeRobot-worldwide-hackathon is already scheduled in 30 cities all over the world!

Check if there's one in your city here: LeRobot-worldwide-hackathon/worldwide-map

clem

posted an update 2 months ago

Post

1556

The

meta-llama org just crossed 40,000 followers on Hugging Face. Grateful for all their impact on the field sharing the Llama weights openly and much more!

We need more of this from all other big tech to make the AI more open, collaborative and beneficial to all!

clem

posted an update 3 months ago

Post

4053

Energy is a massive constraint for AI but do you even know what energy your chatGPT convos are using?

We're trying to change this by releasing ChatUI-energy, the first interface where you see in real-time what energy your AI conversations consume. Great work from @jdelavande powered by spaces & TGI, available for a dozen of open-source models like Llama, Mistral, Qwen, Gemma and more.

jdelavande/chat-ui-energy

Should all chat interfaces have this? Just like ingredients have to be shown on products you buy, we need more transparency in AI for users!

3 replies

clem

posted an update 3 months ago

Post

3000

Just crossed half a million public apps on Hugging Face. A new public app is created every minute these days 🤯🤯🤯

What's your favorite? http://hf.co/spaces

3 replies

AI & ML interests

Recent Activity

Team members 17

HFSmolCluster's activity