Elie Bakouch's picture

Elie Bakouch

eliebak

·

AI & ML interests

Training LLM's @ 🤗

Recent Activity

upvoted an article about 24 hours ago

FineWeb2-C: Help Build Better Language Models in Your Language

liked a model 1 day ago

deepseek-ai/DeepSeek-R1-0528

liked a model 3 days ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

View all activity

Organizations

eliebak's activity

upvoted an article about 24 hours ago

Article

FineWeb2-C: Help Build Better Language Models in Your Language

By

and 5 others •

Dec 23, 2024

• 20

upvoted 2 collections 5 days ago

👩‍💻 OlympicCoder

Reasoning datasets and models for competitive coding • 4 items • Updated 23 days ago • 17

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated 17 days ago • 148

upvoted an article 10 days ago

Article

Interactive Tools for machine learning, deep learning, and math

By

•

10 days ago

• 40

upvoted a paper 17 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published 22 days ago • 181

upvoted an article 21 days ago

Article

The Transformers Library: standardizing model definitions

By

and 3 others •

22 days ago

• 110

upvoted 2 collections 24 days ago

Common Pile

Datasets in the Common Pile. • 28 items • Updated Mar 22 • 6

INTELLECT-2

INTELLECT-2 is a 32 billion parameter language model with globally distributed reinforcement learning. • 3 items • Updated 25 days ago • 22

upvoted a paper 26 days ago

Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers

Paper • 2505.04842 • Published 29 days ago • 12

upvoted a paper about 1 month ago

Practical Efficiency of Muon for Pretraining

Paper • 2505.02222 • Published May 4 • 37

upvoted 2 articles about 1 month ago

Article

Bamba-9B-v2 - Fast and powerful!

By

and 12 others •

Apr 29

• 32

Article

PipelineRL

By

and 3 others •

Apr 25

• 26

upvoted a paper about 1 month ago

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

Paper • 2504.11651 • Published Apr 15 • 28

upvoted an article about 1 month ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

By

•

Apr 25

• 267

upvoted 3 articles about 2 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

By

•

Apr 18

• 37

Article

Cohere on Hugging Face Inference Providers 🔥

By

and 6 others •

Apr 16

• 126

Article

Comparing sub 50GB Llama 4 Scout quants (KLD/Top P)

By

•

Apr 9

• 40

upvoted 2 papers about 2 months ago

Rethinking Reflection in Pre-Training

Paper • 2504.04022 • Published Apr 5 • 79

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 188

upvoted an article about 2 months ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

By

and 6 others •

Apr 5

• 144