Quentin Gallouédec's picture

Quentin Gallouédec PRO

qgallouedec

·

AI & ML interests

None yet

Recent Activity

upvoted a changelog about 5 hours ago

Connect Your MCP Client to the Hugging Face Hub

updated a model 2 days ago

trl-internal-testing/tiny-DeepseekV3ForCausalLM-0528

published a model 2 days ago

trl-internal-testing/tiny-DeepseekV3ForCausalLM-0528

View all activity

Organizations

qgallouedec's activity

upvoted a changelog about 5 hours ago

Changelog

Connect Your MCP Client to the Hugging Face Hub

about 10 hours ago

• 18

upvoted an article 3 days ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

By

and 5 others •

4 days ago

• 37

upvoted an article 4 days ago

Article

🐯 Liger GRPO meets TRL

By

and 5 others •

13 days ago

• 36

upvoted a paper 8 days ago

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

Paper • 2504.11354 • Published Apr 15 • 5

upvoted a paper 11 days ago

INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning

Paper • 2505.07291 • Published 26 days ago • 12

upvoted a paper 21 days ago

Layer Normalization

Paper • 1607.06450 • Published Jul 21, 2016 • 3

upvoted an article 25 days ago

Article

Vision Language Models (Better, Faster, Stronger)

By

and 4 others •

26 days ago

• 417

upvoted an article 26 days ago

Article

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

By

and 6 others •

27 days ago

• 57

upvoted an article about 2 months ago

Article

Cohere on Hugging Face Inference Providers 🔥

By

and 6 others •

Apr 16

• 126

upvoted a paper 2 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 128

upvoted an article 2 months ago

Article

Open R1: Update #4

By

and 3 others •

Mar 26

• 48

upvoted a paper 2 months ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published Mar 24 • 30

upvoted a collection 3 months ago

Gemma 3 Release

24 items • Updated 8 days ago • 380

upvoted an article 3 months ago

Article

Open R1: Update #3

By

and 9 others •

Mar 11

• 292

upvoted a paper 3 months ago

Proximal Policy Optimization Algorithms

Paper • 1707.06347 • Published Jul 20, 2017 • 8

upvoted an article 3 months ago

Article

The N Implementation Details of RLHF with PPO

By

and 2 others •

Oct 24, 2023

• 58

upvoted 2 papers 3 months ago

ZeRO: Memory Optimizations Toward Training Trillion Parameter Models

Paper • 1910.02054 • Published Oct 4, 2019 • 6

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 115

upvoted a paper 4 months ago

Presumed Cultural Identity: How Names Shape LLM Responses

Paper • 2502.11995 • Published Feb 17 • 11

upvoted an article 4 months ago

Article

Open R1: Update #2

By

and 6 others •

Feb 10

• 214