Quentin Gallouédec's picture

Hiring 💼

Quentin Gallouédec PRO

qgallouedec

huggingface

·

AI & ML interests

None yet

Recent Activity

updated a dataset about 5 hours ago

hf-doc-build/doc-build-dev

updated a dataset about 14 hours ago

hf-doc-build/doc-build

updated a dataset 4 days ago

qgallouedec/test-grpo-vlm-log-completions

View all activity

Organizations

Posts 1

Post

2863

@CohereLabs just released 🌿 Tiny Aya: a fully open-source 3B parameter model that speaks 70+ languages 🌍! But there’s a catch:

Tiny Aya is just a language model. It doesn’t support tool calling, the key capability that turns frontier models into powerful *agents*.
So the real question is:

How hard is it to turn Tiny Aya into an agent?

Turns out… it’s simple, thanks to Hugging Face TRL.
We’re sharing a hands-on example showing how to train Tiny Aya to turn it into a tool-calling agent using TRL, unlocking what could become the first *massively multilingual open agent*.

Small model. Global reach. Agent capabilities.

👉 https://github.com/huggingface/trl/blob/main/examples/notebooks/sft_tool_calling.ipynb

Articles 13

Article

74

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

View all Articles

Papers 4

arxiv:2402.09844

arxiv:2402.03046

arxiv:2208.14928

arxiv:2106.13687

spaces 21

Trackio2

Display tracked experiment data in an interactive view

Trackio

Display an interactive tracking dashboard

Trackio Dev

Visualize and monitor experiment metrics with Trackio dashboard

My Awesome Space

Visualize and manage experiment metrics with Trackio

Trackio Trl Issues

Show a live dashboard of your I/O tracking data

Trl 2048

Show your activity tracking dashboard

models 789

qgallouedec/tiny-aya-global-SFT

qgallouedec/tiny-aya-global-tool-calling-SFT

qgallouedec/my-other-awesome-model

Text Generation • 0.5B • Updated Feb 14 • 1

qgallouedec/my-awesome-model

Text Generation • 0.5B • Updated Feb 14 • 2

qgallouedec/trainer_output

Text Generation • 0.5B • Updated Feb 14 • 2

qgallouedec/test_push_output_4

Text Classification • 87.5k • Updated Feb 14

qgallouedec/qwen2-0.5b-deepmath-grpo

qgallouedec/my-finetuned-model

0.8B • Updated Jan 2 • 4

qgallouedec/Qwen3-0.6B-SFT-20251113165959

Text Generation • 0.6B • Updated Nov 13, 2025 • 2

qgallouedec/Qwen3-0.6B-SFT-20251113163732

Updated Nov 13, 2025

View 789 models

datasets 85

qgallouedec/test-grpo-vlm-log-completions

Viewer • Updated 4 days ago • 435 • 80

qgallouedec/llama_star_formatted

Viewer • Updated Feb 21 • 7.21k • 19

qgallouedec/deepmath-completions-logs2

Viewer • Updated Jan 22 • 48 • 87

qgallouedec/deepmath-completions-logs

Viewer • Updated Jan 13 • 232 • 97 • 1

qgallouedec/Dolci-Think-DPO-7B

Viewer • Updated Nov 28, 2025 • 150k • 11

qgallouedec/biogrid_qa

Viewer • Updated Nov 18, 2025 • 59.4k • 138

qgallouedec/human_gene_interaction_qa_v2

Viewer • Updated Nov 18, 2025 • 79.2k • 12

qgallouedec/human_gene_interaction_qa

Viewer • Updated Nov 17, 2025 • 1.84M • 11

qgallouedec/biogrid

Viewer • Updated Nov 17, 2025 • 2.82M • 343

qgallouedec/trl-metrics

Viewer • Updated Oct 7, 2025 • 148k • 58 • 1

View 85 datasets