Efficient LLM Pretraining: Packed Sequences and Masked Attention
By sirluk • Oct 7, 2024