1 13 142

andthattoo

https://twitter.com/andthatto

AI & ML interests

Synthetic data, verifiable information retrieval

Recent Activity

liked a model 14 days ago

moonshotai/Kimi-K2-Instruct

liked a model 18 days ago

rasbt/llama-3.2-from-scratch

upvoted an article 22 days ago

Finally, a Replacement for BERT: Introducing ModernBERT

View all activity

Organizations

upvoted an article 22 days ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

and 14 others •

Dec 19, 2024

• 670

upvoted an article 2 months ago

Article

Context Parallelism

•

Aug 13, 2024

• 21

upvoted a paper 5 months ago

LightThinker: Thinking Step-by-Step Compression

Paper • 2502.15589 • Published Feb 21 • 29

upvoted 2 papers 6 months ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 70

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 238

upvoted an article 6 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

and 5 others •

Feb 4

• 99

upvoted 3 papers 6 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 123

ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Paper • 2501.10132 • Published Jan 17 • 22

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 106

upvoted a paper about 1 year ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 102

upvoted a collection about 1 year ago

Function Calling v3

Collection

Models fine-tuned for function-calling • 14 items • Updated Apr 27, 2024 • 21

upvoted a paper about 1 year ago

RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content

Paper • 2403.13031 • Published Mar 19, 2024 • 1

upvoted a paper over 1 year ago

RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Paper • 2401.18059 • Published Jan 31, 2024 • 46

andthattoo

AI & ML interests

Recent Activity

Organizations

andthattoo's activity

Finally, a Replacement for BERT: Introducing ModernBERT

Context Parallelism

DABStep: Data Agent Benchmark for Multi-step Reasoning