
Ivan Fioravanti PRO

ivanfioravanti

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

2025-ai-timeline/2025-ai-timeline

upvoted an article 6 days ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

commented on an article 7 days ago

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders



upvoted an article 6 days ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025


upvoted an article 7 days ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9, 2025 • 751

upvoted an article 29 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025 • 564

upvoted an article about 1 month ago

Article

Continuous batching from first principles

Nov 25, 2025 • 297

upvoted a paper about 2 months ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published Nov 7, 2025 • 54

upvoted an article 2 months ago

Article

On the Shifting Global Compute Landscape

Oct 29, 2025


upvoted an article 4 months ago

Article

Introducing Marvis TTS: Real-Time Streaming Speech Synthesis

Aug 27, 2025


upvoted an article 5 months ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024 • 753

upvoted a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

upvoted 2 articles 6 months ago

Article

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

Jul 10, 2025


Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025 • 743

upvoted an article 8 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024 • 429

upvoted a collection 9 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 677

upvoted a collection 12 months ago

DolphinLabeled Datasets

Collection

Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated Jan 6, 2025 • 15

upvoted an article about 1 year ago

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Jan 2, 2025


upvoted 4 papers about 1 year ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 60

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 158

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published Dec 16, 2024 • 43

upvoted an article about 1 year ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Dec 4, 2024

