mltrials

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

SmolLM3: smol, multilingual, long-context reasoner

upvoted a paper 23 days ago

Scaling Test-time Compute for LLM Agents

updated a model about 1 month ago

mltrials/opt-350m-lora

View all activity

Organizations

None yet

upvoted an article 3 days ago

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

4 days ago

• 479

upvoted a paper 23 days ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published 26 days ago • 61

updated a model about 1 month ago

mltrials/opt-350m-lora

Updated Jun 9

published a model about 1 month ago

mltrials/opt-350m-lora

Updated Jun 9

liked a model 4 months ago

meta-llama/Meta-Llama-3-8B-Instruct

Text Generation • 8B • Updated 23 days ago • 1.44M • • 4.06k

liked a model 6 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 880k • • 12.5k

upvoted 3 papers 6 months ago

upvoted 5 articles 6 months ago

Article

🌁#81: Key AI Concepts to Follow in 2025

•

Dec 23, 2024

• 24

Article

Fine-tune ModernBERT for text classification using synthetic data

•

Dec 30, 2024

• 38

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

•

Jan 2

• 41

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

•

Jan 3

• 37

Article

Accelerating Language Model Inference with Mixture of Attentions

and 1 other •

Jan 7

• 24

liked a Space 6 months ago

463

2024 AI Timeline

📈

View and filter AI model releases in 2024

mltrials

AI & ML interests

Recent Activity

Organizations

mltrials's activity

SmolLM3: smol, multilingual, long-context reasoner

🌁#81: Key AI Concepts to Follow in 2025

Fine-tune ModernBERT for text classification using synthetic data

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

Accelerating Language Model Inference with Mixture of Attentions

2024 AI Timeline