MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published May 2025
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published May 8, 2025
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025
Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer) Article • By elisim and 2 others • Published Jun 16, 2023
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Paper • 2505.00551 • Published May 1, 2025
What is MoE 2.0? Update Your Knowledge about Mixture-of-experts Article • By Kseniase and 1 other • Published Apr 27, 2025
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning Paper • 2505.02835 • Published May 5, 2025
🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly! – Talking About It? Article • By Kseniase • Published Mar 17, 2025
Mini-R1: Reproduce the DeepSeek R1 "aha moment" – an RL tutorial Article • By open-r1 • Published Jan 31, 2025
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published Apr 25, 2025
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published Apr 30, 2025
Open-R1: a fully open reproduction of DeepSeek-R1 Article • By eliebak and 2 others • Published Jan 28, 2025
Unsloth Dynamic 2.0 Quants Collection • 2.0 version of the Unsloth Dynamic GGUF quants; Dynamic 2.0 achieves superior accuracy and outperforms leading quantization methods • 33 items