12 2 29

garrett galloway PRO

RecViking

recreationalviking

AI & ML interests

None yet

Recent Activity

liked a dataset 15 days ago

wikimedia/structured-wikipedia

new activity about 1 month ago

RecViking/Mistral-Medium-3.5-128B-NVFP4:Fix Transformers config

updated a model about 1 month ago

RecViking/Mistral-Medium-3.5-128B-NVFP4

View all activity

Organizations

liked a dataset 15 days ago

wikimedia/structured-wikipedia

Viewer • Updated 24 days ago • 10.5M • 15.4k • 357

New activity in RecViking/Mistral-Medium-3.5-128B-NVFP4 about 1 month ago

Fix Transformers config

#1 opened about 1 month ago by

juliendenize

updated 2 models about 1 month ago

RecViking/Mistral-Medium-3.5-128B-NVFP4

74B • Updated May 9 • 105k • 7

RecViking/Mistral-Medium-3.5-128B-GGUF

125B • Updated May 9 • 19.8k

published 2 models about 1 month ago

RecViking/Mistral-Medium-3.5-128B-GGUF

125B • Updated May 9 • 19.8k

RecViking/Mistral-Medium-3.5-128B-NVFP4

74B • Updated May 9 • 105k • 7

New activity in Qwen/Qwen3.6-27B about 2 months ago

what kind of effect if I increase the expert number used in lm studio?

#11 opened about 2 months ago by

Raffaelelu

liked 3 models 4 months ago

New activity in Qwen/Qwen3.5-397B-A17B 4 months ago

will there be a smaller version?

👍🔥 12

#16 opened 4 months ago by

iojvsuynv

upvoted a collection 5 months ago

Cerebras REAP

Collection

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated Feb 25 • 145

New activity in Qwen/Qwen3-Next-80B-A3B-Instruct-FP8 7 months ago

I find qwen3 next exceptional, but too big.

#7 opened 7 months ago by

ZeroWw

New activity in openai/gpt-oss-20b 10 months ago

Unable to load gpt-oss-20b on dual L40 (48GB) GPUs with vLLM

#136 opened 10 months ago by

yjban

liked 2 models about 1 year ago

Qwen/Qwen3-32B

Text Generation • 33B • Updated Jul 26, 2025 • 2.99M • • 700

microsoft/Phi-4-reasoning-plus

Text Generation • 15B • Updated Nov 24, 2025 • 24.9k • 343

New activity in microsoft/phi-4 over 1 year ago

Phi-4 with Tools

#28 opened over 1 year ago by

tahafatih

liked a dataset over 1 year ago

GAIR/LIMR

Viewer • Updated Feb 17, 2025 • 1.39k • 521 • 33

liked a model over 1 year ago

microsoft/OmniParser-v2.0

Updated Mar 28, 2025 • 63.3k • 1.34k

reacted to lewtun's post with 🔥 over 1 year ago

Post

10556

We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open!

🧪 Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1.

🧠 Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code.

🔥 Step 3: show we can go from base model -> SFT -> RL via multi-stage training.

Follow along: https://github.com/huggingface/open-r1

5 replies

garrett galloway PRO

AI & ML interests

Recent Activity

Organizations

RecViking's activity

Fix Transformers config

what kind of effect if I increase the expert number used in lm studio?

will there be a smaller version?

I find qwen3 next exceptional, but too big.

Unable to load gpt-oss-20b on dual L40 (48GB) GPUs with vLLM

Phi-4 with Tools