wangrui's picture

wangrui

varuy322

·

varuy322

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

upvoted a paper 9 days ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

upvoted a paper 18 days ago

Shaping capabilities with token-level data filtering

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published 8 days ago • 177

upvoted a paper 9 days ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published 10 days ago • 151

upvoted a paper 18 days ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published 21 days ago • 27

upvoted a paper 22 days ago

RubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine Generation

Paper • 2601.08430 • Published Jan 13 • 59

upvoted a paper 23 days ago

MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity

Paper • 2511.03146 • Published Nov 5, 2025 • 8

upvoted a paper about 1 month ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30, 2025 • 86

upvoted a collection about 1 month ago

Qwen3-VL-Embedding

2 items • Updated Jan 8 • 61

upvoted 2 collections about 2 months ago

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 15 days ago • 52

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 15 days ago • 97

upvoted a collection 2 months ago

Multimodal Implementations

Comprehensive Demo of Multimodal VLMs on the Hub • 24 items • Updated 4 days ago • 11

upvoted a paper 2 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 124

upvoted 2 collections 2 months ago

Multimodal Dataset

88 items • Updated 12 days ago • 9

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 30 items • Updated 8 days ago • 83

upvoted a paper 3 months ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 90

upvoted 4 collections 3 months ago

Olmo 3 Post-training

All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated Dec 23, 2025 • 50

Synthetic Data and Self-Improvement

113 items • Updated Sep 26, 2025 • 9

Reasoning, Thinking, RL and Test-Time Scaling

261 items • Updated Nov 22, 2025 • 15

Papers

654 items • Updated about 3 hours ago • 16

upvoted a paper 3 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 143

upvoted an article 3 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025

•

58