Shuai Wang

Shuaiii

AI & ML interests

None yet

Organizations

None yet

Shuaiii's activity

liked a dataset 3 days ago

cais/hle

Viewer • Updated 17 days ago • 2.5k • 9.05k • 313

upvoted a paper 4 days ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published 5 days ago • 12

upvoted a paper 5 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 6 days ago • 228

upvoted a paper 7 days ago

Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs

Paper • 2504.07866 • Published 10 days ago • 8

upvoted a paper 9 days ago

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published Mar 13 • 34

upvoted a collection 9 days ago

InternVL3

Collection • 34 items • Updated about 17 hours ago • 50

upvoted a paper 9 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 11 days ago • 113

upvoted 3 papers 11 days ago

upvoted a paper 12 days ago

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published 13 days ago • 94

upvoted a collection 15 days ago

Llama 4

Collection • Llama 4 release • 10 items • Updated 15 days ago • 438

upvoted a paper 16 days ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 18 days ago • 52

upvoted a paper 18 days ago

Scaling Language-Free Visual Representation Learning

Paper • 2504.01017 • Published 19 days ago • 26

upvoted a collection 20 days ago

Meta's Llama 3.1 models & evals

Collection • 17 items • Updated Dec 13, 2024 • 127

upvoted an article 20 days ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • Mar 12 • 392

upvoted a paper 24 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 26 days ago • 139

liked 2 datasets 25 days ago

OpenGVLab/ShareGPT-4o

Viewer • Updated Aug 17, 2024 • 59.4k • 8.4k • 163

a-m-team/AM-DeepSeek-R1-Distilled-1.4M

Preview • Updated 22 days ago • 13.1k • 119