YangWang92

yangwang92

yangwang92's activity

liked a model 7 days ago

deepseek-ai/DeepSeek-Prover-V2-671B

Text Generation • Updated 6 days ago • 3.47k • 699

liked a dataset 8 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated 9 days ago • 3.91M • 10.1k • 454

upvoted a collection 8 days ago

Qwen3

Collection

27 items • Updated 5 days ago • 524

liked a model 9 days ago

Kwaipilot/SRPO-Qwen-32B

Updated 9 days ago • 129 • 13

upvoted a paper 9 days ago

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Paper • 2504.18415 • Published 11 days ago • 41

upvoted 2 papers 19 days ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 21 days ago • 60

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published 20 days ago • 71

upvoted 3 papers 20 days ago

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published 21 days ago • 12

Seedream 3.0 Technical Report

Paper • 2504.11346 • Published 21 days ago • 53

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Paper • 2504.10766 • Published 22 days ago • 40

liked 2 models 21 days ago

nvidia/Nemotron-H-56B-Base-8K

Text Generation • Updated 20 days ago • 923 • 26

nvidia/Nemotron-H-47B-Base-8K

Text Generation • Updated 14 days ago • 1.33k • 17

liked a model 22 days ago

microsoft/bitnet-b1.58-2B-4T

Text Generation • Updated 6 days ago • 58.8k • 942

liked a model 24 days ago

Skywork/Skywork-OR1-32B-Preview

Updated 21 days ago • 5.2k • 67

liked a dataset 24 days ago

AI-MO/NuminaMath-1.5

Viewer • Updated Feb 10 • 896k • 2.33k • 136

liked a model 24 days ago

Skywork/Skywork-R1V-38B

Image-Text-to-Text • Updated 15 days ago • 12.8k • 124

liked 2 datasets 24 days ago

LLM360/MegaMath

Viewer • Updated 28 days ago • 217M • 70.2k • 83

THU-KEG/PairJudge-432K

Viewer • Updated Feb 19 • 432k • 36 • 1

liked a model 24 days ago

THU-KEG/PairJudge-RM

Updated Feb 19 • 3 • 1

liked a model 25 days ago

google/gemma-3-27b-it

Image-Text-to-Text • Updated Mar 21 • 408k • 1.32k