Weijing Huang

waleking

AI & ML interests

Language Models

Organizations

None yet

waleking's activity

upvoted a paper about 1 month ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published Apr 7 • 25

liked a dataset 2 months ago

OpenStellarTeam/Chinese-SimpleQA

Viewer • Updated Dec 16, 2024 • 3k • 282 • 26

liked a dataset 3 months ago

allenai/olmOCR-mix-0225

Viewer • Updated Feb 25 • 259k • 1.76k • 127

upvoted a paper 3 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 52

liked a dataset 3 months ago

Anthropic/EconomicIndex

Viewer • Updated 8 days ago • 3.36k • 2.42k • 279

upvoted a paper 3 months ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published Feb 5 • 17

upvoted an article 3 months ago

Replicating DeepSeek R1 for Information Extraction

Article • Jan 31 • 42

upvoted a paper 4 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 98

liked a Space 4 months ago

📈 Scaling test-time compute

Space • Enhance math problem solving by scaling test-time compute • 563

upvoted an article 5 months ago

Deriving DPO's Loss

Article • Dec 24, 2024 • 28

liked a dataset 7 months ago

m-a-p/MAP-CC

Viewer • Updated Jul 11, 2024 • 1.77B • 2.3k • 69

liked a dataset 8 months ago

Lyte/Reasoner-1o1-v0.3-HQ

Viewer • Updated Sep 18, 2024 • 370 • 25 • 8

liked a dataset 9 months ago

pints-ai/Expository-Prose-V1

Viewer • Updated Aug 12, 2024 • 6.67M • 60 • 19

liked a model 9 months ago

PleIAs/OCRonos-Vintage

Text Generation • Updated Aug 8, 2024 • 321 • 78

liked a dataset 11 months ago

mikex86/stackoverflow-posts

Viewer • Updated Aug 1, 2023 • 58.3M • 3.72k • 53

liked 3 datasets about 1 year ago

liked a model about 1 year ago

shenzhi-wang/Llama3-8B-Chinese-Chat

Text Generation • Updated Jul 4, 2024 • 2.88k • 680

liked a dataset about 1 year ago

YanweiLi/MGM-Pretrain

Viewer • Updated Apr 21, 2024 • 1.27M • 20 • 16