8 11 3

Dayiheng Liu

Losin94

AI & ML interests

None yet

Recent Activity

authored a paper 19 days ago

Qwen3 Technical Report

upvoted a paper 19 days ago

WorldPM: Scaling Human Preference Modeling

upvoted a paper 19 days ago

Qwen3 Technical Report

View all activity

Organizations

Losin94's activity

authored a paper 19 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published 23 days ago • 182

upvoted 2 papers 19 days ago

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published 22 days ago • 33

Qwen3 Technical Report

Paper • 2505.09388 • Published 23 days ago • 182

authored 2 papers 21 days ago

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published 22 days ago • 78

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published 22 days ago • 33

authored a paper 3 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

upvoted a paper 3 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

authored 2 papers 4 months ago

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26 • 71

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published Jan 24 • 34

upvoted a paper 4 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 91

authored a paper 5 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

upvoted a paper 5 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

authored a paper 5 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 98

upvoted a paper 5 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 98

authored a paper 5 months ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 76

upvoted a paper 5 months ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 76

authored a paper 5 months ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 53

upvoted a paper 5 months ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 53

authored 2 papers 6 months ago

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

Paper • 2310.05492 • Published Oct 9, 2023 • 2

ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training

Paper • 2001.04063 • Published Jan 13, 2020