1 11 9

Haoling Li

Ringo1110

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

IterPref: Focal Preference Learning for Code Generation via Iterative Debugging

upvoted a paper 5 days ago

IterPref: Focal Preference Learning for Code Generation via Iterative Debugging

commented on a paper 5 days ago

IterPref: Focal Preference Learning for Code Generation via Iterative Debugging

View all activity

Organizations

Ringo1110's activity

authored a paper 5 days ago

IterPref: Focal Preference Learning for Code Generation via Iterative Debugging

Paper • 2503.02783 • Published 5 days ago • 5

upvoted a paper 5 days ago

IterPref: Focal Preference Learning for Code Generation via Iterative Debugging

Paper • 2503.02783 • Published 5 days ago • 5

commented a paper 5 days ago

IterPref: Focal Preference Learning for Code Generation via Iterative Debugging

Paper • 2503.02783 • Published 5 days ago • 5 •

liked a dataset 11 days ago

microsoft/Iter-DPO-Dataset

Updated 11 days ago • 18 • 4

upvoted a paper 12 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 12 days ago • 67

upvoted a paper 18 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 18 days ago • 157

liked a dataset 21 days ago

open-thoughts/OpenThoughts-114k

Viewer • Updated 18 days ago • 228k • 97.2k • 647

upvoted a paper about 1 month ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 44

liked a dataset about 1 month ago

cais/hle

Viewer • Updated 23 days ago • 2.7k • 7.26k • 270

liked a model about 2 months ago

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 14 days ago • 11.6k • 860

liked a dataset about 2 months ago

microsoft/EpiCoder-func-380k

Viewer • Updated 5 days ago • 380k • 100 • 11

authored 2 papers about 2 months ago

Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance

Paper • 2406.15330 • Published Jun 21, 2024

Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training

Paper • 2411.14318 • Published Nov 21, 2024

upvoted a paper about 2 months ago

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Paper • 2501.04694 • Published Jan 8 • 15

authored a paper about 2 months ago

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Paper • 2501.04694 • Published Jan 8 • 15

upvoted a paper about 2 months ago

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published Jan 8 • 50

liked a dataset 2 months ago

RLHFlow/Deepseek-PRM-Data

Viewer • Updated Nov 9, 2024 • 253k • 83 • 12

liked a model 2 months ago

deepseek-ai/DeepSeek-V3-Base

Updated 14 days ago • 765k • 1.59k

upvoted a paper 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 349

liked a Space 3 months ago

531

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute