Jian Hu

chuyi777

https://hujian.website

hijkzzz

AI & ML interests

Reinforcement Learning

Recent Activity

liked a dataset about 2 months ago

open-r1/OpenR1-Math-220k

liked a dataset about 2 months ago

open-thoughts/OpenThoughts-114k

chuyi777's activity

upvoted a paper 4 days ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published 5 days ago • 11

upvoted a paper about 2 months ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20 • 48

upvoted 2 papers 3 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 118

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 99

upvoted a paper 4 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 83

upvoted a paper 11 months ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 39