Hoang Long Pham

HPLong1

AI & ML interests

None yet

Recent Activity

updated a collection about 2 hours ago

Papers read 2025

Organizations

None yet

HPLong1's activity

upvoted a paper about 2 hours ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 6 days ago • 56

upvoted a collection about 20 hours ago

RL+reason model

104 items • Updated 3 days ago • 4

upvoted a paper 13 days ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 18 days ago • 52

upvoted a paper 14 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 21 days ago • 61

upvoted a paper 15 days ago

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published 18 days ago • 76