13 12 9

Hyungjoo Chae

hyungjoochae

https://hyungjoo-homepage.netlify.app/

kyle8581

AI & ML interests

Commonsense Reasoning, LLM

Recent Activity

authored a paper 8 days ago

One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL

upvoted a paper 8 days ago

One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL

commented on a paper 8 days ago

One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL

View all activity

Organizations

hyungjoochae's activity

authored a paper 8 days ago

One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL

Paper • 2506.02338 • Published 9 days ago • 4

upvoted a paper 8 days ago

One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL

Paper • 2506.02338 • Published 9 days ago • 4

commented a paper 8 days ago

One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL

Paper • 2506.02338 • Published 9 days ago • 4 •

upvoted 2 papers 16 days ago

Interleaved Reasoning for Large Language Models via Reinforcement Learning

Paper • 2505.19640 • Published 17 days ago • 12

Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance

Paper • 2505.16348 • Published 21 days ago • 46

authored 4 papers 21 days ago

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Paper • 2406.05761 • Published Jun 9, 2024 • 3

Evaluating Robustness of Reward Models for Mathematical Reasoning

Paper • 2410.01729 • Published Oct 2, 2024

Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics

Paper • 2406.14703 • Published Jun 20, 2024 • 2

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published 22 days ago • 99

upvoted 2 papers 21 days ago

RLVR-World: Training World Models with Reinforcement Learning

Paper • 2505.13934 • Published 23 days ago • 14

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published 22 days ago • 99

updated 2 datasets 21 days ago

LangAGI-Lab/WebRewardBench

Viewer • Updated 21 days ago • 776 • 78

LangAGI-Lab/WebPRMCollection_preference_pair

Viewer • Updated 21 days ago • 9.46k • 212

updated 2 models 21 days ago

LangAGI-Lab/WebShepherd_3B

Feature Extraction • Updated 21 days ago • 80 • 1

LangAGI-Lab/WebShepherd_8B

Feature Extraction • Updated 21 days ago • 25 • 4

updated a collection 21 days ago

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Collection

7 items • Updated 21 days ago • 3

commented a paper 21 days ago

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Paper • 2505.15277 • Published 22 days ago • 99 •

updated a Space 22 days ago

Web Shepherd Demo

😻

Display a loading screen with Hugging Face logo

New activity in hyungjoochae/Web-Shepherd-Demo 23 days ago

update

#2 opened 23 days ago by

iruno

published a Space 24 days ago

Web Shepherd Demo

😻

Display a loading screen with Hugging Face logo