Jiaxin Qin's picture

6 2

Jiaxin Qin

JiaxinQin-cc

·

https://jiaxinqin0814.github.io/

JiaxinQin0814

AI & ML interests

Natural Language Processing Reinforcement Learning

Recent Activity

upvoted a paper 9 days ago

MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning

upvoted a paper 10 days ago

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

upvoted a paper 17 days ago

s3: You Don't Need That Much Data to Train a Search Agent via RL

View all activity

Organizations

None yet

JiaxinQin-cc's activity

upvoted a paper 9 days ago

MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning

Paper • 2505.24846 • Published 12 days ago • 15

upvoted a paper 10 days ago

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published 12 days ago • 90

upvoted 2 papers 17 days ago

s3: You Don't Need That Much Data to Train a Search Agent via RL

Paper • 2505.14146 • Published 23 days ago • 17

Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning

Paper • 2505.16270 • Published 21 days ago • 6

upvoted a paper 18 days ago

Time-R1: Towards Comprehensive Temporal Reasoning in LLMs

Paper • 2505.13508 • Published 27 days ago • 14

upvoted a paper about 1 month ago

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 76

liked a Space 9 months ago

Qwen2.5

Chat with Qwen, get text responses

liked a model about 1 year ago

JiaxinQin-cc/MiniGrid-DistShift1-v0

Updated Sep 15, 2023 • 1

updated a dataset over 1 year ago

JiaxinQin-cc/Offline-RL-MiniGrid

Preview • Updated Sep 16, 2023 • 9

updated a model over 1 year ago

JiaxinQin-cc/MiniGrid-DistShift1-v0

Updated Sep 15, 2023 • 1