Xiaoqi Jian
mx1024
ยท
AI & ML interests
None yet
Recent Activity
liked
a model
14 days ago
miromind-ai/MiroThinker-32B-DPO-v0.2
upvoted
a
paper
19 days ago
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn
Tool-Integrated Reasoning
authored
a paper
4 months ago
Stress Testing Generalization: How Minor Modifications Undermine Large
Language Model Performance