2 11

xts

xtsssss

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

upvoted a paper 2 days ago

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

upvoted a paper 3 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

View all activity

Organizations

upvoted 2 papers 2 days ago

MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

Paper • 2508.13186 • Published 9 days ago • 15

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

Paper • 2508.11987 • Published 7 days ago • 54

upvoted a paper 3 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published 17 days ago • 100

upvoted a paper 11 days ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published 12 days ago • 103

upvoted 2 papers 16 days ago

Efficient Agents: Building Effective Agents While Reducing Cost

Paper • 2508.02694 • Published 30 days ago • 81

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published 17 days ago • 138

upvoted a paper 17 days ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published 19 days ago • 126

updated a model about 1 month ago

FR3E-Bytedance/FR3E-32B

33B • Updated Jul 10 • 9 • 1

authored 3 papers about 1 month ago

published a model about 1 month ago

FR3E-Bytedance/FR3E-32B

33B • Updated Jul 10 • 9 • 1

updated a model about 1 month ago

FR3E-Bytedance/FR3E-7B

8B • Updated Jul 10 • 9

published a model about 1 month ago

FR3E-Bytedance/FR3E-7B

8B • Updated Jul 10 • 9

updated a model about 1 month ago

FR3E-Bytedance/FR3E-Math-7B

8B • Updated Jul 10 • 9

published a model about 1 month ago

FR3E-Bytedance/FR3E-Math-7B

8B • Updated Jul 10 • 9

upvoted a paper about 1 month ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

upvoted a paper 4 months ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Paper • 2504.15415 • Published Apr 21 • 22

New activity in O1-OPEN/OpenO1-SFT 4 months ago

Update task category to text-generation, add link to paper and code

#16 opened 4 months ago by

nielsr

updated a model 5 months ago

xtsssss/xrpo_qwen7B_openr1

Updated Mar 18 • 1

xts

AI & ML interests

Recent Activity

Organizations

xtsssss's activity

Update task category to text-generation, add link to paper and code