Zhenran Xu's picture

Zhenran Xu

imryanxu

·

AI & ML interests

fishing in lab while working on language agents

Recent Activity

upvoted a paper 25 days ago

LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding

upvoted a paper 2 months ago

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

upvoted a paper 2 months ago

Step-GUI Technical Report

View all activity

Organizations

upvoted a paper 25 days ago

LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding

Paper • 2602.04541 • Published 26 days ago • 8

upvoted 2 papers 2 months ago

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Paper • 2512.13507 • Published Dec 15, 2025 • 40

Step-GUI Technical Report

Paper • 2512.15431 • Published Dec 17, 2025 • 132

upvoted a paper 3 months ago

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Paper • 2511.12609 • Published Nov 16, 2025 • 105

upvoted 5 papers 4 months ago

DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking

Paper • 2510.20168 • Published Oct 23, 2025 • 28

HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application

Paper • 2510.19631 • Published Oct 22, 2025 • 28

FineVision: Open Data Is All You Need

Paper • 2510.17269 • Published Oct 20, 2025 • 75

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper • 2510.16872 • Published Oct 19, 2025 • 109

Watch and Learn: Learning to Use Computers from Online Videos

Paper • 2510.04673 • Published Oct 6, 2025 • 12

upvoted 3 papers 5 months ago

UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

Paper • 2510.13344 • Published Oct 15, 2025 • 63

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18, 2025 • 111

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15, 2025 • 49

upvoted 5 papers 6 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 196

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28, 2025 • 117

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 232

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 84

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 125

upvoted a paper 8 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 251

upvoted 2 papers 9 months ago

AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation

Paper • 2506.10540 • Published Jun 12, 2025 • 37

ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Paper • 2506.09790 • Published Jun 11, 2025 • 53