Yuquan Xie

xieyuquan

xieyuquanxx

AI & ML interests

LLM, multi-modal

Organizations

Collections 5

Papers 5

spaces 1

Werewolf Agent Example

🚀

Create and compete with AI Agents in Werewolf and Spy games

models 3

datasets 3

xieyuquan/AppApksForAndroid

Updated Dec 3, 2025 • 3

xieyuquan/google_apps_step3000_historyimageFalse_uitars_actionspace

Viewer • Updated Mar 18, 2025 • 3k • 86 • 1

xieyuquan/google_apps_step3000_historyimageFalse

Viewer • Updated Mar 18, 2025 • 3k • 5

Collections 5

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Understanding and Diagnosing Deep Reinforcement Learning

A Simple and Effective L_2 Norm-Based Strategy for KV Cache Compression

VoCo-LLaMA: Towards Vision Compression with Large Language Models

models 3

xieyuquan/Optimus3-Policy

xieyuquan/Optimus3-Task-Router

xieyuquan/Optimus3-32B-SFT
