Zixian Ma's picture

9 5 11

Zixian Ma

zixianma

·

AI & ML interests

Human-AI interaction and collaboration

Organizations

upvoted 2 papers 2 months ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 96

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 56

upvoted a collection 6 months ago

TACO Models

This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. • 3 items • Updated Apr 18 • 8

upvoted a collection 7 months ago

CoTA Datasets

This collection contains all versions of the CoTA (Chain-of-Thought-and-Action) datasets. • 5 items • Updated Apr 18 • 7

upvoted a paper 11 months ago

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Paper • 2408.07060 • Published Aug 13, 2024 • 43