Liu's picture

4 11 7

Liu

Zuxin

·

https://www.zuxin.me

liuzuxin

AI & ML interests

Reinforcement learning, imitation learning

Recent Activity

liked a dataset 25 days ago

pandalla/Machine_Mindset_MBTI_dataset

upvoted a collection 3 months ago

upvoted a paper 3 months ago

APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay

View all activity

Organizations

authored a paper 8 months ago

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published Nov 6, 2024 • 36

authored 3 papers 11 months ago

Learning from Sparse Offline Datasets via Conservative Density Estimation

Paper • 2401.08819 • Published Jan 16, 2024

MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

Paper • 2406.10290 • Published Jun 12, 2024

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Paper • 2408.07060 • Published Aug 13, 2024 • 43

authored a paper about 1 year ago

APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

Paper • 2406.18518 • Published Jun 26, 2024 • 25

authored 4 papers over 1 year ago

Learning Shared Safety Constraints from Multi-task Demonstrations

Paper • 2309.00711 • Published Sep 1, 2023

TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models

Paper • 2310.05905 • Published Oct 9, 2023 • 2

AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System

Paper • 2402.15538 • Published Feb 23, 2024 • 6

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

Paper • 2402.15506 • Published Feb 23, 2024 • 17

authored a paper almost 2 years ago

Constrained Decision Transformer for Offline Safe Reinforcement Learning

Paper • 2302.07351 • Published Feb 14, 2023