8 42 72

Long(Tony) Lian

longlian

https://tonylian.com/

TonyLianLong

AI & ML interests

None yet

Recent Activity

liked a model about 2 hours ago

MiniMaxAI/MiniMax-VL-01

upvoted a paper 1 day ago

Teaching Large Language Models to Reason with Reinforcement Learning

liked a model 1 day ago

nvidia/Nemotron-4-340B-Reward

View all activity

Organizations

longlian's activity

upvoted a paper 1 day ago

Teaching Large Language Models to Reason with Reinforcement Learning

Paper • 2403.04642 • Published Mar 7, 2024 • 50

upvoted a paper 4 days ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6 • 93

upvoted a paper 5 days ago

Self-Steering Language Models

Paper • 2504.07081 • Published 7 days ago • 15

upvoted a paper 27 days ago

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published 28 days ago • 44

upvoted a paper 28 days ago

Atlas: Multi-Scale Attention Improves Long Context Image Modeling

Paper • 2503.12355 • Published Mar 16 • 11

upvoted a paper about 2 months ago

Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26 • 48

upvoted a paper 2 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 49

upvoted 3 papers 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 381

TransPixar: Advancing Text-to-Video Generation with Transparency

Paper • 2501.03006 • Published Jan 6 • 27

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published Dec 30, 2024 • 23

upvoted a paper 4 months ago

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published Dec 23, 2024 • 33

upvoted 7 papers 6 months ago

upvoted 2 papers 7 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 140

Language Models Learn to Mislead Humans via RLHF

Paper • 2409.12822 • Published Sep 19, 2024 • 10