Tianduo Wang's picture

3 23 4

Tianduo Wang

Tianduo

·

TianduoWang

AI & ML interests

nlp, representation learning

Recent Activity

upvoted a paper about 1 month ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

upvoted a paper about 2 months ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

upvoted a paper about 2 months ago

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

View all activity

Organizations

upvoted a paper about 1 month ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published Dec 19, 2025 • 51

upvoted 2 papers about 2 months ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published Dec 17, 2025 • 65

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44

upvoted 2 papers 7 months ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16, 2025 • 43

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25, 2025 • 64

upvoted 6 papers 8 months ago

Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Paper • 2506.19290 • Published Jun 24, 2025 • 53

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23, 2025 • 56

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

On-Policy RL with Optimal Reward Baseline

Paper • 2505.23585 • Published May 29, 2025 • 14

Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering

Paper • 2505.23604 • Published May 29, 2025 • 23

From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition

Paper • 2505.16972 • Published May 22, 2025 • 9

upvoted a paper 11 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 144

upvoted a paper 12 months ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6, 2025 • 51

upvoted 7 papers over 1 year ago

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published Oct 16, 2024 • 30

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 72

Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent

Paper • 2407.21646 • Published Jul 31, 2024 • 18

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25, 2024 • 33

NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?

Paper • 2407.11963 • Published Jul 16, 2024 • 44

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 168

Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs

Paper • 2406.18629 • Published Jun 26, 2024 • 42