TY.Zheng

aaabiao

https://scholar.google.com/citations?user=Vq-VZnUAAAAJ&hl=zh-CN

Zheng0428

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

upvoted a paper 2 months ago

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

liked a model 2 months ago

IQuestLab/IQuest-Coder-V1-40B-Base

Organizations

upvoted a paper 7 days ago

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

Paper • 2602.22675 • Published 8 days ago • 21

upvoted a paper 2 months ago

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Paper • 2512.24617 • Published Dec 31, 2025 • 65

upvoted 2 papers 3 months ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 156

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 300

upvoted 3 papers 5 months ago

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures

Paper • 2510.14616 • Published Oct 16, 2025 • 13

COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes

Paper • 2510.14763 • Published Oct 16, 2025 • 14

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published Sep 30, 2025 • 48

upvoted 2 papers 6 months ago

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Paper • 2509.07969 • Published Sep 9, 2025 • 59

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 149

upvoted a collection 6 months ago

Code Synthetic RL Rollout

Collection

5 items • Updated Sep 23, 2025 • 1

upvoted a paper 6 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24, 2025 • 80

upvoted a paper 7 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 158

upvoted 3 papers 8 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9, 2025 • 24

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8, 2025 • 93

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1, 2025 • 79

upvoted a paper 11 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7, 2025 • 44

upvoted 2 papers 12 months ago

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published Mar 24, 2025 • 31

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11, 2025 • 72

upvoted 2 papers about 1 year ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 108

Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM

Paper • 2502.06635 • Published Feb 10, 2025 • 6
