AceCoder

community

https://jdf-prog.github.io/

DongfuJiang

jdf-prog

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

JasperHaozhe authored a paper 12 days ago

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

JasperHaozhe authored a paper 12 days ago

Reverse-Engineered Reasoning for Open-Ended Generation

JasperHaozhe authored a paper 12 days ago

VideoScore2: Think before You Score in Generative Video Evaluation

View all activity

JasperHaozhe

authored 13 papers 12 days ago

Dr. Bench: A Multidimensional Evaluation for Deep Research Agents, from Answers to Reports

Paper • 2510.02190 • Published Jan 29 • 19

Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing

Paper • 2510.15349 • Published Oct 17, 2025

TR-DQ: Time-Rotation Diffusion Quantization

Paper • 2503.06564 • Published Mar 9, 2025

From Illusion to Intention: Visual Rationale Learning for Vision-Language Reasoning

Paper • 2511.23031 • Published Nov 28, 2025 • 1

CogDoc: Towards Unified thinking in Documents

Paper • 2512.12658 • Published Dec 14, 2025

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published Jan 22 • 92

Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining

Paper • 2603.11103 • Published Mar 11 • 9

SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding

Paper • 2603.16124 • Published Mar 17 • 3

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published about 1 month ago • 145

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published 16 days ago • 100

JasperHaozhe

submitted a paper to Daily Papers 13 days ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published 16 days ago • 100

DongfuJiang

authored 3 papers about 1 month ago

EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning

Paper • 2603.12698 • Published Mar 13 • 1

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published Mar 19 • 66

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published Mar 17 • 96

chiruan

updated a model 4 months ago

CodeDPO/filtered_original_acecoderv3

Updated Jan 2

chiruan

published a dataset 5 months ago

CodeDPO/filtered_original_acecoderv3_unused

Viewer • Updated Dec 4, 2025 • 29k • 9

chiruan

updated a dataset 5 months ago

CodeDPO/filtered_original_acecoderv3_unused

Viewer • Updated Dec 4, 2025 • 29k • 9

AI & ML interests

Recent Activity

Team members 4

CodeDPO's activity