jasonjiang

mikinyaa · jasonjiang8866

AI & ML interests

None yet

Organizations

None yet

upvoted an article 2 days ago

Article

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

nvidia • 3 days ago • 37

upvoted 2 papers 23 days ago

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 25 days ago • 111

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published 25 days ago • 86

upvoted a paper 24 days ago

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Paper • 2605.13724 • Published 26 days ago • 101

upvoted 2 papers 25 days ago

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Paper • 2605.10899 • Published 28 days ago • 78

δ-mem: Efficient Online Memory for Large Language Models

Paper • 2605.12357 • Published 27 days ago • 126

upvoted a paper 26 days ago

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published 28 days ago • 110

upvoted a paper 29 days ago

Audio-Visual Intelligence in Large Foundation Models

Paper • 2605.04045 • Published May 5 • 35

upvoted 12 papers about 1 month ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published May 1 • 49

OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models

Paper • 2605.00877 • Published Apr 25 • 15

Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction

Paper • 2604.27221 • Published Apr 29 • 39

Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing

Paper • 2604.22782 • Published Apr 3 • 8

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published Apr 30 • 42

Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation

Paper • 2604.25819 • Published Apr 28 • 17

SketchVLM: Vision language models can annotate images to explain thoughts and guide users

Paper • 2604.22875 • Published Apr 23 • 37

dWorldEval: Scalable Robotic Policy Evaluation via Discrete Diffusion World Model

Paper • 2604.22152 • Published Apr 24 • 5

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 275

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published Apr 15 • 32

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published Apr 24 • 227

EvoMaster: A Foundational Agent Framework for Building Evolving Autonomous Scientific Agents at Scale

Paper • 2604.17406 • Published Apr 19 • 6