Yang Chen's picture

Yang Chen

ychenNLP

·

https://edchengg.github.io/

AI & ML interests

NLP

Recent Activity

new activity 3 days ago

nvidia/Nemotron-Cascade-2-30B-A3B:187 tok/s on RTX 3090, 625K Context, Agent Coding (IQ4_XS + Hermes Agent)

new activity 4 days ago

nvidia/Nemotron-Cascade-2-30B-A3B:Official quantizations?

new activity 4 days ago

nvidia/Nemotron-Cascade-2-30B-A3B:no quants working

View all activity

Organizations

authored a paper 8 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 12 days ago • 63

authored a paper 3 months ago

Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Paper • 2512.13607 • Published Dec 15, 2025 • 37

authored 2 papers 10 months ago

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16, 2025 • 25

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22, 2025 • 36

authored a paper over 1 year ago

AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling

Paper • 2412.15084 • Published Dec 19, 2024 • 13

authored 4 papers over 2 years ago

UniIR: Training and Benchmarking Universal Multimodal Information Retrievers

Paper • 2311.17136 • Published Nov 28, 2023 • 8

Can Language Models be Instructed to Protect Personal Information?

Paper • 2310.02224 • Published Oct 3, 2023 • 1

Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia Entities

Paper • 2302.11154 • Published Feb 22, 2023 • 1

Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?

Paper • 2302.11713 • Published Feb 23, 2023 • 1