10 17 28

Yang Chen

ychenNLP

https://edchengg.github.io/

AI & ML interests

NLP

Recent Activity

new activity 20 days ago

nvidia/AceReason-Nemotron-14B:Is it possible to open-source the 2k+ difficult samples from math stage3 separately, as well as the code training data?

new activity about 1 month ago

nvidia/AceReason-Nemotron-14B:Add link to paper and project page

new activity about 1 month ago

nvidia/AceReason-Math:Add task category and link to new model paper

View all activity

Organizations

New activity in nvidia/AceReason-Nemotron-14B 20 days ago

Is it possible to open-source the 2k+ difficult samples from math stage3 separately, as well as the code training data?

#2 opened 27 days ago by

Suu

New activity in nvidia/AceReason-Nemotron-14B about 1 month ago

Add link to paper and project page

#1 opened about 1 month ago by

nielsr

New activity in nvidia/AceReason-Math about 1 month ago

Add task category and link to new model paper

#1 opened about 1 month ago by

nielsr

commented a paper about 1 month ago

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16 • 24 •

commented a paper 2 months ago

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22 • 33 •

commented a paper 7 months ago

AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling

Paper • 2412.15084 • Published Dec 19, 2024 • 13 •

New activity in Qwen/Qwen2.5-Math-RM-72B 10 months ago

NameError: name '_flash_attention_forward' is not defined

➕ 18

#5 opened 10 months ago by

ychenNLP

New activity in google/gemma-7b about 1 year ago

8-bit precision error

#32 opened over 1 year ago by

saireddy

New activity in Salesforce/instructblip-flan-t5-xl about 2 years ago

is there any vqa fine-tuning script i can borrow?

#2 opened about 2 years ago by

ychenNLP

New activity in ychenNLP/arabic-relation-extraction about 3 years ago

Fix `language` tag

#1 opened about 3 years ago by

julien-c

Yang Chen

AI & ML interests

Recent Activity

Organizations

ychenNLP's activity

Is it possible to open-source the 2k+ difficult samples from math stage3 separately, as well as the code training data?

Add link to paper and project page

Add task category and link to new model paper

NameError: name '_flash_attention_forward' is not defined

8-bit precision error

is there any vqa fine-tuning script i can borrow?

Fix `language` tag