jdzw2014
jdzw2014
AI & ML interests
None yet
Recent Activity
liked
a dataset
about 2 hours ago
m-a-p/Writing-Preference-Bench
commented on
a paper
5 months ago
How Much Backtracking is Enough? Exploring the Interplay of SFT and RL
in Enhancing LLM Reasoning