Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
33
152
43
KABI
dongguanting
Follow
samusenps's profile picture
TingchenFu's profile picture
asusevski's profile picture
47 followers
·
85 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
paper
12 days ago
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
upvoted
a
paper
12 days ago
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
upvoted
a
paper
12 days ago
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
View all activity
Organizations
dongguanting
's datasets
11
Sort: Recently updated
dongguanting/ARPO-SFT-54K
Viewer
•
Updated
Aug 12
•
54.6k
•
390
•
9
dongguanting/ARPO-RL-Reasoning-10K
Viewer
•
Updated
Aug 12
•
10k
•
170
•
3
dongguanting/ARPO-RL-DeepSearch-1K
Viewer
•
Updated
Jul 29
•
1.07k
•
130
•
4
dongguanting/RAG-Error-Critic-100K
Viewer
•
Updated
Jun 28
•
100k
•
22
•
2
dongguanting/Tool-Star-SFT-54K
Viewer
•
Updated
May 29
•
54k
•
297
•
8
dongguanting/Multi-Tool-RL-10K
Viewer
•
Updated
May 25
•
10k
•
130
•
4
dongguanting/RAG-QA-40K
Viewer
•
Updated
Dec 27, 2024
•
32.8k
•
19
•
2
dongguanting/ShareGPT-12K
Viewer
•
Updated
Dec 27, 2024
•
12.9k
•
28
•
1
dongguanting/VIF-RAG-QA-110K
Viewer
•
Updated
Dec 27, 2024
•
111k
•
48
•
7
dongguanting/DotamathQA
Viewer
•
Updated
Dec 26, 2024
•
574k
•
60
•
2
dongguanting/VIF-RAG-QA-20K
Viewer
•
Updated
Nov 1, 2024
•
20k
•
7
•
4