Hugging Face
Kai Yang
yangkaiSIGS
1 follower · 1 following
https://yk7333.github.io/
yk7333
AI & ML interests
None yet
Recent Activity
authored a paper 2 days ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
upvoted a paper 3 days ago
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
upvoted a paper 3 days ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
Papers (11)
arxiv:2602.12125
arxiv:2511.15248
arxiv:2509.26226
arxiv:2505.11044
Spaces (1)
Entropic 📉 (Running): Display research findings on EntroPIC for LLM training
Models (1)
yangkaiSIGS/EntroPIC-Nemotron-1.5b • Text Generation • 2B • Updated Feb 3 • 3 • 1
Datasets (1)
yangkaiSIGS/d3po_datasets • Viewer • Updated Mar 19, 2024 • 1.2k • 23 • 5