Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
1
6
Zhaolin Gao
GitBag
Follow
dark-pen's profile picture
vinhnx90's profile picture
kirankc's profile picture
3 followers
·
0 following
https://zhaolingao.github.io/
AI & ML interests
Reinforcement Learning from Human Feedback
Recent Activity
updated
a model
2 days ago
GitBag/reasoning_rebel_uf_dp_1k3k_from1735956551_rfst_eta_1e4_lr_3e-7_1738016708
published
a model
2 days ago
GitBag/reasoning_rebel_uf_dp_1k3k_from1735956551_rfst_eta_1e4_lr_3e-7_1738016708
updated
a model
2 days ago
GitBag/reasoning_rebel_uf_dp_1k3k_from1735956551_rfst_eta_1e2_lr_3e-7_1737991767
View all activity
Articles
RLHF 101: A Technical Dive into RLHF
Dec 11, 2024
•
5
Organizations
GitBag
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
GitBag/multiturn_1_4
5 months ago
Dataset Viewer issue: ResponseNotFound
1
#1 opened 5 months ago by
GitBag
New activity in
Cornell-AGI/REBEL-Llama-3-epoch_2
8 months ago
model weights
1
#1 opened 8 months ago by
maldv