Zhaolin Gao's picture

2 1 6

Zhaolin Gao

GitBag

·

https://zhaolingao.github.io/

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

updated a model 2 days ago

GitBag/reasoning_rebel_uf_dp_1k3k_from1735956551_rfst_eta_1e4_lr_3e-7_1738016708

published a model 2 days ago

GitBag/reasoning_rebel_uf_dp_1k3k_from1735956551_rfst_eta_1e4_lr_3e-7_1738016708

updated a model 2 days ago

GitBag/reasoning_rebel_uf_dp_1k3k_from1735956551_rfst_eta_1e2_lr_3e-7_1737991767

View all activity

Articles

RLHF 101: A Technical Dive into RLHF

Organizations

GitBag's activity

New activity in GitBag/multiturn_1_4 5 months ago

Dataset Viewer issue: ResponseNotFound

#1 opened 5 months ago by

New activity in Cornell-AGI/REBEL-Llama-3-epoch_2 8 months ago

model weights

#1 opened 8 months ago by