Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
Alexander Bukharin
alexwb
Follow
0 followers
·
1 following
AI & ML interests
None yet
Organizations
Papers
2
arxiv:
2410.01257
arxiv:
2306.03109
models
8
Sort: Recently updated
alexwb/reward_modeling_anthropic_hh_rm1e-3
0.3B
•
Updated
Aug 7, 2024
•
10
alexwb/reward_modeling_anthropic_hh_rm1e-4
0.3B
•
Updated
Aug 7, 2024
•
8
alexwb/reward_modeling_anthropic_hh_rm1.4e-5
0.3B
•
Updated
Aug 4, 2024
•
6
alexwb/reward_modeling_anthropic_hh_rm1e-6
0.3B
•
Updated
Aug 3, 2024
•
6
alexwb/reward_modeling_anthropic_hh_rm0.99
0.3B
•
Updated
Aug 2, 2024
•
11
alexwb/reward_modeling_anthropic_hh_rm0.9_lr5e-5
0.3B
•
Updated
Aug 2, 2024
•
6
alexwb/reward_modeling_anthropic_hh
Text Classification
•
0.3B
•
Updated
Aug 1, 2024
•
20
alexwb/sft_trl_test
Updated
May 15, 2024
•
21
datasets
0
None public yet