Qwen models used in school of reward hacks
James Chua
thejaminator
AI & ML interests
None yet
Recent Activity
updated
a model
1 day ago
thejaminator/12sep_grp16_1e5_lr-step-60
published
a model
1 day ago
thejaminator/12sep_grp16_1e5_lr-step-60
updated
a model
3 days ago
thejaminator/checkpoints_multiple_datasets_layer_1_decoder-fixed