Qwen models used in school of reward hacks
James Chua
thejaminator
AI & ML interests
None yet
Recent Activity
updated
a model
about 3 hours ago
thejaminator/12sep_grp16_1e5_lr-step-60
published
a model
about 3 hours ago
thejaminator/12sep_grp16_1e5_lr-step-60
updated
a model
2 days ago
thejaminator/checkpoints_multiple_datasets_layer_1_decoder-fixed