arxiv:2501.04682
Anikait Singh
Asap7772
AI & ML interests: Deep Learning, Reinforcement Learning, Robotics
Recent Activity
Updated a dataset about 8 hours ago: Asap7772/contextual_attack_prompts
Published a dataset about 8 hours ago: Asap7772/contextual_attack_prompts
Updated a dataset about 14 hours ago: Asap7772/wildjailbreak_llamagen_safety_score
models
18
Asap7772/prm_datamath-mc-full_objbce_lr5e-06_epoch0 • Text Generation • 8
Asap7772/prm_datamath-mc-full_objbce_lr1e-07_epoch0 • Text Generation • 1
Asap7772/prm_datamath-mc-full_objbce_lr1e-06_epoch0 • Text Generation • 5
Asap7772/prm_datamath-mc-full_objbce_lr5e-05_epoch0 • Text Generation • 8
Asap7772/prm_datamath-mc-full_objbce_lr1e-05_epoch0 • Text Generation • 7
Asap7772/prm_datamath-mc-full_objbce_lr5e-07_epoch0 • Text Generation • 1
Asap7772/prm_datamath-mc-full_objbce_lr0.0005_epoch0 • Text Generation • 3
Asap7772/prm_datamath-mc-full_objbce_lr5e-06_checkpoint2400
Asap7772/prm_datamath-mc-full_objbce_lr5e-05_checkpoint2400
Asap7772/prm_datamath-mc-full_objbce_lr1e-05_checkpoint2400
datasets
730
Asap7772/contextual_attack_prompts • 8.42k
Asap7772/wildjailbreak_llamagen_safety_score • 262k • 2
Asap7772/cosafe_all_rollouts_mistral_guardgen • 749
Asap7772/cosafe_all_rollouts_mistral • 750
Asap7772/wildjailbreak_llamagen • 262k • 14
Asap7772/cosafe_all_rollouts_guardgen • 1.8k • 1
Asap7772/cosafe_all_rollouts • 1.8k • 5
Asap7772/review_eval_8shot_infbase_winrate_gpt-4o-mini_pref_train • 2.43k • 7
Asap7772/review_eval_4shot_infbase_winrate_gpt-4o-mini_pref_train • 2.25k • 7
Asap7772/review_eval_8shot_infipo0.05_winrate_gpt-4o-mini_pref_train • 2.43k • 7