Collection of datasets and models for our paper "Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas"
Nishant Balepur
nbalepur
AI & ML interests: NLP
Recent Activity
updated a dataset 11 days ago: nbalepur/MCQA_IWF
published a dataset 11 days ago: nbalepur/MCQA_IWF
updated a model 12 days ago: nbalepur/google-query-wellformedness-distilroberta
Collections: 2
Models: 9
nbalepur/google-query-wellformedness-distilroberta • Text Classification • 62
nbalepur/Llama-3.1-8B-PT-DPO-HHH
nbalepur/Llama-3.1-8B-PT-DPO-Mnemonic
nbalepur/Llama-3.1-8B-PT-DPO-BeaverTails • Text Generation • 16
nbalepur/Llama-3.1-8B_copy_persona_False_Mnemonic_dpo_chosen • Text Generation • 19
nbalepur/Llama-3.1-8B_copy_persona_False_Safe_RLHF_dpo_chosen • Text Generation • 14
nbalepur/LLama-2-70b-Mnemonic-Tokenizer
nbalepur/LLama-2-70b-Mnemonic-SFT • Text Generation • 46
nbalepur/LLama-2-70b-Mnemonic-DPO • Text Generation • 33
Datasets: 101
nbalepur/MCQA_IWF • 217 • 146
nbalepur/google-query-wellformedness • 25.1k • 75
nbalepur/BenchBench_test • 1.19k • 80
nbalepur/cheating-reasoners • 9.39k • 521
nbalepur/planorama_irt_swap_oneslope • 300 • 9
nbalepur/planorama_without_label_swap_fixed2 • 300 • 9
nbalepur/planorama_irt_swap_newslope • 300 • 11
nbalepur/planorama_without_label_swap_fixed • 300 • 10
nbalepur/planorama_irt_swap2 • 300 • 8
nbalepur/planorama_irt_swap • 300 • 11