Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
updated
a model
about 19 hours ago
hamishivi/1003_rl_rag_dpo_8b_extra_dpo_lr_1e-6__42__1759823252
published
a model
about 19 hours ago
hamishivi/1003_rl_rag_dpo_8b_extra_dpo_lr_1e-6__42__1759823252
updated
a model
about 19 hours ago
hamishivi/1002_rl_rag_dpo_8b_lr_5e-6__42__1759750525