Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
updated a dataset about 17 hours ago: rl-rag/gpt-oss-20b-eval-react-serper
published a dataset about 17 hours ago: rl-rag/gpt-oss-20b-eval-react-serper
updated a model 3 days ago: hamishivi/1708_miromind_8b_dpo_rl_rag__1__1755577229_step_100