Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
updated
a dataset
about 4 hours ago
hamishivi/qwen2.5_orz_upload
published
a dataset
about 4 hours ago
hamishivi/qwen2.5_orz_upload
published
a model
about 4 hours ago
hamishivi/qwen2.5_orz_upload
Organizations
models
35

hamishivi/qwen2.5_orz_upload
Updated

hamishivi/s1k_seq_orig_hyper__42__1740446762
Updated
•
2

hamishivi/tulu_3_long_finetune_qwen_7b_reg_system_prompt
Updated
•
4

hamishivi/tulu-2-wildchat-326k-sft
Updated

hamishivi/tulu-2-arena-hard-326k-sft
Updated
•
3

hamishivi/llama-3.1-tulu-3-arena-hard-939k-sft
Updated
•
4

hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft
Updated
•
6

hamishivi/tulu-2-multitask-rrmax-326k-sft
Updated
•
3

hamishivi/qwen2_math_tokenizer_tweaked
Updated

hamishivi/0224_jupiter_hamish_grpo_tulu3_s1k_orz_30350
Updated
datasets
93
hamishivi/qwen2.5_orz_upload
Viewer
•
Updated
•
20.4k
hamishivi/open_scholar_rl_no_prompt
Viewer
•
Updated
•
60.2k
•
23
hamishivi/open_scholar_rl
Viewer
•
Updated
•
60.2k
•
36
hamishivi/tulu_3_rewritten_400k_string_f1_only_v2
Viewer
•
Updated
•
264k
•
58
hamishivi/tulu_3_rewritten_400k_string_f1_only
Viewer
•
Updated
•
264k
•
61
hamishivi/o3_generations_big_rl
Viewer
•
Updated
•
258k
•
65
hamishivi/combined_o3_val_data_1
Viewer
•
Updated
•
9.25k
•
61
hamishivi/o3-test
Viewer
•
Updated
•
99
•
51
hamishivi/combined_o3_val_data_1sample
Viewer
•
Updated
•
2.46M
•
132
hamishivi/combined_o3_val_data
Viewer
•
Updated
•
12.3M
•
68