Isadora White

izzcw

https://icwhite.github.io/website/

AI & ML interests

LLMs, Reinforcement Learning, agents, embodiment, multi-agent collaboration

Recent Activity

upvoted a paper 27 days ago

Steering Autoregressive Music Generation with Recursive Feature Machines

upvoted a paper 4 months ago

Group Sequence Policy Optimization

published a model 6 months ago

izzcw/dpo_model_3.1_8k

Papers 3

arxiv:2504.17950

arxiv:2408.04900

arxiv:2311.18232

Models 26

izzcw/dpo_model_3.1_8k

izzcw/qwen_large_crafting_sft_success

Text Generation • 2B • Updated Jun 1 • 3

izzcw/large_crafting_sft_success

Text Generation • 2B • Updated Jun 1

izzcw/trajectory_crafting_dpo_pairs

izzcw/trajectory_crafting_dpo_pairs.json

izzcw/llama_3.1_large_crafting_sft_success

Text Generation • 8B • Updated May 31 • 1

izzcw/llama_3b_crafting_sft_success_new_mem

Text Generation • 3B • Updated May 27

izzcw/mini_llama_crafting_sft_success_new_mem

Text Generation • 1B • Updated May 27 • 1

izzcw/cooking_sft_fail_new_mem

Text Generation • 8B • Updated May 24

izzcw/crafting_sft_fail_new_mem

Text Generation • 8B • Updated May 24 • 3

Datasets 6

izzcw/trajectory_crafting_dpo_pairs

Viewer • Updated Jun 1 • 244 • 24

izzcw/dpo_pairs_crafting_filtered

Viewer • Updated May 26 • 876 • 20

izzcw/cooking-filtered

Updated May 15 • 8

izzcw/minecollab_filtered_construction_data

Viewer • Updated Apr 28 • 9.23k • 38 • 1

izzcw/minecollab_filtered_crafting

Viewer • Updated Apr 28 • 3.57k • 11

izzcw/minecollab_filtered_cooking

Viewer • Updated Apr 28 • 3.98k • 22