Fine Tuning Molmo

by chrishoertnagl - opened Sep 26, 2024

Sep 26, 2024

Really cool work!
Is it possible to futher fine-tune the model e.g. on an OCR dataset or something? Are there any notebooks or examples you could share?

chrisc36

Ai2 org Sep 26, 2024

Thank you! We will release full training code for these models but we do not have that ready yet. For now these models do support a forward pass and backpropagation so you can place them into a standard torch training loop yourself.

mavericklsd

Nov 14, 2024

Any update about releasing the fine tuning molmo script? thanks in advance

amanrangapur

Ai2 org Nov 14, 2024

Hey @mavericklsd , we are planning to release the code along with checkpoints and weights by the end of November.

delip

Nov 25, 2024

@amanrangapur don't want to be that guy, but can't help it. Is this released yet?

amanrangapur

Ai2 org Nov 25, 2024

Hey @delip , we're planning to release this week. Stay tuned.

delip

Nov 25, 2024

Thanks @amanrangapur and MolMo team!

scm-aiml

Nov 29, 2024

@amanrangapur I saw the Molmo dataset(s) were released. Is there still a plan to release the training code on GitHub as well this week?

amanrangapur

Ai2 org Nov 29, 2024

Hey @scm-aiml ! Since it’s the holidays here, we’ll be releasing next week!

amanrangapur

Ai2 org Dec 5, 2024

•

edited Dec 18, 2024

Hey @scm-aiml @delip , the wait is over: https://github.com/allenai/molmo

syazvinski

Dec 18, 2024

Any update on a fine-tuning script? Ive been patiently waiting quite some time. Updates would be appreciated. 🙃

amanrangapur

Ai2 org Dec 18, 2024

•

edited Dec 18, 2024

Hey @syazvinski , We are not releasing any custom fine-tuning script until next month. I will keep you posted if there is anything around.

chisarie

Jan 13

Hi @amanrangapur , thank you for the great work. Do you know if there is any timeline for the release of a fine-tuning script? Just to know whether I should wait, or find an alternative approach/model for my project. Thank you!

amanrangapur

Ai2 org Jan 13

Hey @chisarie , we have released bunch of training code and stuff on GitHub, and you can fine-tune the model on custom data, you probably need to work on Data Loaders and other things.. As of now we're not on releasing anything like custom fine-tuning script.

chisarie

Jan 14

All right, thank you for the prompt response!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment