Fine-tuning the LLM backbone

#76
by antogrk - opened

I'm working on a text-only task (which will ultimately be expanded to a multimodal task in the future). I was wondering if it's possible to fine-tune only the language model that serves as the backbone of the model, using only textual data. Also, is it possible to apply LoRA to the LLM and train only its linear layers?
It would be very helpful if you could provide a very basic script showing how the fine-tuning can be done for a causal language modeling task.
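For reference, here is a minimal sketch of what that could look like with Hugging Face `transformers` and `peft`. It is not verified against this specific model: the checkpoint name is a placeholder, and the LoRA `target_modules` names are assumptions that you would need to confirm against the actual module names (e.g. via `model.named_modules()`).

```python
# Minimal sketch: LoRA fine-tuning of an LLM backbone on text-only data.
# Assumptions (not from the original thread): the model loads via
# AutoModelForCausalLM, and its attention projections are named q_proj/k_proj/
# v_proj/o_proj. Adjust both for your architecture.
import torch
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_id = "your-org/your-model"  # hypothetical placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

# Apply LoRA only to the linear projection layers; all other weights stay frozen.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed names
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # sanity-check: only adapters are trainable

# Text-only dataset; one raw-text example per line in train.txt.
dataset = load_dataset("text", data_files={"train": "train.txt"})["train"]

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

dataset = dataset.map(tokenize, batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="lora-backbone",
        per_device_train_batch_size=4,
        num_train_epochs=1,
        learning_rate=2e-4,
        logging_steps=10,
        bf16=True,
    ),
    train_dataset=dataset,
    # mlm=False gives standard causal-LM labels (inputs shifted by one).
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

If the checkpoint is a multimodal wrapper rather than a plain causal LM, the same idea applies, but you would load the full model and point `target_modules` only at modules inside the text decoder, so the vision components remain untouched.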

