Training Gemma on Domain-Specific Data by Unsupervised Learning for Specific Instructional Tasks
I aim to adapt Gemma to a domain-specific task through unsupervised learning, initially training it with my text corpus as both input and output. This step is intended to prepare Gemma for subsequent, more focused training on tasks relevant to my specialty. My primary concern is that this deviates from the conventional prompt-based training paradigm, which requires an 'instruction, input, and output' format. How should I proceed with unsupervised pre-training? Can such an approach effectively prime Gemma for improved performance on domain-specific questions?
Hi, sorry for the delay.
Yes, your proposed approach is a powerful and well-established technique known as continued pre-training (also called domain-adaptive pre-training). It's highly effective for preparing a general model like Gemma for specialized tasks. Your concern comes from conflating two distinct but complementary training stages: the unsupervised stage teaches the model new knowledge, while the instruction-based stage teaches it new skills. Thanks.
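The "text as both input and output" framing is quite literal in causal-LM continued pre-training: each training example's labels are simply a copy of its input ids, and the trainer shifts them internally so the model learns to predict the next token. Below is a minimal sketch of that data-preparation step. The toy whitespace tokenizer and the function name `make_lm_examples` are illustrative stand-ins, not part of any real API; in practice you would tokenize with Gemma's own tokenizer and feed the examples to a causal-LM trainer such as Hugging Face's `Trainer`.

```python
def make_lm_examples(token_ids, block_size):
    """Chunk a token stream into fixed-size causal-LM examples.

    For continued pre-training there is no separate target:
    the labels are the input ids themselves, and the training
    loop shifts them by one position to compute next-token loss.
    """
    examples = []
    for start in range(0, len(token_ids) - block_size + 1, block_size):
        block = token_ids[start:start + block_size]
        # Input and output are the same sequence, as in the question.
        examples.append({"input_ids": block, "labels": list(block)})
    return examples

# Toy whitespace "tokenizer" standing in for Gemma's real tokenizer.
corpus = "domain text goes here " * 8
vocab = {tok: i for i, tok in enumerate(sorted(set(corpus.split())))}
ids = [vocab[tok] for tok in corpus.split()]

examples = make_lm_examples(ids, block_size=8)
print(len(examples))                                      # number of blocks
print(examples[0]["input_ids"] == examples[0]["labels"])  # True
```

Once the corpus is packed this way, no 'instruction, input, output' template is needed; that format only enters later, during instruction fine-tuning on your task-specific examples.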