about num_classes of emb_extractor
Hi, thanks for the great tool!
I fine-tuned Geneformer using this setup:
model = CustomSequenceClassification.from_pretrained(pretrain_model_path, num_labels=1, problem_type="regression",...).to("cuda")
i.e., I set num_labels=1 because this is a regression scenario.
When I extract embeddings with EmbExtractor from the fine-tuned model, how should I set the num_classes parameter? Is setting num_classes=1 OK?
Thanks for your question! You would need to check whether the model loads properly as one of the model types we provide, such as CellClassification (sequence classification). If not, you should load it as you did above and extract embeddings by running a forward pass through the model. Alternatively, you could save only the trunk of the model and load it as Pretrained.
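To illustrate the forward-pass route: below is a minimal sketch using a tiny randomly initialized BERT-style sequence-classification model as a stand-in for the fine-tuned checkpoint (the config values and pooling choice are assumptions for demonstration, not Geneformer's exact settings). The key point is that hidden_states[-1] comes from the trunk, before the 1-unit regression head, so the head never enters the embedding.

```python
import torch
from transformers import BertConfig, BertForSequenceClassification

# Hypothetical tiny model standing in for the fine-tuned checkpoint.
# In practice you would instead load your own weights, e.g.:
#   model = CustomSequenceClassification.from_pretrained(
#       finetuned_model_path, num_labels=1, problem_type="regression")
config = BertConfig(vocab_size=100, hidden_size=32, num_hidden_layers=2,
                    num_attention_heads=2, intermediate_size=64,
                    num_labels=1, problem_type="regression")
model = BertForSequenceClassification(config).eval()

input_ids = torch.tensor([[1, 5, 7, 2]])  # placeholder token IDs
with torch.no_grad():
    outputs = model(input_ids, output_hidden_states=True)

# hidden_states[-1] is the last trunk layer, upstream of the regression
# head, so the head's weights cannot contaminate the embedding.
token_embs = outputs.hidden_states[-1]   # (batch, seq_len, hidden)
cell_emb = token_embs.mean(dim=1)        # mean-pool over tokens
print(tuple(cell_emb.shape))             # (1, 32)
```

For real data you would also mask out padding tokens before pooling.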
Thanks for your reply!
So if the model loads successfully through EmbExtractor with num_classes=1, is this way of extracting embeddings feasible?
I would suggest loading it separately yourself to confirm that it loads properly, since it is not one of the model types explicitly supported by this repository's model-loading function. That way you can ensure it doesn't load a randomly initialized head on top of your model, or, if it does, you will know to adjust your layer selection so that layer is not used for extraction.
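For the trunk-saving alternative mentioned above, one way to sketch it (again with a hypothetical tiny model in place of the real checkpoint; the attribute name .bert assumes a BERT-based architecture) is to save only the encoder and reload it as a plain headless model:

```python
import tempfile
import torch
from transformers import BertConfig, BertForSequenceClassification, BertModel

# Hypothetical tiny model standing in for the fine-tuned checkpoint.
config = BertConfig(vocab_size=100, hidden_size=32, num_hidden_layers=2,
                    num_attention_heads=2, intermediate_size=64, num_labels=1)
finetuned = BertForSequenceClassification(config)

with tempfile.TemporaryDirectory() as trunk_dir:
    # Save only the trunk (finetuned.bert), dropping the regression head;
    # the saved directory can then be loaded as a Pretrained model.
    finetuned.bert.save_pretrained(trunk_dir)
    trunk = BertModel.from_pretrained(trunk_dir)

# The reloaded trunk weights match the fine-tuned encoder exactly.
same = torch.equal(finetuned.bert.embeddings.word_embeddings.weight,
                   trunk.embeddings.word_embeddings.weight)
print(same)  # True
```

Because the head is never saved, extracting from any layer of the reloaded trunk is safe.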