Embedding from transformers

#6
by tillwenke - opened

Why do you divide by the sum of attention-mask values over ALL tokens across all sentences in the embedding example in the model card?

outputs = torch.sum(
    outputs * inputs["attention_mask"][:, :, None], dim=1
) / torch.sum(inputs["attention_mask"])

It doesn't do any harm for cosine similarity (each sentence embedding is only scaled by a positive constant), but I'd rather divide by the number of tokens in each sentence.
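As a minimal sketch of what I mean by per-sentence normalization (the model name and variable names here are my own assumptions, not taken from the model card):

# Sketch: mean pooling where each sentence is divided by its own token count.
# Model name is an assumed example; swap in the model from the card.
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "sentence-transformers/all-MiniLM-L6-v2"  # assumed example model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

sentences = ["A short sentence.", "A second, noticeably longer example sentence."]
inputs = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs).last_hidden_state        # (batch, seq_len, hidden)

mask = inputs["attention_mask"][:, :, None]             # (batch, seq_len, 1)
summed = torch.sum(outputs * mask, dim=1)                # (batch, hidden)
counts = torch.sum(inputs["attention_mask"], dim=1, keepdim=True)  # (batch, 1)
embeddings = summed / counts                             # per-sentence mean pooling

The only change from the snippet above is the denominator: summing the attention mask with dim=1, keepdim=True gives each sentence its own token count instead of one scalar for the whole batch.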
