google/shieldgemma-2-4b-it · how do i interpret the results

Mar 19

I'm using the default policies and got the following results. How should I interpret them? If I understand correctly, the logits tensor of shape (3, 2) corresponds to the 3 default policies, and each policy has two values, yes and no. So my question is: why do two of the rows always come out as [nan, nan]?

ShieldGemma2ImageClassifierOutputWithNoAttention(loss=None, logits=tensor([[ nan, nan],
[ nan, nan],
[35.1562, 53.1972]], device='cuda:0'), hidden_states=None, probabilities=tensor([[ nan, nan],
[ nan, nan],
[1.4618e-08, 1.0000e+00]], device='cuda:0'))

BITDDD

Mar 24

With the latest version of Transformers, I get results like this: tensor([[5.3998e-02, 9.4600e-01],
[3.6518e-01, 6.3482e-01],
[1.8238e-08, 1.0000e+00]]). These are the probabilities of the image looking pornographic, dangerous, or bloody and violent.

merve

Google org Mar 25

Hello, the model outputs scores for Yes and No, respectively, for each policy (the policies are provided as text, as you said) @BITDDD
@cuiyi0326 can you post your inference code here so I can reproduce this?
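
To make the Yes/No reading concrete, here is a minimal sketch. It assumes the `probabilities` field is a softmax over each `[yes, no]` row of `logits`; that is inferred from the numbers in the original post (the third row reproduces), not taken from the model's source. `softmax_pair` is a hypothetical helper, not part of Transformers.

```python
import math

def softmax_pair(yes_logit, no_logit):
    """Softmax over one [yes, no] logit pair, shifted by the max for stability."""
    m = max(yes_logit, no_logit)
    e_yes = math.exp(yes_logit - m)
    e_no = math.exp(no_logit - m)
    total = e_yes + e_no
    return e_yes / total, e_no / total

# Third row of logits from the original post: [35.1562, 53.1972]
p_yes, p_no = softmax_pair(35.1562, 53.1972)
print(f"P(yes)={p_yes:.4e}  P(no)={p_no:.4e}")
```

Under that assumption, a tiny P(yes) next to a P(no) close to 1 means the model judges that the image does not violate that policy.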

merve

Google org Mar 26

Hello again, I built a notebook on how to properly use ShieldGemma 2 with custom policies: https://github.com/merveenoyan/smol-vision/blob/main/ShieldGemma_2_for_Vision_LM_Safety.ipynb

iampelle

17 days ago

With the latest version of Transformers, I get results like this: tensor([[5.3998e-02, 9.4600e-01],
[3.6518e-01, 6.3482e-01],
[1.8238e-08, 1.0000e+00]]). These are the probabilities of the image looking pornographic, dangerous, or bloody and violent.

I think the order here is "dangerous", "sexual", "violence", but I'm not sure. I guess feeding in images from each of the three categories is the only way to find out?
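
In case it helps anyone reading the rows, here is a small sketch that pairs each row with a policy name and flags NaN rows like the ones in the original post. The row order in `ASSUMED_POLICY_ORDER` is only the guess from this post and has not been verified; `label_rows` is a hypothetical helper, not a Transformers API.

```python
import math

# Assumed row order, taken from the guess in this thread; not verified.
ASSUMED_POLICY_ORDER = ["dangerous", "sexual", "violence"]

def label_rows(probabilities, policy_names=ASSUMED_POLICY_ORDER):
    """Pair each [P(yes), P(no)] row with a policy name; NaN rows map to None."""
    if len(probabilities) != len(policy_names):
        raise ValueError("expected one row per policy")
    report = {}
    for name, (p_yes, p_no) in zip(policy_names, probabilities):
        if math.isnan(p_yes) or math.isnan(p_no):
            report[name] = None  # invalid row, e.g. [nan, nan]
        else:
            report[name] = {"yes": p_yes, "no": p_no}
    return report

# Probabilities quoted above, as nested lists.
rows = [[5.3998e-02, 9.4600e-01],
        [3.6518e-01, 6.3482e-01],
        [1.8238e-08, 1.0000e+00]]
for name, scores in label_rows(rows).items():
    print(name, scores)
```

With a real output tensor, `probabilities.tolist()` gives the nested lists this helper expects.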