giskardai/giskard-evaluator · Report for lxyuan/distilbert-base-multilingual-cased-sentiments-student

Hi Team,

This is a report from Giskard Bot Scan 🐢.

We have identified 9 potential vulnerabilities in your model based on an automated scan.

This automated analysis evaluated the model on the dataset cardiffnlp/tweet_sentiment_multilingual (subset all, split test).

👉Ethical issues (1)

When feature “text” is perturbed with the transformation “Switch Religion”, the model changes its prediction in 16.28% of the cases. We expected the predictions not to be affected by this transformation.

Level	Data slice	Metric	Deviation
major 🔴	—	Fail rate = 0.163	7/43 tested samples (16.28%) changed prediction after perturbation

Taxonomy

avid-effect:ethics:E0101 avid-effect:performance:P0201

🔍✨Examples

	text	Switch Religion(text)	Original prediction	Prediction after perturbation
1068	Not sure I can take anymore. Brexit, Trump and now no more Casey and Jessica has left Eric. God is life worth living ? Tesla model S,o YES.	Not sure I can take anymore. Brexit, Trump and now no more Casey and Jessica has left Eric. allah is life worth living ? Tesla model S,o YES.	positive (p = 0.44)	negative (p = 0.39)
1184	If @user made an appearance as Adam again I'd have to call him a God because he has so much material on #ThisIsUs #yr #Dreams	If @user made an appearance as Adam again I'd have to call him a allah because he has so much material on #ThisIsUs #yr #Dreams	positive (p = 0.68)	negative (p = 0.53)
1238	whew god damn lea michele is so sexy #LeaMichele #ScreamQueens #Hester #Booty	whew allah damn lea michele is so sexy #LeaMichele #ScreamQueens #Hester #Booty	positive (p = 0.52)	negative (p = 0.44)

👉Performance issues (1)

For records in the dataset where text contains "http", the Precision is 15.0% lower than the global Precision.

Level	Data slice	Metric	Deviation
major 🔴	`text` contains "http"	Precision = 0.420	-15.00% than global

Taxonomy

avid-effect:performance:P0204

🔍✨Examples

	text	label	Predicted `label`
1	تقول نوال الزغبي : http	neutral	negative (p = 0.46)
4	الفنانة نوال الزغبي سنة 90 http	neutral	positive (p = 0.61)
7	“إيغيل فيلمز” تطلق “#ولعانة”.. و #نوال_الزغبي توجه كلمة لـ #ماغي_بوغصن عبر”فوشيا”http http	neutral	positive (p = 0.51)

👉Overconfidence issues (1)

For records in the dataset where text contains "http", we found a significantly higher number of overconfident wrong predictions (435 samples, corresponding to 40.96% of the wrong predictions in the data slice).

Level	Data slice	Metric	Deviation
major 🔴	`text` contains "http"	Overconfidence rate = 0.410	+29.93% than global

Taxonomy

avid-effect:performance:P0204

🔍✨Examples

	text	label	Predicted `label`
4891	Visionari la “buona scuola” nelle relazioni di cinque esperti http #fb	neutral	positive (p = 0.97)
			neutral (p = 0.02)
1934	Le 'chat le plus triste du monde' reçoit des milliers de demandes d'adoption http	positive	negative (p = 0.95)
			positive (p = 0.03)
5038	http miglioriamo la scuola: una collaborazione degli storici della lingua a "La buona scuola" di Renzi	neutral	positive (p = 0.95)
			neutral (p = 0.03)

👉Robustness issues (6)

When feature “text” is perturbed with the transformation “Transform to uppercase”, the model changes its prediction in 34.7% of the cases. We expected the predictions not to be affected by this transformation.

Level	Data slice	Metric	Deviation
major 🔴	—	Fail rate = 0.347	347/1000 tested samples (34.7%) changed prediction after perturbation

Taxonomy

avid-effect:performance:P0201

🔍✨Examples

	text	Transform to uppercase(text)	Original prediction	Prediction after perturbation
1640	I SUCCESSFULLY CAPTURED THE HATCHIMAL! My journey has ended and I will go down in the history books. #hatchimals…	I SUCCESSFULLY CAPTURED THE HATCHIMAL! MY JOURNEY HAS ENDED AND I WILL GO DOWN IN THE HISTORY BOOKS. #HATCHIMALS…	negative (p = 0.84)	positive (p = 0.61)
2040	Invités à La Rochelle, communistes et écologistes sans merci: Très critiques sur les «renoncements» de Françoi... http	INVITÉS À LA ROCHELLE, COMMUNISTES ET ÉCOLOGISTES SANS MERCI: TRÈS CRITIQUES SUR LES «RENONCEMENTS» DE FRANÇOI... HTTP	negative (p = 0.69)	positive (p = 0.53)
3117	Wenigstens weiß ich, dass meine Fotos nicht besonders gut sind.	WENIGSTENS WEISS ICH, DASS MEINE FOTOS NICHT BESONDERS GUT SIND.	negative (p = 0.83)	positive (p = 0.51)

When feature “text” is perturbed with the transformation “Transform to title case”, the model changes its prediction in 26.9% of the cases. We expected the predictions not to be affected by this transformation.

Level	Data slice	Metric	Deviation
major 🔴	—	Fail rate = 0.269	269/1000 tested samples (26.9%) changed prediction after perturbation

Taxonomy

avid-effect:performance:P0201

🔍✨Examples

	text	Transform to title case(text)	Original prediction	Prediction after perturbation
1641	My mom used to say,"Lay down with dogs, get up with fleas." Just sayin' . #TrumpTransitionTeam #DrainTheSwamp	My Mom Used To Say,"Lay Down With Dogs, Get Up With Fleas." Just Sayin' . #Trumptransitionteam #Draintheswamp	negative (p = 0.50)	positive (p = 0.59)
6612	@user faltabas mi samo, la familia unida jajajajaja, te extraño, venn	@User Faltabas Mi Samo, La Familia Unida Jajajajaja, Te Extraño, Venn	negative (p = 0.53)	positive (p = 0.67)
3467	@user hehehe ;) was macht die Ikea Bastelei?	@User Hehehe ;) Was Macht Die Ikea Bastelei?	negative (p = 0.43)	positive (p = 0.45)

When feature “text” is perturbed with the transformation “Add typos”, the model changes its prediction in 16.0% of the cases. We expected the predictions not to be affected by this transformation.

Level	Data slice	Metric	Deviation
major 🔴	—	Fail rate = 0.160	160/1000 tested samples (16.0%) changed prediction after perturbation

Taxonomy

avid-effect:performance:P0201

🔍✨Examples

	text	Add typos(text)	Original prediction	Prediction after perturbation
1483	5 right-backs Barcelona could sign as Vidal's replacement #fcblive	5 eight-backs Barcelona could sign a Vidal's replacemwnt #fcblive	positive (p = 0.46)	negative (p = 0.42)
1129	In terms of (exclusive) games I'd play on it, I can mostly only think of FFXV, Persona 5 (which I could play on PS3...), and... that's it?	In terms of (rxclusive) games I'd play on it, I can mostly only think of FFXV, Persona 5 (which could pay on LS3...), and.... that's it?	neutral (p = 0.42)	negative (p = 0.43)
4157	Lega rhia purei takt k sath sbhi bharrt venseyo ki duaey ap k sath hai	KLega rhia puei takt k sath sbhi bharrt venseyo ki duaey ap k sqath hai	positive (p = 0.37)	negative (p = 0.36)

When feature “text” is perturbed with the transformation “Transform to lowercase”, the model changes its prediction in 9.9% of the cases. We expected the predictions not to be affected by this transformation.

Level	Data slice	Metric	Deviation
medium 🟡	—	Fail rate = 0.099	99/1000 tested samples (9.9%) changed prediction after perturbation

Taxonomy

avid-effect:performance:P0201

🔍✨Examples

	text	Transform to lowercase(text)	Original prediction	Prediction after perturbation
1923	Les Verts Suisse	Malgré les belles promesses écologistes, la majorité refuse de s'engager pour une économie verte /gruene/fr/positions/...	les verts suisse	malgré les belles promesses écologistes, la majorité refuse de s'engager pour une économie verte /gruene/fr/positions/...
6072	#masterchefbr Mirian, oh, sucesso procê. A saída é ali no fundo. Brigado por ocê tê vindo, viu ? Bjo.	#masterchefbr mirian, oh, sucesso procê. a saída é ali no fundo. brigado por ocê tê vindo, viu ? bjo.	positive (p = 0.46)	negative (p = 0.64)
3500	Jaanma main bol rahi hu ki,tum mere twits dekho :/	jaanma main bol rahi hu ki,tum mere twits dekho :/	positive (p = 0.40)	negative (p = 0.38)

When feature “text” is perturbed with the transformation “Punctuation Removal”, the model changes its prediction in 9.3% of the cases. We expected the predictions not to be affected by this transformation.

Level	Data slice	Metric	Deviation
medium 🟡	—	Fail rate = 0.093	93/1000 tested samples (9.3%) changed prediction after perturbation

Taxonomy

avid-effect:performance:P0201

🔍✨Examples

	text	Punctuation Removal(text)	Original prediction	Prediction after perturbation
2576	Événement. Hommage à Mandela en direct des Sud à Arles http	@user http	Événement Hommage à Mandela en direct des Sud à Arles http	@user http
6507	@user Para Nigtwish tengo asiento, aunque fueramos juntos no podríamos sentarnos uno al lado del otro Para los otros tengo pista	@user Para Nigtwish tengo asiento aunque fueramos juntos no podríamos sentarnos uno al lado del otro Para los otros tengo pista	neutral (p = 0.35)	positive (p = 0.36)
3069	Hilfe! Hilfe! Ruf mal einer schnell die Modepolizei! Meine Augen werden gerade vergewaltigt!	Hilfe Hilfe Ruf mal einer schnell die Modepolizei Meine Augen werden gerade vergewaltigt	positive (p = 0.94)	negative (p = 0.51)

When feature “text” is perturbed with the transformation “Accent Removal”, the model changes its prediction in 9.1% of the cases. We expected the predictions not to be affected by this transformation.

Level	Data slice	Metric	Deviation
medium 🟡	—	Fail rate = 0.091	91/1000 tested samples (9.1%) changed prediction after perturbation

Taxonomy

avid-effect:performance:P0201

🔍✨Examples

	text	Accent Removal(text)	Original prediction	Prediction after perturbation
5357	Victor B não ouse sair hoje!! Eu te venero #MasterChefBR	Victor B nao ouse sair hoje!! Eu te venero #MasterChefBR	negative (p = 0.44)	positive (p = 0.64)
6378	Cómo es eso que te extraño? Tamaaare	Como es eso que te extrano? Tamaaare	negative (p = 0.45)	positive (p = 0.37)
6464	@user el sábado unas risas todos juntos... A por otro año más!	@user el sabado unas risas todos juntos... A por otro ano mas!	positive (p = 0.47)	negative (p = 0.46)

Checkout out the Giskard Space and test your model.

Disclaimer: it's important to note that automated scans may produce false positives or miss certain vulnerabilities. We encourage you to review the findings and assess the impact accordingly.