43 47 147

Stefano Fiorucci PRO

anakin87

AI & ML interests

Contributing to Haystack LLM framework 🏗️. Language Models: orchestration, post-training, synthetic data...

Recent Activity

upvoted a collection 1 day ago

T5Gemma

upvoted an article 2 days ago

cocogold: training Marigold for text-grounded segmentation

posted an update 8 days ago

🛡️ AI Guardrails with Open Language Models - Tutorial 📓 https://haystack.deepset.ai/cookbook/safety_moderation_open_lms How do you ensure your AI application is safe from harmful or inappropriate user inputs? This is a core requirement for real-world AI deployments. Luckily, several open Language Models are built specifically for safety moderation. I've been exploring them and put together a hands-on tutorial using the Haystack framework to build your own AI guardrails. In the notebook, you'll learn how to use and customize: 🔹 Meta Llama Guard (via Hugging Face API) 🔹 IBM Granite Guardian (via Ollama), which can also evaluate RAG specific risk dimensions 🔹 Google ShieldGemma (via Ollama) 🔹 Nvidia NemoGuard models family, including a model for topic control You'll also see how to integrate content moderation into a 🔎 RAG pipeline.

View all activity

Organizations

Posts 16

Post

353

🛡️ AI Guardrails with Open Language Models - Tutorial

📓 https://haystack.deepset.ai/cookbook/safety_moderation_open_lms

How do you ensure your AI application is safe from harmful or inappropriate user inputs?

This is a core requirement for real-world AI deployments. Luckily, several open Language Models are built specifically for safety moderation.

I've been exploring them and put together a hands-on tutorial using the Haystack framework to build your own AI guardrails.

In the notebook, you'll learn how to use and customize:
🔹 Meta Llama Guard (via Hugging Face API)
🔹 IBM Granite Guardian (via Ollama), which can also evaluate RAG specific risk dimensions
🔹 Google ShieldGemma (via Ollama)
🔹 Nvidia NemoGuard models family, including a model for topic control

You'll also see how to integrate content moderation into a 🔎 RAG pipeline.

View all Posts