43 47 147

Stefano Fiorucci PRO

anakin87

AI & ML interests

Contributing to Haystack LLM framework 🏗️. Language Models: orchestration, post-training, synthetic data...

Recent Activity

upvoted a collection 4 days ago

T5Gemma

upvoted an article 5 days ago

cocogold: training Marigold for text-grounded segmentation

posted an update 10 days ago

🛡️ AI Guardrails with Open Language Models - Tutorial 📓 https://haystack.deepset.ai/cookbook/safety_moderation_open_lms How do you ensure your AI application is safe from harmful or inappropriate user inputs? This is a core requirement for real-world AI deployments. Luckily, several open Language Models are built specifically for safety moderation. I've been exploring them and put together a hands-on tutorial using the Haystack framework to build your own AI guardrails. In the notebook, you'll learn how to use and customize: 🔹 Meta Llama Guard (via Hugging Face API) 🔹 IBM Granite Guardian (via Ollama), which can also evaluate RAG specific risk dimensions 🔹 Google ShieldGemma (via Ollama) 🔹 Nvidia NemoGuard models family, including a model for topic control You'll also see how to integrate content moderation into a 🔎 RAG pipeline.

View all activity

Organizations

upvoted a collection 4 days ago

T5Gemma

Collection

32 items • Updated 4 days ago • 48

upvoted an article 5 days ago

Article

cocogold: training Marigold for text-grounded segmentation

•

6 days ago

• 25

upvoted an article 2 months ago

Article

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

and 1 other •

May 7

• 38

upvoted a collection 3 months ago

Qwen Scheduler GRPO

Collection

Train a SLM to create a schedule from a list of events and priorities - Article: https://t.ly/-Dejx - Code: https://t.ly/1J_VG • 2 items • Updated Apr 29 • 4

upvoted an article 3 months ago

Article

I trained a Language Model to schedule events with GRPO!

•

Apr 29

• 80

upvoted an article 4 months ago

Article

Training a Gemma 2 2B-IT for Reasoning with GRPO

•

Mar 18

• 5

upvoted 2 articles 5 months ago

Article

Argunauts Training Phase II: Selfplay Finetuning Line-By-Line

•

Feb 19

• 5

Article

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

•

Feb 10

• 58

upvoted an article 6 months ago

Article

Fine-tune ModernBERT for RAG with Synthetic Data

and 2 others •

Jan 20

• 41

upvoted 2 collections 6 months ago

Gemma Neogenesis 💎🌍🇮🇹

Collection

Datasets and models for Neogenesis: Post-training recipe for improving Gemma 2 for a specific language. Notebook: https://t.ly/iuKdy • 12 items • Updated Mar 10 • 5

Dolphin 3.0

Collection

Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 9 items • Updated Feb 7 • 174

upvoted a collection 7 months ago

alignment_24_best

Collection

33 items • Updated Oct 21, 2024 • 2

upvoted 3 papers 7 months ago

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Paper • 2406.09279 • Published Jun 13, 2024 • 3

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published Dec 16, 2024 • 18

Reverse Thinking Makes LLMs Stronger Reasoners

Paper • 2411.19865 • Published Nov 29, 2024 • 23

upvoted a collection 8 months ago

🇮🇹👓 LLaVA-NDiNO

Collection

HF Collection for the models of the paper "LLaVA-NDiNO: Empowering LLMs with Multimodality for the Italian Language" • 7 items • Updated Oct 20, 2024 • 3

upvoted 3 papers 8 months ago

upvoted an article 8 months ago

Article

SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive

•

Nov 9, 2024

• 9

Stefano Fiorucci PRO

AI & ML interests

Recent Activity

Organizations

anakin87's activity

cocogold: training Marigold for text-grounded segmentation

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

I trained a Language Model to schedule events with GRPO!

Training a Gemma 2 2B-IT for Reasoning with GRPO

Argunauts Training Phase II: Selfplay Finetuning Line-By-Line

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

Fine-tune ModernBERT for RAG with Synthetic Data

SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive