MinorBench: A hand-built benchmark for content-based risks for children Paper ⢠2503.10242 ⢠Published 15 days ago ⢠4
Safe at the Margins: A General Approach to Safety Alignment in Low-Resource English Languages -- A Singlish Case Study Paper ⢠2502.12485 ⢠Published Feb 18 ⢠1
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection Paper ⢠2411.12946 ⢠Published Nov 20, 2024 ⢠22
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper ⢠2412.03555 ⢠Published Dec 4, 2024 ⢠134
Off Topic Guardrail đĄď¸ Collection Fast, lightweight zero-shot classifiers for user prompt's relevance to the system prompt. ⢠5 items ⢠Updated Nov 25, 2024 ⢠4
LionGuard đŚ Collection A Singapore-contextualized moderation classifier. ⢠2 items ⢠Updated Nov 25, 2024 ⢠1
Harnessing the Potential of Gen-AI Coding Assistants in Public Sector Software Development Paper ⢠2409.17434 ⢠Published Sep 25, 2024 ⢠1
LionGuard: Building a Contextualized Moderation Classifier to Tackle Localized Unsafe Content Paper ⢠2407.10995 ⢠Published Jun 24, 2024 ⢠1
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Paper ⢠2411.14432 ⢠Published Nov 21, 2024 ⢠25
1.5-Pints Collection 1.5 Pints is a Large Language Model that significantly advances the efficiency of LLM training by emphasizing data quality over quantity. ⢠4 items ⢠Updated Aug 8, 2024 ⢠3
WebInstruct đ Embeddings 𧹠Models Collection A collection of SoTA embeddings model fine-tuned on WebInstruct dataset to learn to pair instructions with its responses ⢠3 items ⢠Updated Sep 4, 2024 ⢠11
Sailor: Open Language Models for South-East Asia Paper ⢠2404.03608 ⢠Published Apr 4, 2024 ⢠20
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. ⢠12 items ⢠Updated Jan 6 ⢠129