Hugging Face
mahmoudi
nizar125
3 followers · 10 following
AI & ML interests
None yet
Recent Activity
reacted to SeaWolf-AI's post with 🔥 2 days ago
Introducing MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

Now available on PyPI · GitHub · ClawHub · HuggingFace

AI models sense they could be wrong, but they can't actually fix what's broken.

Live A/B test: https://huggingface.co/spaces/VIDraft/MARL

We evaluated 9 SOTA models (GPT-5.2, Claude Opus 4.6, Gemini 3 Pro, etc.) across 1,800 assessments in FINAL Bench and found a 39.2 percentage-point gap between "recognizing potential errors (MA=0.694)" and "actually finding and fixing them (ER=0.302)."

MARL (Model-Agnostic Runtime Middleware for LLMs) was built to close this metacognitive gap. It decomposes a single LLM call into a 5-stage expert pipeline (Hypothesis → Solver → Auditor → Adversarial Verifier → Synthesizer), transforming "answer in one shot" into "think, doubt, correct, and rewrite."

No weight modification: it works instantly with GPT-5.4, Claude, Gemini, Llama, or any OpenAI API-compatible LLM by changing one line: base_url. It ships with 9 domain-specific emergence engines (invention, pharma, genomics, chemistry, ecology, law, and more; 5,538 expert data items) activated by a simple tag like model="gpt-5.4::pharma".

pip install marl-middleware

MARL is also officially registered on ClawHub, the skill marketplace of OpenClaw, an AI agent platform with 260K+ developers and 3,200+ skills. It's the first middleware in the Reasoning Enhancement category. One command (clawhub install marl-middleware) gives your AI agent a metacognition upgrade.

Technical deep dive: https://huggingface.co/blog/FINAL-Bench/marl-middleware
PyPI: https://pypi.org/project/marl-middleware/
GitHub: https://github.com/Vidraft/MARL
ClawHub: https://clawhub.ai/Cutechicken99/marl-middleware

#MARL #LLM #Hallucination #Metacognition #MultiAgent #AIMiddleware #FINALBench #OpenClaw #ClawHub #PyPI #AGI #HuggingFace #ReasoningAI #SelfCorrection #GlassBoxAI
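The post above mentions a model tag of the form model="gpt-5.4::pharma". As a rough illustration of that naming convention only, the sketch below splits and builds such tags. The names ModelTag, parse_model_tag and build_model_tag are hypothetical and are not part of marl-middleware; how the middleware itself interprets the tag is not described in the post, so treating "::" as a simple base-model/domain separator is an assumption.

```python
from typing import NamedTuple, Optional


class ModelTag(NamedTuple):
    """Hypothetical container for a 'base::domain' model string."""
    base_model: str
    domain: Optional[str]


def parse_model_tag(tag: str) -> ModelTag:
    """Split 'gpt-5.4::pharma' into its base model and domain parts.

    Assumption: '::' separates the base model from an optional domain.
    A tag without '::' is treated as a plain model name with no domain.
    """
    base, sep, domain = tag.partition("::")
    if not base:
        raise ValueError("model tag must start with a base model name")
    if sep and not domain:
        raise ValueError("'::' must be followed by a domain name")
    return ModelTag(base, domain if sep else None)


def build_model_tag(base_model: str, domain: Optional[str] = None) -> str:
    """Compose a tag string from a base model and an optional domain."""
    return f"{base_model}::{domain}" if domain else base_model


if __name__ == "__main__":
    print(parse_model_tag("gpt-5.4::pharma"))
    print(build_model_tag("gpt-5.4", "pharma"))
```

A string built this way would be passed as the model argument of whatever OpenAI API-compatible client is in use, with that client's base_url pointed at the middleware as the post describes; the actual endpoint address is not given in the post and is left out here.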
liked a model 14 days ago
linagora/linto-asr-ar-tn-0.1
liked a dataset 14 days ago
linagora/linto-dataset-audio-ar-tn
Organizations
spaces
3
Agent (pinned, Sleeping): Duplicate this leaderboard to initialize your own!
First Agent Template (Runtime error)
Template Final Assignment (Sleeping)
models
1
nizar125/Fanar-1-9B-Instruct-AWQ
9B • Updated Feb 2 • 2
datasets
0
None public yet