Young Sik Hong's picture

Young Sik Hong

RICHARDYHONG

·

AI & ML interests

None yet

Recent Activity

liked a model about 19 hours ago

FINAL-Bench/Darwin-60B-DUO

reacted to SeaWolf-AI's post with ❤️ about 19 hours ago

Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀 We're excited to release Darwin-60B-DUO, the Darwin family's first DUO model. Take two domain-verified specialists, hide them behind a single OpenAI-compatible endpoint, and let a router decide which one (or both) answers. You see one model, one API — but get the best of both. The number that matters: on the full 198-question GPQA Diamond, Darwin-60B-DUO hits 88.38%. The constituents alone land at 69.70% (Darwin-28B-REASON) and 77.27% (AWAXIS-Think-31B); a naive cascade only reaches 83.84%. The DUO clears them all. Two small specialists, intelligently routed, beat one big generalist on cost and quality. Both are independently verified — Darwin-28B-REASON is #3 on the HF GPQA Diamond leaderboard, AWAXIS-Think-31B is #1 on Korea's national K-AI Leaderboard (MSIT). The brains is a Hybrid-A router picking one of five strategies on the fly. Korean → AWAXIS, English/STEM → Darwin (single-backend, ~70% of traffic at 1× cost). When a Korean answer needs rigorous English reasoning, split_refine fires — Darwin drafts, AWAXIS polishes; MCQ/short-answer runs both with self-consistency + cross-verify. Net effective cost: only ~1.3× a single 30B model. The part the community will care about: the gateway is model-agnostic and Apache-2.0. Point it at any two OpenAI-compatible backends and you've got a DUO in minutes — teach router.py when to use which, and parallel calls, response merging, and routing transparency via _duo_route are handled for you. Fork it and tell us what you built. Painless deploy: docker compose up for both vLLM backends + gateway; FP8 ~30GB colocates on a single B200/H100. One git clone (~120GB). Text-only for now, streaming in v1.1. Two SOTAs, one endpoint. Come build your own on the Community tab. 👇 🔗 https://huggingface.co/FINAL-Bench/Darwin-60B-DUO

reacted to SeaWolf-AI's post with 👍 about 19 hours ago

Darwin-60B-DUO: Two SOTAs, One Endpoint — 88.38% on GPQA Diamond 🚀 We're excited to release Darwin-60B-DUO, the Darwin family's first DUO model. Take two domain-verified specialists, hide them behind a single OpenAI-compatible endpoint, and let a router decide which one (or both) answers. You see one model, one API — but get the best of both. The number that matters: on the full 198-question GPQA Diamond, Darwin-60B-DUO hits 88.38%. The constituents alone land at 69.70% (Darwin-28B-REASON) and 77.27% (AWAXIS-Think-31B); a naive cascade only reaches 83.84%. The DUO clears them all. Two small specialists, intelligently routed, beat one big generalist on cost and quality. Both are independently verified — Darwin-28B-REASON is #3 on the HF GPQA Diamond leaderboard, AWAXIS-Think-31B is #1 on Korea's national K-AI Leaderboard (MSIT). The brains is a Hybrid-A router picking one of five strategies on the fly. Korean → AWAXIS, English/STEM → Darwin (single-backend, ~70% of traffic at 1× cost). When a Korean answer needs rigorous English reasoning, split_refine fires — Darwin drafts, AWAXIS polishes; MCQ/short-answer runs both with self-consistency + cross-verify. Net effective cost: only ~1.3× a single 30B model. The part the community will care about: the gateway is model-agnostic and Apache-2.0. Point it at any two OpenAI-compatible backends and you've got a DUO in minutes — teach router.py when to use which, and parallel calls, response merging, and routing transparency via _duo_route are handled for you. Fork it and tell us what you built. Painless deploy: docker compose up for both vLLM backends + gateway; FP8 ~30GB colocates on a single B200/H100. One git clone (~120GB). Text-only for now, streaming in v1.1. Two SOTAs, one endpoint. Come build your own on the Community tab. 👇 🔗 https://huggingface.co/FINAL-Bench/Darwin-60B-DUO

View all activity

Organizations

None yet

upvoted a paper 14 days ago

Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

Paper • 2605.14386 • Published 15 days ago • 60

upvoted an article about 1 month ago

Article

Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — It Started Showing Emotion

FINAL-Bench

•

Apr 15

• 13

upvoted an article about 2 months ago

Article

"Darwin-27B-Opus: Surpassing the Foundation Model Without Training"

FINAL-Bench

•

Apr 13

• 13

upvoted a collection about 2 months ago

DARWIN-Family

비드래프트 • 45 items • Updated about 19 hours ago • 20

upvoted 3 articles about 2 months ago

Article

Darwin V6: Diagnostic-Guided Evolutionary Model Merging

FINAL-Bench

•

Apr 8

• 11

Article

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

FINAL-Bench

•

Mar 31

• 14

Article

Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models

FINAL-Bench

•

Mar 29

• 13