Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop š
38.7
TFLOPS
13
14
93
fahrizalfarid
akahana
Follow
John6666's profile picture
agentlans's profile picture
21world's profile picture
11 followers
Ā·
50 following
fahrizalfarid
fahrizalfarid
AI & ML interests
NLP
Recent Activity
reacted
to
SeaWolf-AI
's
post
with š„
about 12 hours ago
šļø Smol AI WorldCup: A 4B Model Just Beat 8B ā Here's the Data We evaluated 18 small language models from 12 makers on 125 questions across 7 languages. The results challenge the assumption that bigger is always better. Community Article: https://huggingface.co/blog/FINAL-Bench/smol-worldcup Live Leaderboard: https://huggingface.co/spaces/ginigen-ai/smol-worldcup Dataset: https://huggingface.co/datasets/ginigen-ai/smol-worldcup What we found: ā Gemma-3n-E4B (4B, 2GB RAM) outscores Qwen3-8B (8B, 5.5GB). Doubling parameters gained only 0.4 points. RAM cost: 2.75x more. ā GPT-OSS-20B fits in 1.5GB yet matches Champions-league dense models requiring 8.5GB. MoE architecture is the edge AI game-changer. ā Thinking models hurt structured output. DeepSeek-R1-7B scores 8.7 points below same-size Qwen3-8B and runs 2.7x slower. ā A 1.3B model fabricates confident fake content 80% of the time when prompted with nonexistent entities. Qwen3 family hits 100% trap detection across all sizes. ā Qwen3-1.7B (1.2GB) outscores Mistral-7B, Llama-3.1-8B, and DeepSeek-R1-14B. Latest architecture at 1.7B beats older architecture at 14B. What makes this benchmark different? Most benchmarks ask "how smart?" ā we measure five axes simultaneously: Size, Honesty, Intelligence, Fast, Thrift (SHIFT). Our ranking metric WCS = sqrt(SHIFT x PIR_norm) rewards models that are both high-quality AND efficient. Smart but massive? Low rank. Tiny but poor? Also low. Top 5 by WCS: 1. GPT-OSS-20B ā WCS 82.6 ā 1.5GB ā Raspberry Pi tier 2. Gemma-3n-E4B ā WCS 81.8 ā 2.0GB ā Smartphone tier 3. Llama-4-Scout ā WCS 79.3 ā 240 tok/s ā Fastest model 4. Qwen3-4B ā WCS 76.6 ā 2.8GB ā Smartphone tier 5. Qwen3-1.7B ā WCS 76.1 ā 1.2GB ā IoT tier Built in collaboration with the FINAL Bench research team. Interoperable with ALL Bench Leaderboard for full small-to-large model comparison. Dataset is open under Apache 2.0 (125 questions, 7 languages). We welcome new model submissions.
updated
a dataset
18 days ago
akahana/wikipedia-id-conv
published
a dataset
18 days ago
akahana/wikipedia-id-conv
View all activity
Organizations
None yet
akahana
's datasets
56
Sort:Ā Recently updated
akahana/wikipedia-id-conv
Viewer
ā¢
Updated
18 days ago
ā¢
666k
ā¢
22
akahana/LLaVA-Instruct-150K
Preview
ā¢
Updated
Jan 13
ā¢
17
akahana/wikipedia-full
Viewer
ā¢
Updated
Dec 24, 2025
ā¢
61.6M
ā¢
724
akahana/Medical-Reasoning-SFT-GPT-OSS-120B
Viewer
ā¢
Updated
Dec 23, 2025
ā¢
200k
ā¢
33
akahana/alpaca-gpt4-indonesian
Viewer
ā¢
Updated
Dec 23, 2025
ā¢
50k
ā¢
14
ā¢
1
akahana/tesis
Preview
ā¢
Updated
Dec 19, 2025
ā¢
16
akahana/doodle-blip-captions
Viewer
ā¢
Updated
Dec 18, 2025
ā¢
1k
ā¢
13
akahana/pokemon-blip-captions
Viewer
ā¢
Updated
Dec 18, 2025
ā¢
833
ā¢
14
akahana/geo
Updated
Dec 16, 2025
ā¢
17
akahana/flickr30k
Updated
Dec 16, 2025
ā¢
9
akahana/english-indonesia-wikimatrix-token
Viewer
ā¢
Updated
Dec 11, 2025
ā¢
1.02M
ā¢
30
akahana/english-indonesia-wikimatrix
Viewer
ā¢
Updated
Dec 9, 2025
ā¢
1.02M
ā¢
12
akahana/english-indonesia
Viewer
ā¢
Updated
Dec 9, 2025
ā¢
1M
ā¢
10
akahana/ubuntu
Updated
Nov 27, 2025
ā¢
7
akahana/anti-spoofing-nuaaaa
Viewer
ā¢
Updated
Jun 4, 2025
ā¢
8.6k
ā¢
10
akahana/anti-spoofing-casiafasd
Viewer
ā¢
Updated
Jun 4, 2025
ā¢
4.06k
ā¢
9
akahana/hifi-gan
Updated
Jun 1, 2025
ā¢
7
akahana/Driver-Drowsiness-Dataset
Viewer
ā¢
Updated
May 14, 2025
ā¢
41.8k
ā¢
24
ā¢
2
akahana/mpii-face-gaze
Updated
May 12, 2025
ā¢
24
akahana/common-voice-11-eng-sample
Updated
May 9, 2025
ā¢
12
akahana/children-codes-stories
Updated
Mar 19, 2025
ā¢
31
akahana/vlm
Updated
Mar 18, 2025
ā¢
15
akahana/medical
Updated
Mar 15, 2025
ā¢
900
akahana/llm-opus-ParaCrawl-english-id-v2
Updated
Mar 13, 2025
ā¢
22
akahana/llamacpp
Updated
Mar 11, 2025
ā¢
6
akahana/camel-ai-sains
Updated
Mar 10, 2025
ā¢
8
akahana/big-machine-translations
Updated
Mar 9, 2025
ā¢
65
akahana/rocov2-full
Updated
Mar 8, 2025
ā¢
9
akahana/dolphin-r1
Viewer
ā¢
Updated
Feb 3, 2025
ā¢
814k
ā¢
22
akahana/OpenThoughts-114k
Viewer
ā¢
Updated
Feb 3, 2025
ā¢
114k
ā¢
55
Previous
1
2
Next