-
Qwen/Qwen3-Reranker-0.6B
Text Ranking • 0.6B • Updated • 287k • 209 -
jinaai/jina-reranker-m0
Text Classification • 2B • Updated • 23.7k • 105 -
jinaai/jina-reranker-v2-base-multilingual
Text Classification • 0.3B • Updated • 1.32M • 298 -
jinaai/jina-embeddings-v2-base-en
Feature Extraction • 0.1B • Updated • 139k • 726
Bjorn Melin
BjornMelin
AI & ML interests
Large Language Models, AI Agents, Multi-Agent Orchestrations, Deep Learning, NLP, Local LLM Optimization.
Recent Activity
updated
a collection
2 days ago
Smol Models
liked
a model
2 days ago
nvidia/NVIDIA-Nemotron-Nano-9B-v2
liked
a model
2 days ago
deepseek-ai/DeepSeek-V3.1
Organizations
None yet
Datasets
Fine Tuning
Legendary VL Models
Smol Models
My favorite smaller models under 10B parameters.
-
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
Text Generation • 8B • Updated • 297k • 293 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 214k • • 199 -
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation • 8B • Updated • 812k • • 788 -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 368k • • 532
Llama
-
MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
Text Generation • 3B • Updated • 151k • 14 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.84M • • 1.67k -
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 14.1M • • 4.51k -
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation • 8B • Updated • 155k • 25
LLMs
-
deepseek-ai/DeepSeek-V3
Text Generation • 685B • Updated • 481k • • 3.96k -
sentence-transformers/static-retrieval-mrl-en-v1
Sentence Similarity • Updated • 45 -
internlm/internlm3-8b-instruct
Text Generation • 9B • Updated • 84k • 227 -
NovaSky-AI/Sky-T1-32B-Preview
Text Generation • 33B • Updated • 1.47k • • 549
Embedding Models
Single 4090 Laptop GPU
-
nvidia/OpenReasoning-Nemotron-32B
Text Generation • 33B • Updated • 3.07k • • 109 -
Qwen/Qwen3-32B-AWQ
Text Generation • 6B • Updated • 447k • 102 -
all-hands/openhands-lm-32b-v0.1
Text Generation • 33B • Updated • 2.85k • • 388 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
Text Generation • 15B • Updated • 218k • • 553
Leaderboards
-
Running137137
smolagents LLM leaderboard
🏆A leaderboard for LLMs powering smolagents
-
Running371371
LLM Performance Leaderboard
🐨View LLM performance rankings
-
Running180180
Low-bit Quantized Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
-
Running1.03k1.03k
UGI Leaderboard
📢Uncensored General Intelligence Leaderboard
Coding Models
Google
-
google/gemma-3-27b-it-qat-q4_0-gguf
Image-Text-to-Text • 27B • Updated • 9.85k • 326 -
unsloth/gemma-3-27b-it-GGUF
Image-Text-to-Text • 27B • Updated • 54.5k • 151 -
google/gemma-3-27b-it
Image-Text-to-Text • 27B • Updated • 540k • • 1.57k -
google/gemma-3n-E4B-it
Image-Text-to-Text • 8B • Updated • 102k • 735
Qwen
Rerankers
-
Qwen/Qwen3-Reranker-0.6B
Text Ranking • 0.6B • Updated • 287k • 209 -
jinaai/jina-reranker-m0
Text Classification • 2B • Updated • 23.7k • 105 -
jinaai/jina-reranker-v2-base-multilingual
Text Classification • 0.3B • Updated • 1.32M • 298 -
jinaai/jina-embeddings-v2-base-en
Feature Extraction • 0.1B • Updated • 139k • 726
Embedding Models
Datasets
Single 4090 Laptop GPU
-
nvidia/OpenReasoning-Nemotron-32B
Text Generation • 33B • Updated • 3.07k • • 109 -
Qwen/Qwen3-32B-AWQ
Text Generation • 6B • Updated • 447k • 102 -
all-hands/openhands-lm-32b-v0.1
Text Generation • 33B • Updated • 2.85k • • 388 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
Text Generation • 15B • Updated • 218k • • 553
Fine Tuning
Leaderboards
-
Running137137
smolagents LLM leaderboard
🏆A leaderboard for LLMs powering smolagents
-
Running371371
LLM Performance Leaderboard
🐨View LLM performance rankings
-
Running180180
Low-bit Quantized Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
-
Running1.03k1.03k
UGI Leaderboard
📢Uncensored General Intelligence Leaderboard
Legendary VL Models
Coding Models
Smol Models
My favorite smaller models under 10B parameters.
-
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
Text Generation • 8B • Updated • 297k • 293 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 214k • • 199 -
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation • 8B • Updated • 812k • • 788 -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 368k • • 532
Google
-
google/gemma-3-27b-it-qat-q4_0-gguf
Image-Text-to-Text • 27B • Updated • 9.85k • 326 -
unsloth/gemma-3-27b-it-GGUF
Image-Text-to-Text • 27B • Updated • 54.5k • 151 -
google/gemma-3-27b-it
Image-Text-to-Text • 27B • Updated • 540k • • 1.57k -
google/gemma-3n-E4B-it
Image-Text-to-Text • 8B • Updated • 102k • 735
Llama
-
MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
Text Generation • 3B • Updated • 151k • 14 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.84M • • 1.67k -
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 14.1M • • 4.51k -
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation • 8B • Updated • 155k • 25
Qwen
LLMs
-
deepseek-ai/DeepSeek-V3
Text Generation • 685B • Updated • 481k • • 3.96k -
sentence-transformers/static-retrieval-mrl-en-v1
Sentence Similarity • Updated • 45 -
internlm/internlm3-8b-instruct
Text Generation • 9B • Updated • 84k • 227 -
NovaSky-AI/Sky-T1-32B-Preview
Text Generation • 33B • Updated • 1.47k • • 549