michaelbenayoun/deepseekv3-tiny-4kv-heads-4-layers-random • Text Generation • 0.0B • Updated Jul 24 • 13
michaelbenayoun/granite-tiny-4kv-heads-4layers-random • Text Generation • 0.0B • Updated Jun 18 • 2.57k
michaelbenayoun/llama-2-tiny-4kv-heads-2layers-random • Feature Extraction • 0.0B • Updated May 7, 2024 • 11
michaelbenayoun/llama-2-tiny-4kv-heads-8layers-random • Feature Extraction • 0.0B • Updated May 3, 2024 • 6
michaelbenayoun/llama-2-tiny-16layers-32kv-heads-random • Feature Extraction • 0.0B • Updated Jan 4, 2024 • 6
michaelbenayoun/mistral-tiny-4layers-8kv-heads-random • Text Generation • 0.0B • Updated Nov 9, 2023 • 18