-
deepseek-ai/DeepSeek-V3-Base
Updated • 6.4k • 1.64k -
TransMLA: Multi-head Latent Attention Is All You Need
Paper • 2502.07864 • Published • 50 -
2
Qwen2.5 Bakeneko 32b Instruct Awq
⚡Generate text-based responses for chat interactions
-
2
Deepseek R1 Distill Qwen2.5 Bakeneko 32b Awq
⚡Generate detailed responses based on user queries
Eduardo Espina
Edespina
·
AI & ML interests
None yet
Recent Activity
updated
a Space
17 days ago
Edespina/yris
published
a Space
17 days ago
Edespina/yris
Organizations
None yet
Collections
1
models
0
None public yet
datasets
0
None public yet