Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Rayen's picture
12 1 4

Rayen

Lissanro
https://Dragon.Studio
  • Lissanro

AI & ML interests

None yet

Recent Activity

new activity 1 day ago
deepseek-ai/DeepSeek-V3.1:Context length: is it 128K (as mentioned in the model card) or 160K (as specified in config.json)?
new activity 3 months ago
tngtech/DeepSeek-R1T-Chimera:Any plans to release an updated version based on DeepSeek-V3-0526 + R1, or how to create the merge myself?
new activity 4 months ago
bullerwins/DeepSeek-R1T-Chimera-GGUF:Please consider creating ik_llama.cpp compatible quants (without llama.cpp-specific MLA tensors)
View all activity

Organizations

None yet

upvoted a collection about 1 year ago

SSMs

Collection
A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated 8 days ago • 28
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs