Transformers does not recognize this architecture

#6
by eva20150932-atlascloud - opened

ValueError: The checkpoint you are trying to load has model type deepseek_v32 but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

wait new version or pull from github

DeepSeek org

We don't provide a modeling.py file like Hugging Face Transformers. You can try our inference demo code instead.

We'll work on porting the model to transformers asap 🤗

This comment has been hidden (marked as Resolved)

Sign up or log in to comment