Transformers does not recognize this architecture
#6
by
eva20150932-atlascloud
- opened
ValueError: The checkpoint you are trying to load has model type deepseek_v32
but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.
wait new version or pull from github
We don't provide a modeling.py
file like Hugging Face Transformers. You can try our inference demo code instead.
We'll work on porting the model to transformers
asap 🤗
This comment has been hidden (marked as Resolved)