An experimental coding instruct model. This is a full finetune of DeepSeek-Coder-Instruct-1.3B for 15 hours on 1xA6000 using a bespoke distillation trainer.
- Downloads last month
- 5
Inference API (serverless) is not available, repository is disabled.
An experimental coding instruct model. This is a full finetune of DeepSeek-Coder-Instruct-1.3B for 15 hours on 1xA6000 using a bespoke distillation trainer.