arxiv:2510.03270

CoDA: Coding LM via Diffusion Adaptation

Published on Sep 27
· Submitted by Weiran Yao on Oct 8

Abstract

AI-generated summary

CoDA, a 1.7B-parameter diffusion coder, matches or surpasses diffusion models of up to 7B parameters through confidence-guided sampling, and is released with fully open-source tools.

Diffusion language models promise bidirectional context and infilling capabilities that autoregressive coders lack, yet practical systems remain heavyweight. We introduce CoDA, a 1.7B-parameter diffusion coder trained on TPU with a fully open-source training pipeline. CoDA pairs large-scale diffusion pre-training with code-centric mid-training and instruction tuning, enabling confidence-guided sampling that keeps inference latency competitive. On HumanEval, MBPP, and EvalPlus, CoDA-1.7B-Instruct matches or surpasses diffusion models up to 7B parameters. Our release includes model checkpoints, evaluation harnesses, and TPU training pipelines to accelerate research on lightweight diffusion-based coding assistants.
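The confidence-guided sampler itself is not detailed on this page, so the sketch below is only a generic illustration of how confidence-guided parallel decoding usually works in masked diffusion LMs: each round, the model scores every masked position, the most confident predictions are committed, and the rest stay masked for the next round. The `MASK_ID`, the round budget, and the Hugging Face-style `model(...).logits` interface are assumptions for illustration, not CoDA's actual implementation.

```python
import torch

MASK_ID = 0  # assumed [MASK] token id; the real vocabulary may differ


def confidence_guided_decode(model, prompt_ids, gen_len=64, steps=8):
    """Fill `gen_len` masked positions over several rounds, committing the
    highest-confidence predictions each round and re-scoring the rest."""
    device = prompt_ids.device
    masks = torch.full((gen_len,), MASK_ID, dtype=prompt_ids.dtype, device=device)
    seq = torch.cat([prompt_ids, masks])
    per_step = max(1, gen_len // steps)      # tokens to commit per round

    while bool((seq == MASK_ID).any()):
        logits = model(seq.unsqueeze(0)).logits[0]    # assumed HF-style output: (len, vocab)
        probs = logits.softmax(dim=-1)
        conf, pred = probs.max(dim=-1)                # per-position confidence and argmax token

        still_masked = seq == MASK_ID
        conf = conf.masked_fill(~still_masked, -1.0)  # rank only masked positions

        k = min(per_step, int(still_masked.sum()))
        commit = conf.topk(k).indices                 # most confident masked slots
        seq[commit] = pred[commit]                    # unmask them in parallel
    return seq
```

Committing several positions per round is what makes decoding parallel: latency scales with the number of rounds rather than with the number of generated tokens.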

Community

Paper submitter

CoDA-1.7B is built for code-editing ✍️ tasks, while its overall coding performance is on par with 7B models. The cool part is that it does parallel decoding, so it's blazingly fast ⚡️ during inference!

The models, pre/mid/post-training code and frameworks have all been open-sourced:

→ 🤗 𝗛𝘂𝗴𝗴𝗶𝗻𝗴 𝗙𝗮𝗰𝗲: https://huggingface.co/Salesforce/CoDA-v0-Instruct
→ 🤖 𝗚𝗶𝘁𝗛𝘂𝗯: https://github.com/SalesforceAIResearch/CoDA/
→ 📑 𝗧𝗲𝗰𝗵 𝗥𝗲𝗽𝗼𝗿𝘁: https://www.arxiv.org/abs/2510.03270
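To try the released checkpoint, a minimal loading sketch follows. It assumes only the standard transformers `from_pretrained` API with `trust_remote_code=True`; the generation call for a diffusion LM is model-specific, so follow the GitHub README for the actual inference routine.

```python
# Minimal loading sketch. trust_remote_code=True is an assumption made because
# diffusion LMs typically ship custom modeling code; the generation entry point
# is model-specific and documented in the CoDA GitHub repository.
from transformers import AutoModel, AutoTokenizer

model_id = "Salesforce/CoDA-v0-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModel.from_pretrained(model_id, trust_remote_code=True)

prompt = "Write a Python function that reverses a linked list."
inputs = tokenizer(prompt, return_tensors="pt")
# Diffusion coders usually expose a custom sampling routine instead of
# `generate()`; see https://github.com/SalesforceAIResearch/CoDA/ for the
# supported inference call and recommended decoding settings.
```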

Models citing this paper: 2
Datasets citing this paper: 0
Spaces citing this paper: 0
Collections including this paper: 3