Prepare a version of the SmolLM2 models with MLA (Multi-head Latent Attention)

#9
by verion1 - opened
