exllamav3
Collection
ExLlamaV3 (EXL3) quantizations. See https://github.com/turboderp-org/exllamav3
Kevin (K(ernel D)evin) is a 32B-parameter model fine-tuned to write efficient CUDA kernels.
We use KernelBench as our benchmark, and train the model through multi-turn reinforcement learning.
For details, see our blog post at https://cognition.ai/blog/kevin-32b