yahavb
Add NxD compiled model yahavb/inf2-bs32-tp16-mml16k-llama-31-8b-vllm for vLLM, after converting checkpoints
afe0afe verified