yahavb
Add NxD-compiled model yahavb/inf2-bs16-tp16-mml16k-llama-31-8b-vllm for vLLM, after converting checkpoints
bf643eb verified