Fix a bug that occurs with vLLM version 0.8.5:

    self.q_size = self.num_heads * self.head_dim
TypeError: unsupported operand type(s) for *: 'int' and 'NoneType'
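
The traceback shows `self.head_dim` being `None` when `q_size` is computed. One plausible cause, assumed here and not confirmed by this PR, is a model config whose `head_dim` field is missing or null. A minimal sketch of a fallback follows; `resolve_head_dim` and the example config values are hypothetical, not part of vLLM or of this change:

```python
from types import SimpleNamespace


def resolve_head_dim(config):
    """Return config.head_dim, or derive it when it is missing or None.

    Hypothetical helper: assumes the config exposes `hidden_size` and
    `num_attention_heads`, as Hugging Face style configs usually do.
    """
    head_dim = getattr(config, "head_dim", None)
    if head_dim is None:
        # Fall back to the conventional per-head width.
        head_dim = config.hidden_size // config.num_attention_heads
    return head_dim


# Illustrative config with head_dim explicitly set to None (assumed values).
config = SimpleNamespace(hidden_size=4096, num_attention_heads=32, head_dim=None)

num_heads = config.num_attention_heads
head_dim = resolve_head_dim(config)
q_size = num_heads * head_dim  # no longer multiplies an int by None
```

With a fallback of this kind, the multiplication always receives an integer, whether or not the config file defines `head_dim`.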
