I used vLLm to load Vintern Vl 1Bv3_5, but it is not as I wish. Its accuracy was compromised when loading on the vLLM. I think that Its config on the vLLM is different from config when load normal model ?
Can you help to explain this problem?
· Sign up or log in to comment