Serving this 1M model with vLLM triggers a download of the Coder-0.6B safetensors (a 1.5 GB file), and responses are capped at a context window of no more than 40k tokens. Is anything wrong?
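For context, a minimal sketch of the kind of serve invocation in question — `<model-id>` is a placeholder, and the `--max-model-len` value shown is an assumption about the intended 1M window, not the exact command used:

```shell
# Sketch only: request an explicit context length instead of relying on the
# default derived from the model config. <model-id> is a placeholder for the
# actual checkpoint name.
vllm serve <model-id> --max-model-len 1000000
```

If `--max-model-len` is not passed, vLLM takes the limit from the model's config, so the startup logs showing the effective maximum sequence length may reveal where the 40k cap is coming from.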