Getting an error when hosting this model on GCP and using a large context

#1 opened by pulkitmehtametacube

Hi all,

We deployed this model on Vertex AI and it works fine for smaller prompts and contexts, but when we pass a 16k-token Paul Graham essay as input, we get the error below. Please suggest a fix.

google.api_core.exceptions.InternalServerError: 500 {"error":"Incomplete generation","error_type":"Incomplete generation"}
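For context, the deployment looks roughly like this. This is only a sketch with placeholder project, image, and model names, assuming the endpoint runs the Hugging Face TGI serving container and that its input/total token limits need to be raised to cover the 16k-token prompt:

```python
# Sketch: uploading and deploying with larger context limits, assuming the
# Hugging Face TGI serving container on Vertex AI. All names, the image URI,
# and the machine/accelerator choices below are illustrative placeholders.
from google.cloud import aiplatform

aiplatform.init(project="my-project", location="us-central1")  # placeholder project/region

model = aiplatform.Model.upload(
    display_name="my-llm-tgi",
    # Placeholder TGI image URI; use the actual serving image for your setup.
    serving_container_image_uri="REGION-docker.pkg.dev/PROJECT/REPO/text-generation-inference:latest",
    serving_container_environment_variables={
        "MODEL_ID": "org/model-name",         # placeholder model id
        "MAX_INPUT_LENGTH": "16384",          # allow the 16k-token prompt
        "MAX_TOTAL_TOKENS": "18432",          # prompt tokens + generated tokens
        "MAX_BATCH_PREFILL_TOKENS": "16384",
    },
)

endpoint = model.deploy(
    machine_type="g2-standard-24",            # placeholder; pick a shape whose GPU memory fits the KV cache
    accelerator_type="NVIDIA_L4",
    accelerator_count=2,
)
```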

Not sure about Vertex AI, I've never worked with it; maybe try vLLM instead.
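If you do go the vLLM route, a minimal sketch of loading the model with a context window large enough for the 16k-token essay (the model id, length, and file path are placeholders, not values from this deployment):

```python
# Sketch: running the same model locally with vLLM instead of the TGI container,
# assuming it fits on the available GPU. Model id and lengths are placeholders.
from vllm import LLM, SamplingParams

llm = LLM(
    model="org/model-name",   # placeholder: the model currently deployed on Vertex AI
    max_model_len=32768,      # must cover the 16k-token prompt plus the generation
)

long_prompt = open("paul_graham_essay.txt").read()  # placeholder path to the 16k-token essay
params = SamplingParams(max_tokens=512, temperature=0.7)

outputs = llm.generate([long_prompt], params)
print(outputs[0].outputs[0].text)
```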
