How was this made? (question about context size)

#1 by robbiemu - opened

If this was made with llama.cpp: did you modify the config.json to add the Qwen2ForCausalLM YaRN parameters to support the long context, or would it fall back to a 32k context when run in ollama/llama.cpp? (A sketch of the config block in question is below.)
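For reference, this is the kind of `rope_scaling` block the Qwen2.5 README describes adding to config.json to enable YaRN for long-context use. The values here are illustrative, assuming a 32k base context scaled 4x; the right numbers depend on the specific model:

```json
{
  "rope_scaling": {
    "type": "yarn",
    "factor": 4.0,
    "original_max_position_embeddings": 32768
  }
}
```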

Made with llama.cpp, no modifications to the config.
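Note that llama.cpp can also apply YaRN at load time without touching config.json, via runtime flags such as `--rope-scaling yarn --rope-scale 4 --yarn-orig-ctx 32768` (flag names from llama.cpp's common options; the exact values depend on the model). Without either the config change or those flags, an unmodified conversion would run at the base 32k context.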
