128k version of YARN

by sovetboga - opened Apr 30

Apr 30

Hello, is this model in the same situation as Qwen3? Is it possible to get the 128k version?
https://github.com/THUDM/GLM-4#model-and-prompt-implementation

Unsloth AI org May 1

Hi there, good idea, we'll see what we can do :)

truder

May 1

It would be awesome, looking forward to it.

May 5

Would love to see this!

GLM-4 32b seems a lot more reliable for coding than Qwen 3, it just needs its context extended.

May 19

This would be great!

2 days ago

Yes, so where is the long-context version? It is ridiculous that it is still not available.
