When will GLM4.5 Flash be released

#5
by WilliamKing9 - opened

I find that model in your website, it will be a good model to run locally if it is released.

Screenshot_2025-07-30-08-49-01-81_df198e732186825c8df26e3c5a10d7cd.jpg

Looking forward to it as well

Flash is based on the Air model with inference optimizations, making the inference cost relatively low. It includes constraints on output tokens, with the main purpose of allowing community developers to experience the latest models for free.

Flash is based on the Air model with inference optimizations, making the inference cost relatively low. It includes constraints on output tokens, with the main purpose of allowing community developers to experience the latest models for free.

I'm wondering if there is a plan of releasing a smaller model like the GLM4 9b? or a 16B MoE would be very interesting.

Flash is based on the Air model with inference optimizations, making the inference cost relatively low. It includes constraints on output tokens, with the main purpose of allowing community developers to experience the latest models for free.

I'm wondering if there is a plan of releasing a smaller model like the GLM4 9b? or a 16B MoE would be very interesting.

I also want these models, Qwen3 Coder works well on bigger projects but GLM4.5 make better results on GUI and three.js .

Sign up or log in to comment