When will GLM4.5 Flash be released?
Looking forward to it as well
Flash is based on the Air model with inference optimizations, making the inference cost relatively low. It includes constraints on output tokens, with the main purpose of allowing community developers to experience the latest models for free.
Let's go!!!
I'm wondering if there is a plan to release a smaller model like the GLM4 9B? A 16B MoE would also be very interesting.
I also want these models. Qwen3 Coder works well on bigger projects, but GLM4.5 produces better results for GUI and three.js work.