Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
onekqΒ 
posted an update 7 days ago
Post
477
The Qwen3 235B (MoE) is awfully slow 🐒🐒🐒.

I heard it is able to switch between reasoning and non-reasoning, but for my question, it always goes straight to the reasoning mode without an override switch. I tried Fireworks, DeepInfra, and OpenRouter, and they are all the same.

What is your experience with Qwen3?

You need to use something like /no_think in system prompt or in the user input, isn't it?

Β·

Ah thanks! this works

In this post