Flash 2.5 is a
wonderful
model, as is o3 and the others, but for quick text manipulation, I don't need the thinking or reasoning. They add extra time to process and use extra tokens. Can we have an option to disable thinking on either a per model or (preferably) a per-action basis? For instance, Fix Grammar and Spelling shouldn't need thinking.
For Gemini, setting the thinking budget to 0 will disable thinking altogether: https://ai.google.dev/gemini-api/docs/thinking#set-budget
I'm not sure about the other models, but I think this would be hugely helpful.