Allow Disabling "Thinking/Reasoning" Per Model/Per Action
Harsha K
Flash 2.5 is a wonderful model, as are o3 and the others, but for quick text manipulation I don't need the thinking or reasoning. It adds processing time and uses extra tokens. Can we have an option to disable thinking on either a per-model or (preferably) a per-action basis? For instance, Fix Grammar and Spelling shouldn't need thinking. For Gemini, setting the thinking budget to 0 will disable thinking altogether: https://ai.google.dev/gemini-api/docs/thinking#set-budget
I'm not sure about the other models, but I think this would be hugely helpful.
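To make the Gemini case concrete, here is a rough sketch of what I have in mind. It only builds the request body and doesn't call anything. The set of action names and the helper function are made up for illustration; the `thinkingBudget` field under `generationConfig.thinkingConfig` is my reading of the linked Gemini docs, so treat it as an assumption about the REST payload rather than how the app works today.

```python
import json

# Hypothetical: actions the user has marked as "no thinking needed".
NO_THINKING_ACTIONS = {"Fix Grammar and Spelling"}

def build_gemini_request(action: str, text: str) -> dict:
    """Build a generateContent-style request body for one action."""
    body = {"contents": [{"parts": [{"text": f"{action}: {text}"}]}]}
    if action in NO_THINKING_ACTIONS:
        # Assumed field names from the Gemini thinking docs;
        # a budget of 0 is meant to switch thinking off.
        body["generationConfig"] = {"thinkingConfig": {"thinkingBudget": 0}}
    return body

if __name__ == "__main__":
    request = build_gemini_request("Fix Grammar and Spelling", "teh cat sat")
    print(json.dumps(request, indent=2))
```

Actions that aren't in the set get no `generationConfig` at all, so the model's default thinking behavior is left alone.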
Harsha K
A follow-up request: the ability to set the thinking budget explicitly on a per-action basis. For some actions, I might want the model to think long and hard.
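One way the settings could resolve, as a sketch only: a per-action budget wins over a per-model budget, and if neither is set the provider default applies. All names and numbers below are hypothetical (including the "Deep Analysis" action and the 8192 figure); none of this reflects an existing setting in the app.

```python
from typing import Optional

# Hypothetical user settings: thinking budgets in tokens.
MODEL_BUDGETS = {"gemini-2.5-flash": 0}
ACTION_BUDGETS = {"Fix Grammar and Spelling": 0, "Deep Analysis": 8192}

def resolve_thinking_budget(model: str, action: str) -> Optional[int]:
    """Return the budget to send, or None to keep the provider default."""
    if action in ACTION_BUDGETS:
        # Per-action setting takes precedence over the per-model one.
        return ACTION_BUDGETS[action]
    return MODEL_BUDGETS.get(model)
```

With these example settings, "Deep Analysis" on Flash 2.5 would get a large budget even though the model-level setting is 0, while an action with no setting on an unlisted model returns `None` and is left untouched.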