Skip to content

Effort levels

Effort levels are one-click presets that tune how hard the model thinks before replying. They map to reasoning-effort, max-tokens, and sub-agent budgets in the background, so you don’t have to configure each provider individually.

The selector lives next to the Send button. It applies to the next message only — pick a level, send, and the composer returns to your default.

LevelWhen to useWhat it does under the hood
QuickSimple edits, one-liners, “what does this do?”Minimal reasoning effort, tight max-token budget, no sub-agents. Fastest and cheapest.
BalancedNormal development work — refactors, small features, code review.Default reasoning effort, standard budget, sub-agents allowed but not encouraged.
ThoroughMulti-file features, debugging hard issues, architectural design.High reasoning effort, generous budget, autopilot encouraged to spawn specialists.
  • Claude (Anthropic API & OpenRouter) — adjusts thinking.budget_tokens and max_tokens.
  • GPT-5 / Codex — sets reasoning.effort (low, medium, high).
  • Gemini — adjusts thinking budget on 2.5-series models.
  • Ollama / local models — adjusts context size and sampling where supported.
  • Copilot — maps to the underlying upstream model’s effort controls when the selected Copilot model exposes them.

Your default effort level is per-workspace and lives in workspace settings. The in-chat selector overrides the default for one message at a time.