Models

Zed’s plans offer hosted versions of major LLMs, generally with higher rate limits than you would get using your own API keys. We’re working to expand the models supported by Zed’s subscription offerings, so please check back often.

| Model | Provider | Token Type | Provider Price per 1M tokens | Zed Price per 1M tokens |
| --- | --- | --- | --- | --- |
| Claude Opus 4.1 | Anthropic | Input | $15.00 | $16.50 |
| | Anthropic | Output | $75.00 | $82.50 |
| | Anthropic | Input - Cache Write | $18.75 | $20.625 |
| | Anthropic | Input - Cache Read | $1.50 | $1.65 |
| Claude Sonnet 4.5 | Anthropic | Input | $3.00 | $3.30 |
| | Anthropic | Output | $15.00 | $16.50 |
| | Anthropic | Input - Cache Write | $3.75 | $4.125 |
| | Anthropic | Input - Cache Read | $0.30 | $0.33 |
| Claude Sonnet 4 | Anthropic | Input | $3.00 | $3.30 |
| | Anthropic | Output | $15.00 | $16.50 |
| | Anthropic | Input - Cache Write | $3.75 | $4.125 |
| | Anthropic | Input - Cache Read | $0.30 | $0.33 |
| Claude Sonnet 3.7 | Anthropic | Input | $3.00 | $3.30 |
| | Anthropic | Output | $15.00 | $16.50 |
| | Anthropic | Input - Cache Write | $3.75 | $4.125 |
| | Anthropic | Input - Cache Read | $0.30 | $0.33 |
| GPT-5 | OpenAI | Input | $1.25 | $1.375 |
| | OpenAI | Output | $10.00 | $11.00 |
| | OpenAI | Cached Input | $0.125 | $0.1375 |
| GPT-5 mini | OpenAI | Input | $0.25 | $0.275 |
| | OpenAI | Output | $2.00 | $2.20 |
| | OpenAI | Cached Input | $0.025 | $0.0275 |
| GPT-5 nano | OpenAI | Input | $0.05 | $0.055 |
| | OpenAI | Output | $0.40 | $0.44 |
| | OpenAI | Cached Input | $0.005 | $0.0055 |
| Gemini 2.5 Pro | Google | Input | $1.25 | $1.375 |
| | Google | Output | $10.00 | $11.00 |
| Gemini 2.5 Flash | Google | Input | $0.30 | $0.33 |
| | Google | Output | $2.50 | $2.75 |

Usage

Any usage of a Zed-hosted model will be billed at the Zed Price (rightmost column above). See Plans and Usage for details on Zed's plans and limits for use of hosted models.
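
As a rough illustration of how the Zed Price column translates into a bill, the sketch below estimates the cost of a single request from its token counts. The prices are copied from the Claude Sonnet 4.5 rows of the table above. The function name `estimate_cost` and the dictionary layout are assumptions made for this example and are not part of any Zed API.

```python
from decimal import Decimal

# Zed prices in USD per 1M tokens, copied from the pricing table above
# (Claude Sonnet 4.5). The dictionary layout is hypothetical.
ZED_PRICE_PER_1M = {
    "input": Decimal("3.30"),
    "output": Decimal("16.50"),
    "cache_write": Decimal("4.125"),
    "cache_read": Decimal("0.33"),
}


def estimate_cost(usage: dict[str, int]) -> Decimal:
    """Return the estimated USD cost for token counts keyed by token type."""
    total = Decimal("0")
    for token_type, tokens in usage.items():
        # Prices are quoted per one million tokens, so scale accordingly.
        total += ZED_PRICE_PER_1M[token_type] * tokens / Decimal(1_000_000)
    return total


# Example: a request with 200k input tokens and 10k output tokens.
cost = estimate_cost({"input": 200_000, "output": 10_000})
print(f"Estimated cost: ${cost:.4f}")
```

`Decimal` is used instead of floats so that small per-token prices add up without rounding drift. Actual billing may differ from this estimate; Plans and Usage remains the authoritative source.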

We encourage you to think through which model best fits your needs before leaving the Agent Panel to work unattended. All LLMs can "spiral" and occasionally enter unending loops that require user intervention.

Context Windows

A context window is the maximum span of text and code an LLM can consider at once, including both the input prompt and output generated by the model.

| Model | Provider | Zed-Hosted Context Window |
| --- | --- | --- |
| Claude Opus 4.1 | Anthropic | 200k |
| Claude Sonnet 4 | Anthropic | 200k |
| Claude Sonnet 3.7 | Anthropic | 200k |
| GPT-5 | OpenAI | 400k |
| GPT-5 mini | OpenAI | 400k |
| GPT-5 nano | OpenAI | 400k |
| Gemini 2.5 Pro | Google | 200k |
| Gemini 2.5 Flash | Google | 200k |

We're planning on expanding supported context windows for hosted Sonnet 4 and Gemini 2.5 Pro/Flash in the near future. Stay tuned!

Each Agent thread and text thread in Zed maintains its own context window. The more prompts, attached files, and responses a session includes, the more of that window it consumes.
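
To make the idea concrete, the sketch below tracks how much of a thread's context window has been used as messages accumulate. The window sizes come from the table above, reading "200k" as 200,000 tokens and "400k" as 400,000 tokens, which is an assumption. The `ThreadBudget` class and the token counts are hypothetical and purely illustrative. Real token counts depend on each model's tokenizer.

```python
from dataclasses import dataclass

# Zed-hosted context windows in tokens, from the table above
# (assuming "200k" means 200,000 and "400k" means 400,000).
CONTEXT_WINDOW = {
    "Claude Sonnet 4": 200_000,
    "GPT-5": 400_000,
    "Gemini 2.5 Pro": 200_000,
}


@dataclass
class ThreadBudget:
    """Hypothetical tracker for one thread's context usage."""

    model: str
    used: int = 0

    def add(self, tokens: int) -> None:
        # Prompts, attached files, and responses all count toward the window.
        self.used += tokens

    @property
    def remaining(self) -> int:
        return CONTEXT_WINDOW[self.model] - self.used

    def fits(self, tokens: int) -> bool:
        # True if another message of this size still fits in the window.
        return tokens <= self.remaining


thread = ThreadBudget("Claude Sonnet 4")
thread.add(120_000)  # e.g. several attached files
thread.add(50_000)   # e.g. prompts and responses so far
print(thread.remaining, thread.fits(40_000))
```

Because each thread has its own budget, a fresh thread starts with the full window available again.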

For best results, we recommend a purpose-based approach to Agent thread management: start a new thread for each unique task.

Tool Calls

Models can use tools to interface with your code, search the web, and perform other useful functions.