Max input tokens · converted to words & pages
Compare every major model's context window — in tokens and in plain pages of text — so you can see what actually fits in a single prompt.
Largest here: Gemini 2.5 Flash at 1,048.576k tokens — roughly 786,432 words, or about 1,573 pages of text in a single prompt.
Context = max input tokens. ~0.75 words and ~500 words/page used for the human-readable estimates. Updated 2026-05-22 (LiteLLM table).
Each bar shows a model's maximum input in tokens, with a plain-English page estimate (~500 words per page) so the number means something.
If you feed whole contracts or codebases, you need a large window (or a chunking strategy). For short prompts, a small window is fine and often cheaper.
Context is shared between your input and the model's reply. A 200k window doesn't mean 200k of input if you also want a long answer.
Very long contexts can degrade accuracy and raise cost. Often retrieval (RAG) beats stuffing everything into one giant prompt.
A free tool gives you a hypothesis. The 30-minute diagnostic is where we pressure-test it against your actual workflows — and decide whether the project is worth building, buying, or skipping.