Free tool · No signup

Free Page Token Inspector

Enter any URL and see the page the way an AI engine reads it: estimated token count, what share of a context window it consumes, the focus words that dominate, and the stripped-down text AI engines actually extract.

Loading…

Free · Unlimited checks · No signup required

How it works

Step 1

Enter a URL

Any public page — an article, a landing page, a doc.

Step 2

We strip it to text

Markup, navigation, scripts, and styles removed — just the extractable text an AI engine works with.

Step 3

Token + window math

Estimated tokens, word and character counts, and fill percentages for 128K, 200K, and 1M context windows.

Step 4

Read the signals

Top focus words show what the page is actually 'about' to an engine — plus a thin-content flag under 300 words.

Why it matters

AI engines read tokens, not pages — and tokens are a budget.

When an AI engine considers your page for an answer, it loads the extractable text into a fixed-size context window measured in tokens. A bloated page competes with every other source for that budget; a lean, well-structured one fits cheaply and fully. Knowing your page's token footprint tells you how it behaves inside that constraint.

What survives extraction is all that exists.

Engines don't see your design, your JavaScript widgets, or your carefully styled hero section — they see the visible text after markup is stripped. If the stripped version is mostly navigation labels and footer links, that's what your page 'says' to an AI engine. The preview in this tool shows exactly that version, which is often a wake-up call.

Thin pages don't get cited.

A page with under roughly 300 words of extractable text rarely contains enough substance for an AI engine to quote or cite — there's simply not enough material to extract an answer from. If your important pages come back thin, the fix is real prose: definitions, explanations, FAQs, and data that an engine can lift directly into an answer.

With Meev

Meev writes pages that are already shaped for extraction.

Token-efficient structure, answer-first prose, and enough substance to be citable — every article Meev publishes to your domain is built for how AI engines actually read. Then visibility tracking shows whether those pages earn citations across every major AI search surface.

  • Auto-published articles with extraction-friendly structure and real depth
  • Answer-first formatting that AI engines can lift directly
  • Visibility tracking across every major AI search surface

Frequently asked

How accurate is the token estimate?

It's an estimate based on the standard heuristic of roughly 4 characters per token for English text — expect it to be within about ±10% of what an AI engine's tokenizer would report. That's accurate enough for the decisions this tool informs: comparing pages, spotting bloat, and judging context-window fit.

What is a context window?

A context window is the maximum amount of text — measured in tokens — an AI model can consider at once. Common sizes today are 128K, 200K, and 1M tokens. Everything the model uses to answer, including your page, must fit inside it, so pages that consume less of the window are cheaper for engines to use fully.

Why does the tool strip my page down to plain text?

Because that's what AI engines do before reading it. HTML markup, scripts, styles, and hidden elements are removed; what remains is the extractable text the engine reasons over. The preview shows you that exact version — if key content is missing from it, engines never see that content either.

What are focus words and why do they matter?

Focus words are the most frequent meaningful words on your page after common words like 'the' and 'with' are filtered out. They're a quick proxy for what the page is topically 'about' from an engine's perspective. If your target topic isn't represented in the top focus words, the page probably reads as being about something else.

Why is under 300 words flagged as thin?

Pages with very little extractable text rarely get cited because there isn't enough substance to extract an answer from. Around 300 words is a practical floor — below it, a page is usually a stub, a redirect shell, or a JavaScript app whose content never made it into the HTML. Either way, it's invisible as a source.

Does a lower token count always mean a better page?

No — the goal is substance per token, not minimalism. A 50,000-token page padded with boilerplate is wasteful, but a 200-token page has nothing to cite. The strongest pages for AI search pack real, answer-shaped information densely: enough depth to be a credible source, without bloat that buries it.

Stop fixing pages one at a time.

Meev tracks your visibility across every major AI search surface and publishes quality-gated content that earns citations — automatically.

Card required, no charge until day 8. Cancel anytime.

More free tools

View all →