Free tool · No signup

Free XML Sitemap Validator

Enter your domain and we find your sitemap.xml, lint it against the sitemap spec — structure, absolute URLs, size limits, lastmod dates — and spot-check a sample of listed URLs to confirm they actually resolve. Paste an exact sitemap URL if yours lives somewhere custom.

Loading…

Free · Unlimited checks · No signup required

How it works

Step 1

Enter a domain or sitemap URL

A bare domain checks /sitemap.xml; paste the full URL if your sitemap lives elsewhere.

Step 2

We fetch and parse it

Sitemap index files are detected automatically — we list the children and validate the first one.

Step 3

Structure is linted

Missing <loc> elements, relative URLs, cross-host entries, invalid dates, and the 50,000-URL / 50MB spec limits.

Step 4

URLs are spot-checked

A sample of listed URLs is requested to confirm they resolve — a sitemap full of 404s actively hurts.

Why it matters

Your sitemap is the crawl map you hand to every engine.

Search and AI crawlers discover most pages by following links, but the sitemap is the one place you explicitly declare everything you want crawled — including new pages nothing links to yet and deep pages crawlers rarely reach. A broken or stale sitemap means your freshest content waits to be discovered by accident instead of by invitation.

An invalid sitemap fails silently.

Engines don't email you when your sitemap has malformed XML, relative URLs, or entries pointing at the wrong host — they just skip what they can't parse. A single structural error can quietly void hundreds of entries. Validation is the only way to know the file you're serving is the file engines are actually able to use.

Accurate lastmod dates earn you faster recrawls.

Google has confirmed it uses the lastmod field — when it's trustworthy — to prioritize recrawling pages that genuinely changed. Sites that stamp every URL with today's date burn that trust and lose the benefit; sites with honest lastmod values get updated content re-indexed faster. It's one of the few crawl-priority levers you directly control.

With Meev

Meev keeps your content pipeline sitemap-ready by default.

A sitemap only stays valid if publishing keeps it valid. Articles published through Meev ship with clean canonical URLs and complete metadata, so the pages your sitemap lists are pages engines actually want to crawl, index, and cite.

  • Every published article has a clean, absolute, canonical URL from day one
  • Content updates carry honest modification signals engines can trust
  • Visibility tracking confirms the pages are getting found — not just listed

Frequently asked

Where should my sitemap live?

The convention is /sitemap.xml at the root of your domain — that's where this tool (and many crawlers) look first. It can technically live anywhere, but then you must declare it: add a 'Sitemap: https://example.com/path/to/sitemap.xml' line to your robots.txt and submit the URL in Google Search Console and Bing Webmaster Tools.

What are the sitemap size limits?

A single sitemap file may contain at most 50,000 URLs and weigh at most 50MB uncompressed. Bigger sites split their URLs across multiple sitemap files and list them in a sitemap index. Exceed either limit and engines may truncate or reject the file entirely.

Does the lastmod date actually matter?

Yes — Google has stated it uses lastmod to prioritize recrawling, but only when the values are credible. Set lastmod to the date the content meaningfully changed, in YYYY-MM-DD or full ISO 8601 format. If you stamp every URL with the current date on every deploy, engines learn to ignore your lastmod entirely and you lose the recrawl benefit.

What is a sitemap index file?

A sitemap of sitemaps: instead of <url> entries it contains <sitemap> entries, each pointing at a child sitemap file. It's how large sites stay under the 50,000-URL limit — split URLs across many files, list those files in one index, and submit just the index. This tool detects index files automatically and validates the first child.

Does a sitemap guarantee my pages get indexed?

No. A sitemap is a hint, not a command — it tells engines which URLs exist and when they changed, but engines still decide independently whether each page is worth crawling and indexing based on its quality, uniqueness, and the rest of your site's signals. A sitemap gets you discovered faster; it doesn't get you ranked.

Why does the tool spot-check URLs from my sitemap?

Because a structurally perfect sitemap full of dead URLs is worse than no sitemap. Engines that repeatedly fetch sitemap entries and hit 404s or server errors learn to distrust the whole file. We request a small, evenly-spaced sample of the listed URLs and flag any that fail, so you catch rot before crawlers do.

Stop fixing pages one at a time.

Meev tracks your visibility across every major AI search surface and publishes quality-gated content that earns citations — automatically.

Card required, no charge until day 8. Cancel anytime.

More free tools

View all →