Orkkid

Technical SEO

Sitemap

Also known as: XML Sitemap

A sitemap is a file (usually XML) that lists the important URLs on your website along with metadata about each one. It helps search engines and AI crawlers discover and prioritize your content.

What it means

A sitemap is a file at a known location (typically /sitemap.xml) that lists the URLs on a website along with optional metadata such as last modified date, change frequency, and priority. The XML format is the standard, defined by the sitemaps.org protocol.

For a small site, a single sitemap file is sufficient. For larger sites, multiple sitemap files are organized under a sitemap index.

Why it matters

Search engines discover pages primarily through links, but a sitemap provides a clean, complete list as a backup. Without a sitemap, isolated pages with few internal links can take much longer to be indexed.

For AEO purposes, AI crawlers including ClaudeBot and PerplexityBot often consult sitemaps to understand site structure. A clean sitemap, paired with a clean llms.txt, improves the odds of full content discovery.

How it's used

Best practices for sitemaps in 2026:

  • Include only canonical URLs (no duplicates, no noindexed pages)
  • Update lastmod accurately on real content changes
  • Submit the sitemap to Google Search Console and Bing Webmaster Tools
  • Reference the sitemap in your robots.txt file (Sitemap: https://orkkid.com/sitemap.xml)
  • Keep it under 50,000 URLs and 50 MB; split across multiple files if larger
  • Generate it programmatically so it stays fresh as content changes

Most modern frameworks (including Next.js) generate sitemaps automatically.

Want help applying this to your business?

Book a free AI citation audit. We run thirty prompts across ChatGPT, Perplexity, Gemini, and Google AI Overviews and send a personalised report within 72 hours.

Get a free citation audit