AegisSitemap Master User Guide

AegisSitemap User Guide

Version 1.2.15 • Updated February 09, 2026

About this guide

This is the master user guide for AegisSitemap, the Aegisify XML sitemap and robots.txt manager for WordPress. It covers how to enable sitemap output, control included content types, manage exclusions, publish optional HTML/video/news sitemaps, and configure robots.txt safely.

NOTE: This guide is maintained as an active, continuously improved resource. As AegisBackup evolves, new capabilities, interface refinements, and optimization recommendations may be introduced. To ensure accuracy and reliability, instructions, definitions, and screenshots may be updated periodically. Always refer to the latest version of this guide for the most up-to-date information and recommended workflows.

Audience and scope

Primary audience

  • WordPress site owners and administrators responsible for indexing and crawl health
  • SEO and marketing teams who need reliable sitemap endpoints and clean robots rules
  • Technical admins who require safe defaults, exclusions, and operational visibility

In scope

  • XML sitemap output (/sitemap.xml and /sitemap_index.xml)
  • Post type and taxonomy inclusion
  • Folder exclusions
  • HTML sitemap listing
  • robots.txt management (virtual + physical file options)

Out of scope

  • Teaching search engine indexing fundamentals; this guide focuses on AegisSitemap features and workflows
  • Replacing full SEO suites (AegisSitemap focuses on sitemaps + robots controls)

Safety, limitations, and responsibilities

Important: robots.txt changes can affect crawling and indexing. Always review the Robots.txt Preview before saving, and avoid blocking critical content unintentionally.
  • Change control: Make changes gradually and re-check endpoints after each save.
  • Hosting limits: Very large sites should keep sitemap files smaller and rely on index splitting to avoid timeouts.
  • robots.txt limitations: robots.txt is not a security tool; it guides crawlers but does not protect private content.

Core concepts

Sitemap endpoints

  • XML sitemap: a crawler-friendly list of URLs (with optional images metadata).
  • Sitemap index: a master file that references multiple smaller sitemap files for scale.
  • Child sitemaps: split by content type (posts, pages, taxonomies) to stay within limits.

robots.txt management

  • Virtual robots.txt: served through WordPress without writing a root file.
  • Physical robots.txt: written to the site root (ABSPATH) when your host allows it.
  • Optimization defaults: safe disallow patterns that protect crawl budget.

Quick start paths

Publish a compliant XML sitemap (10–15 minutes)

  1. Open AegisSitemapSitemap
  2. Enable XML sitemap output
  3. Enable /sitemap_index.xml for scale
  4. Open the listed sitemap URLs to confirm they load

Configure robots.txt safely (10–15 minutes)

  1. Open AegisSitemapRobots
  2. Enable robots.txt management
  3. Apply safe default optimizations
  4. Review Robots.txt Preview, then Save

Exclude private directories (5–10 minutes)

  1. In the Sitemap tab, use Folder Exclusions to exclude path fragments
  2. In the Robots tab, add directory exclusions if you want crawlers guided away from those paths
  3. Re-check sitemap endpoints to confirm excluded URLs are removed

Features

Enable XML Sitemap

Where: WP Admin → AegisSitemap → Sitemap → Enable Sitemap.

Enables dynamic XML sitemap generation for your WordPress site. When enabled, AegisSitemap generates sitemap endpoints on-demand and updates them automatically when content changes.

Enable XML Sitemap

Configuration

  1. Enable XML sitemap output.
  2. Confirm your Sitemap URL and Sitemap Index URL are visible on the left panel.
  3. Open the listed sitemap endpoints to verify they load (HTTP 200).

Sitemap Index

Where: WP Admin → AegisSitemap → Sitemap → Sitemap Index.

Generates a sitemap index file that references multiple child sitemaps. Content is split into smaller sitemap files to stay within search engine limits and improve crawl efficiency.

Sitemap Index

Configuration

  1. Enable /sitemap_index.xml if your site has more than a few hundred URLs.
  2. Open the index URL and confirm it lists child sitemap files (posts, pages, taxonomies).

Maximum URLs Per Sitemap

Where: WP Admin → AegisSitemap → Sitemap → Max URLs per Sitemap.

Controls the number of URLs included in each sitemap file. Once the limit is reached, a new child sitemap is created automatically.

Maximum URLs Per Sitemap

Configuration

  1. Set a max URL count per sitemap file (defaults are safe for most sites).
  2. If you have a large site, keep files smaller to reduce crawl time and memory spikes.

Include Attached Images

Where: WP Admin → AegisSitemap → Sitemap → Attached Images.

Includes media attachments referenced in posts and pages. Attachment URLs are added as additional sitemap entries.

Configuration

  1. Enable attached images only if you need deeper image coverage (large sites may crawl slower).
  2. Prefer featured images for performance on high‑volume sites.

Video Sitemap

Where: WP Admin → AegisSitemap → Sitemap → Video Sitemap.

Generates a video-specific sitemap when video content is detected. Video metadata is exposed in a dedicated sitemap endpoint.

Video Sitemap

Configuration

  1. Enable /video-sitemap.xml if you publish embedded/hosted video and want video discovery.
  2. Validate the endpoint loads and your content actually contains video metadata.

News Sitemap

Where: WP Admin → AegisSitemap → Sitemap → News Sitemap.

Enables Google News–compatible sitemap output. Recent news-type content is added to a dedicated news sitemap.

News Sitemap

Configuration

  1. Enable /news-sitemap.xml only if your content is eligible for Google News and your site runs a news workflow.
  2. Confirm the endpoint loads and that included content matches your news criteria.

HTML Sitemap

Where: WP Admin → AegisSitemap → Sitemap → HTML Sitemap.

Creates a human-readable HTML sitemap. A public HTML page lists indexed content for users and crawlers.

HTML Sitemap

Configuration

  1. Enable the HTML sitemap listing if you want a human-friendly sitemap page (and optional internal navigation hub).
  2. Use the displayed HTML Sitemap URL in menus or footers if desired.

Folder Exclusions

Where: WP Admin → AegisSitemap → Sitemap → Folder Exclusions.

Excludes specific URL paths from sitemap output. Any URL containing a defined fragment is skipped during sitemap generation.

Configuration

  1. Add one folder/path fragment per line in Folder Exclusions.
  2. Use exclusions to keep private folders, temp uploads, or sensitive paths out of sitemap URLs.
  3. Save and re-check a sitemap file to confirm excluded paths no longer appear.

Post Type Inclusion

Where: WP Admin → AegisSitemap → Sitemap → Post Types.

Controls which post types appear in sitemaps. Only selected post types are queried and included.

Configuration

  1. Select which post types should appear in the sitemap (Posts, Pages, and other supported types).
  2. If you run WooCommerce, product-related types can be included via Plugins or Post Types settings.
  3. Save and confirm the generated endpoints list the expected post types.

Taxonomy Inclusion

Where: WP Admin → AegisSitemap → Sitemap → Taxonomies.

Controls which taxonomies are indexed. Selected taxonomies generate their own sitemap endpoints.

Configuration

  1. Select which taxonomies to include (Categories, Tags, etc.).
  2. If you don’t want taxonomy archives indexed, disable them here rather than relying on robots alone.

Ping Search Engines on Publish

Where: WP Admin → AegisSitemap → Sitemap → Ping on Publish.

Automatically notifies search engines when content is updated. Sitemap ping requests are sent to supported engines.

Configuration

  1. Enable ping on publish if you want faster discovery when new content goes live.
  2. Use responsibly; high-frequency sites should avoid aggressive pinging.

Enable Robots.txt Management

Where: WP Admin → AegisSitemap → Robots → Enable Robots.

Enables robots.txt control inside WordPress. Robots rules are generated dynamically and served virtually or physically.

Enable Robots.txt Management

Configuration

  1. Enable robots.txt management in the Robots tab.
  2. Review the Robots.txt Preview panel before writing a physical file.

Write Physical robots.txt File

Where: WP Admin → AegisSitemap → Robots → Write robots.txt.

Writes a robots.txt file to the site root. A physical file is created when filesystem permissions allow.

Write Physical robots.txt File

Configuration

  1. Enable “Write a physical robots.txt file to the site root (ABSPATH)” if your host allows writes.
  2. Check the Robots.txt File Status box to confirm it’s writable and whether a file exists.

Robots Optimization

Where: WP Admin → AegisSitemap → Robots → Optimization.

Applies safe default crawl optimizations. Blocks low-value URLs based on Google best practices.

Robots Optimization

Configuration

  1. Enable safe default optimizations to protect crawl budget (recommended for most sites).
  2. Review excluded directories and adjust only if you have a specific requirement.

Robots Directory Exclusions

Where: WP Admin → AegisSitemap → Robots → Excluded Directories.

Blocks directories from crawler access. Disallow rules are generated for each defined path.

Robots Directory Exclusions

Configuration

  1. Add directory patterns to disallow (one item per entry).
  2. Keep /wp-admin/ and common system paths disallowed as a baseline.

Custom Robots Rules

Where: WP Admin → AegisSitemap → Robots → Custom Rules.

Allows manual robots.txt directives. Custom rules are appended after generated rules.

Custom Robots Rules

Configuration

  1. Enable “Append custom robots.txt rules” if you need advanced directives.
  2. Paste rules into the custom block and validate syntax in the Preview panel before saving.

AegisSitemap: Settings and stored options (inventory)

Advanced Where: WP Admin → AegisSitemap

Important: This inventory map reflects stored options and operational signals used for diagnostics; change settings via the UI, not by editing database options directly.

Configuration

  1. Use Tools/License to validate license status (if applicable).
  2. Use the inventory list to confirm which features are enabled and which endpoints are generated.

Operational checklists

Pre-launch sitemap checklist

  1. Enable XML sitemap output and sitemap index.
  2. Confirm Post Types and Taxonomies match your indexing goals.
  3. Set a safe max URLs per sitemap (smaller for large sites).
  4. Confirm sitemap endpoints return HTTP 200 and contain expected URLs.
  5. If adding images, verify entries exist only when applicable.

Robots safety checklist

  1. Enable robots.txt management and review Preview.
  2. Keep baseline disallows for admin/system paths.
  3. Do not block key content folders unless intentional.
  4. If writing a physical file, confirm file status is writable.
  5. After changes, test /robots.txt in a browser.

Troubleshooting

Symptom: Sitemap URL returns 404

Likely meaning: sitemap output is disabled, permalinks/rewrite rules are not flushed, or a security/caching layer is blocking the endpoint.

  1. Open AegisSitemapSitemap and confirm XML output is enabled.
  2. Confirm sitemap endpoints listed in the UI match what you’re opening.
  3. Save permalinks (Settings → Permalinks → Save) to flush rewrite rules.
  4. Temporarily disable aggressive caching/WAF rules for /sitemap* paths and retry.

Symptom: robots.txt does not update

Likely meaning: robots manager is disabled, or the host blocks writing to the site root.

  1. Enable robots.txt management in the Robots tab and Save.
  2. Check Robots.txt File Status to see if a physical file exists and is writable.
  3. If the host blocks root writes, rely on the virtual robots.txt served by WordPress.

Symptom: Content type is missing from the sitemap index

Likely meaning: the post type/taxonomy is disabled, has zero published items, or is excluded by rules.

  1. Confirm the Post Types and Taxonomies checkboxes include the missing item.
  2. Confirm you have published content for that type.
  3. Review Folder Exclusions and remove any accidental exclusion fragments.
  4. Reload /sitemap_index.xml and verify the endpoint is now listed.

Glossary

TermMeaning in AegisSitemap
XML sitemapA machine-readable list of URLs for crawlers.
Sitemap indexA master file (/sitemap_index.xml) that references multiple child sitemaps.
Child sitemapA sitemap file generated for a specific content group (posts/pages/taxonomies).
Folder exclusionA path fragment used to omit matching URLs from sitemap outputs.
robots.txtA crawler directive file served at /robots.txt.
Virtual robots.txtrobots.txt served through WordPress without writing a root file.
Physical robots.txtrobots.txt written to the site root (ABSPATH) when writable.
Crawl budgetThe amount of crawling effort search engines allocate to your site.