AegisSitemap User Guide
About this guide
This is the master user guide for AegisSitemap, the Aegisify XML sitemap and robots.txt manager for WordPress. It covers how to enable sitemap output, control included content types, manage exclusions, publish optional HTML/video/news sitemaps, and configure robots.txt safely.
Audience and scope
Primary audience
- WordPress site owners and administrators responsible for indexing and crawl health
- SEO and marketing teams who need reliable sitemap endpoints and clean robots rules
- Technical admins who require safe defaults, exclusions, and operational visibility
In scope
- XML sitemap output (
/sitemap.xmland/sitemap_index.xml) - Post type and taxonomy inclusion
- Folder exclusions
- HTML sitemap listing
- robots.txt management (virtual + physical file options)
Out of scope
- Teaching search engine indexing fundamentals; this guide focuses on AegisSitemap features and workflows
- Replacing full SEO suites (AegisSitemap focuses on sitemaps + robots controls)
Safety, limitations, and responsibilities
- Change control: Make changes gradually and re-check endpoints after each save.
- Hosting limits: Very large sites should keep sitemap files smaller and rely on index splitting to avoid timeouts.
- robots.txt limitations: robots.txt is not a security tool; it guides crawlers but does not protect private content.
Core concepts
Sitemap endpoints
- XML sitemap: a crawler-friendly list of URLs (with optional images metadata).
- Sitemap index: a master file that references multiple smaller sitemap files for scale.
- Child sitemaps: split by content type (posts, pages, taxonomies) to stay within limits.
robots.txt management
- Virtual robots.txt: served through WordPress without writing a root file.
- Physical robots.txt: written to the site root (ABSPATH) when your host allows it.
- Optimization defaults: safe disallow patterns that protect crawl budget.
Quick start paths
Publish a compliant XML sitemap (10–15 minutes)
- Open AegisSitemap → Sitemap
- Enable XML sitemap output
- Enable
/sitemap_index.xmlfor scale - Open the listed sitemap URLs to confirm they load
Configure robots.txt safely (10–15 minutes)
- Open AegisSitemap → Robots
- Enable robots.txt management
- Apply safe default optimizations
- Review Robots.txt Preview, then Save
Exclude private directories (5–10 minutes)
- In the Sitemap tab, use Folder Exclusions to exclude path fragments
- In the Robots tab, add directory exclusions if you want crawlers guided away from those paths
- Re-check sitemap endpoints to confirm excluded URLs are removed
Features
Enable XML Sitemap
Where: WP Admin → AegisSitemap → Sitemap → Enable Sitemap.
Enables dynamic XML sitemap generation for your WordPress site. When enabled, AegisSitemap generates sitemap endpoints on-demand and updates them automatically when content changes.

Configuration
- Enable XML sitemap output.
- Confirm your Sitemap URL and Sitemap Index URL are visible on the left panel.
- Open the listed sitemap endpoints to verify they load (HTTP 200).
Sitemap Index
Where: WP Admin → AegisSitemap → Sitemap → Sitemap Index.
Generates a sitemap index file that references multiple child sitemaps. Content is split into smaller sitemap files to stay within search engine limits and improve crawl efficiency.

Configuration
- Enable
/sitemap_index.xmlif your site has more than a few hundred URLs. - Open the index URL and confirm it lists child sitemap files (posts, pages, taxonomies).
Maximum URLs Per Sitemap
Where: WP Admin → AegisSitemap → Sitemap → Max URLs per Sitemap.
Controls the number of URLs included in each sitemap file. Once the limit is reached, a new child sitemap is created automatically.

Configuration
- Set a max URL count per sitemap file (defaults are safe for most sites).
- If you have a large site, keep files smaller to reduce crawl time and memory spikes.
Include Featured Images
Where: WP Admin → AegisSitemap → Sitemap → Images.
Adds featured images to XML sitemap entries. Image metadata is appended to sitemap URLs to improve image discovery.
Configuration
- Enable featured images in XML output if your theme uses featured images for posts/pages.
- Re-check a sitemap file and confirm
entries appear where applicable.
Include Attached Images
Where: WP Admin → AegisSitemap → Sitemap → Attached Images.
Includes media attachments referenced in posts and pages. Attachment URLs are added as additional sitemap entries.
Configuration
- Enable attached images only if you need deeper image coverage (large sites may crawl slower).
- Prefer featured images for performance on high‑volume sites.
Video Sitemap
Where: WP Admin → AegisSitemap → Sitemap → Video Sitemap.
Generates a video-specific sitemap when video content is detected. Video metadata is exposed in a dedicated sitemap endpoint.

Configuration
- Enable
/video-sitemap.xmlif you publish embedded/hosted video and want video discovery. - Validate the endpoint loads and your content actually contains video metadata.
News Sitemap
Where: WP Admin → AegisSitemap → Sitemap → News Sitemap.
Enables Google News–compatible sitemap output. Recent news-type content is added to a dedicated news sitemap.

Configuration
- Enable
/news-sitemap.xmlonly if your content is eligible for Google News and your site runs a news workflow. - Confirm the endpoint loads and that included content matches your news criteria.
HTML Sitemap
Where: WP Admin → AegisSitemap → Sitemap → HTML Sitemap.
Creates a human-readable HTML sitemap. A public HTML page lists indexed content for users and crawlers.

Configuration
- Enable the HTML sitemap listing if you want a human-friendly sitemap page (and optional internal navigation hub).
- Use the displayed HTML Sitemap URL in menus or footers if desired.
Folder Exclusions
Where: WP Admin → AegisSitemap → Sitemap → Folder Exclusions.
Excludes specific URL paths from sitemap output. Any URL containing a defined fragment is skipped during sitemap generation.
Configuration
- Add one folder/path fragment per line in Folder Exclusions.
- Use exclusions to keep private folders, temp uploads, or sensitive paths out of sitemap URLs.
- Save and re-check a sitemap file to confirm excluded paths no longer appear.
Post Type Inclusion
Where: WP Admin → AegisSitemap → Sitemap → Post Types.
Controls which post types appear in sitemaps. Only selected post types are queried and included.
Configuration
- Select which post types should appear in the sitemap (Posts, Pages, and other supported types).
- If you run WooCommerce, product-related types can be included via Plugins or Post Types settings.
- Save and confirm the generated endpoints list the expected post types.
Taxonomy Inclusion
Where: WP Admin → AegisSitemap → Sitemap → Taxonomies.
Controls which taxonomies are indexed. Selected taxonomies generate their own sitemap endpoints.
Configuration
- Select which taxonomies to include (Categories, Tags, etc.).
- If you don’t want taxonomy archives indexed, disable them here rather than relying on robots alone.
Ping Search Engines on Publish
Where: WP Admin → AegisSitemap → Sitemap → Ping on Publish.
Automatically notifies search engines when content is updated. Sitemap ping requests are sent to supported engines.
Configuration
- Enable ping on publish if you want faster discovery when new content goes live.
- Use responsibly; high-frequency sites should avoid aggressive pinging.
Enable Robots.txt Management
Where: WP Admin → AegisSitemap → Robots → Enable Robots.
Enables robots.txt control inside WordPress. Robots rules are generated dynamically and served virtually or physically.

Configuration
- Enable robots.txt management in the Robots tab.
- Review the Robots.txt Preview panel before writing a physical file.
Write Physical robots.txt File
Where: WP Admin → AegisSitemap → Robots → Write robots.txt.
Writes a robots.txt file to the site root. A physical file is created when filesystem permissions allow.

Configuration
- Enable “Write a physical robots.txt file to the site root (ABSPATH)” if your host allows writes.
- Check the Robots.txt File Status box to confirm it’s writable and whether a file exists.
Robots Optimization
Where: WP Admin → AegisSitemap → Robots → Optimization.
Applies safe default crawl optimizations. Blocks low-value URLs based on Google best practices.

Configuration
- Enable safe default optimizations to protect crawl budget (recommended for most sites).
- Review excluded directories and adjust only if you have a specific requirement.
Robots Directory Exclusions
Where: WP Admin → AegisSitemap → Robots → Excluded Directories.
Blocks directories from crawler access. Disallow rules are generated for each defined path.

Configuration
- Add directory patterns to disallow (one item per entry).
- Keep
/wp-admin/and common system paths disallowed as a baseline.
Custom Robots Rules
Where: WP Admin → AegisSitemap → Robots → Custom Rules.
Allows manual robots.txt directives. Custom rules are appended after generated rules.

Configuration
- Enable “Append custom robots.txt rules” if you need advanced directives.
- Paste rules into the custom block and validate syntax in the Preview panel before saving.
AegisSitemap: Settings and stored options (inventory)
Advanced Where: WP Admin → AegisSitemap
Configuration
- Use Tools/License to validate license status (if applicable).
- Use the inventory list to confirm which features are enabled and which endpoints are generated.
Operational checklists
Pre-launch sitemap checklist
- Enable XML sitemap output and sitemap index.
- Confirm Post Types and Taxonomies match your indexing goals.
- Set a safe max URLs per sitemap (smaller for large sites).
- Confirm sitemap endpoints return HTTP 200 and contain expected URLs.
- If adding images, verify
entries exist only when applicable.
Robots safety checklist
- Enable robots.txt management and review Preview.
- Keep baseline disallows for admin/system paths.
- Do not block key content folders unless intentional.
- If writing a physical file, confirm file status is writable.
- After changes, test
/robots.txtin a browser.
Troubleshooting
Symptom: Sitemap URL returns 404
Likely meaning: sitemap output is disabled, permalinks/rewrite rules are not flushed, or a security/caching layer is blocking the endpoint.
- Open AegisSitemap → Sitemap and confirm XML output is enabled.
- Confirm sitemap endpoints listed in the UI match what you’re opening.
- Save permalinks (Settings → Permalinks → Save) to flush rewrite rules.
- Temporarily disable aggressive caching/WAF rules for
/sitemap*paths and retry.
Symptom: robots.txt does not update
Likely meaning: robots manager is disabled, or the host blocks writing to the site root.
- Enable robots.txt management in the Robots tab and Save.
- Check Robots.txt File Status to see if a physical file exists and is writable.
- If the host blocks root writes, rely on the virtual robots.txt served by WordPress.
Symptom: Content type is missing from the sitemap index
Likely meaning: the post type/taxonomy is disabled, has zero published items, or is excluded by rules.
- Confirm the Post Types and Taxonomies checkboxes include the missing item.
- Confirm you have published content for that type.
- Review Folder Exclusions and remove any accidental exclusion fragments.
- Reload
/sitemap_index.xmland verify the endpoint is now listed.
Glossary
| Term | Meaning in AegisSitemap |
|---|---|
| XML sitemap | A machine-readable list of URLs for crawlers. |
| Sitemap index | A master file (/sitemap_index.xml) that references multiple child sitemaps. |
| Child sitemap | A sitemap file generated for a specific content group (posts/pages/taxonomies). |
| Folder exclusion | A path fragment used to omit matching URLs from sitemap outputs. |
| robots.txt | A crawler directive file served at /robots.txt. |
| Virtual robots.txt | robots.txt served through WordPress without writing a root file. |
| Physical robots.txt | robots.txt written to the site root (ABSPATH) when writable. |
| Crawl budget | The amount of crawling effort search engines allocate to your site. |

