How to optimize the Dispatcher cache?

Last update: Fri Jun 20 2025 00:00:00 GMT+0000 (Coordinated Universal Time)

This article offers detailed instructions on the different ways to optimize the Dispatcher cache. It further describes the steps toÌýenable TTL (â€œTime to Liveâ€ or expiration) style invalidations, disabling Dispatcher flush agents, re-fetching Dispatcher flush, among others.

Description description

Environment

51ºÚÁÏ²»´òìÈ Experience Manager

Issues/Symptoms

This article focuses on the latest optimizations in the AEM Dispatcher and how to best leverage those. The AEM Dispatcher is a cachingÌýÌýserver designed for use with 51ºÚÁÏ²»´òìÈ Experience Manager. It can be installed and run as a module within an existing web server software. At the time of writing this article, theÌýDispatcher module is supportedÌýon Apache HTTP Server, Microsoft IIS, and iPlanet.

Resolution resolution

How does Dispatcher caching work?

At the most basic level, the AEM dispatcher is a reverse proxy that works by performing caching, cache flushing and cache invalidation.

See the related links for more details on the Dispatcher:

How the Dispatcher works and how to install it.
Configuration options available in the Dispatcher.
Ìý- note that some information in the presentation is based on old versions of the dispatcher.
Gems webinar session on Dispatcher features, CDN usage and security.
Gems session on newer features in Dispatcher (after v4.1.9).

Optimizing the Dispatcher cache

Here are some ways to optimize the Dispatcher cache:

Cache almost everything Ìý- This means cache any content that would be requested more than once by users.
Cache personalized content for different periods of time Ìý- If your site has personalized content then consider usingÌýApache Sling Dynamic IncludesÌýin your AEM application to leverage Ajax (Asynchronous JavaScript and XML calls at the browser level), SSI (Server Side Includes at the Web Server level), and ESI (Edge-side Includes at the CDN level) to cache different parts of the page for different periods of time.
Never delete the Dispatcher cache on a live Dispatcher Ìý- If a Dispatcher is serving live content and you delete the cache, itÌýcauses a massive flood of requests to go back to AEM.Ìý Due to this, the Dispatcher cache should never be deleted on a live Dispatcher.
Prime the cache Ìý- BeforeÌýdeleting the Dispatcher cache, pull the Dispatcher off your load balancer, delete the cache, thenÌýrun a web crawler tool to cache files on the Dispatcher before putting it on the load balancer.
Cache error pages Ìý- Leverage the Ìý(Apache Web Server specific)Ìýdirective to serve error pages such as 404s from the Dispatcher cache.
GZip compress all file types except for those that are pre-compressed Ìý- In Apache Web Server,ÌýÌýcould be used, but make sure thatÌý Vary: User-Agent ÌýheaderÌýisnâ€™t set.Ìý In Microsoft IIS, useÌý.

Apache configuration example (specifying only certain content types to avoid precompressed file types):

AddOutputFilterByType DEFLATE text/html text/plain text/xml text/css text/javascript application/javascript
·¡²Ô²¹²ú±ô±ðÌý/serveStaleOnError Ìý in the /cache configuration - Serve the old cache file when AEM instances are serving errors.
AddÌý Ìý to the /cache configuration - Define the number of seconds a stale, auto-invalidated resource may still be served from the cache after the last content publish event (â€œactivationâ€).Ìý This reduces the number of requests that go back to the publish instances during a large content publishing activity such as a â€œTree Activationâ€.
Add rules toÌý Ìý- Ignore querystring parameters that are not required or used by the application.Ìý This allows caching of URLs even when a querystring is present.
Cache the Cache-Control and Last-Modified response headers Ìý- Use theÌý Ìýconfiguration to cache the HTTP response headersÌý Cache-Control ÌýandÌý Last-Modified Ìý(and/orÌý ETag Ìýheader if youÌýare sending it from AEM).Ìý This helps in simplifying and optimizing caching at the CDN and browser levels.Ìý Caching these headers makes it so only AEM sets the headers, not the web server itself.Ìý Note that when you do this, then youÌýneed to start sending the headers from your AEM application.
Cache content for as long as possible ÌýandÌý reduce requests that go back to AEM Ìý- Optimize flush requests by enablingÌýrefetching flush on all flush agents. See the below section titled Re-fetching Dispatcher Flush.ÌýOr useÌý /enableTTL Ìýand setÌý Cache-Control: max-age=â€¦ Ìýheader to cache files as long as possible.Ìý SeeÌýbelowÌýfor details on this topic.

Using TTLs

As of Dispatcher version 4.1.11,Ìý/enableTTL 1Ìýcan be setÌýin any fileÌýconfiguration.Ìý This setting makes the Dispatcher respect cache expirations set in the HTTP Cache-Control response header.Ìý In other words, the Dispatcher will function similar to a CDN where primary form of cache invalidation occurs when files expire.Ìý Once you implement this and start sendingÌý Cache-Control: max-age=â€¦ Ìýfor all responses from AEM, then you can safely disable your Dispatcher flush agents in the publish instances.

After disabling flush agents on the publish instances then you may still want to be able to flush the Dispatcher cache.Ìý In that case, you can useÌý.Ìý This tool is installed on the author instance.Ìý It gives users a UI where they can perform manual cache flush requests.

I. Steps to enable TTL (â€œTime to Liveâ€ or expiration) style invalidations:

Modify source code in the AEM application to sendÌý Cache-Control Ìýheader andÌý Last-Modified Ìýfor all requests where itâ€™s not already set.
Install Dispatcher 4.1.11 or later.
SetÌý Ìýin any farm configuration of the site.
Set theÌý Ìýconfiguration to cache theÌý Cache-Control ÌýandÌý Last-Modified Ìýheaders.
Restart the web server.

II. Disable Dispatcher flush agents on the publish instances:

The Dispatcher will now use the Cache-Control header to control invalidation of the cache files.Ìý Since that is the case, then Dispatcher flushing from the publish instances is no longer required.

Go to /etc/replication/agents.publish.html on each publish instance.
Go to each flush agentâ€™s configuration and disable the agent.

III. Allow manual Dispatcher flush requests from the author instance:

Now that flush agents are disabled, you would rely entirely on theÌý Cache-Control Ìýheader to control when content is refreshed on the dispatcher.Ìý You canÌýstill allow users to issue manual flushes of the Dispatcher cache:

InstallÌýÌýon the author instance.
Configure flush agents on the author instance.
In each of the agent configurations, setÌý Triggers Ìý=> Ìý Ignore Default Ìýoption to enabled. This option makes the flush agents ignore when users clickÌý (Un)Publish ÌýorÌý (De)Activate Ìýin the AEM UI.

Re-fetching Dispatcher Flush

To optimize the Dispatcher flush requests, all Dispatcher flush agents should have a feature called refetching flush enabled.

To enable re-fetching the dispatcher flush, do the following:

Go toÌý http://aemhost:port/crx/packmgr/index.jsp Ìýand login as admin.
Download the package fromÌý.
Upload and install the package to package manager.
Go to your Dispatcher flush agent configuration. For exampleÌý /etc/replication/agents.author/flush.html
ClickÌý Edit
Set the following
- Serialization Type Ìý=Ìý Re-fetch Dispatcher Flush
- Extended Ìý=> Ìý HTTP Method Ìý=Ìý POST
ClickÌý Save

Note - The package installed above is just a basic example.Ìý To customize and optimize re-fetching flush you can modify the list of URIs that it sends.Ìý The code is open source and can be foundÌý.Ìý The code adds a list of URIs to the request body as parameters telling Dispatcher which paths to re-fetch.Ìý You can add more paths per your application requirements to optimize your siteâ€™s caching capabilities.

Detailed explanation of re-fetching flush

Normally a Dispatcher flush works by deleting files:

Touch .stat file(s)
Delete /content/foo.*
Delete /content/foo/_jcr_content

Due to the fact that files are deleted in step 2, the next time a user requests a file like /content/foo.html or /content/foo.json, while the file is being â€œre-fetchedâ€ then subsequent requests for the same file would also be sent to the publish instances until the file is cached.Ìý For slow responses or heavy traffic pages such as home pages this can cause flooding of the publish instance tier.

To solve this issue, enable a feature of the Dispatcher called re-fetching.Ìý This feature allows you to send a list of URIs that the Dispatcher should proactively â€œre-fetchâ€ and replace instead deleting.

See 22:41-27:05 in thisÌýÌýfor a demo of how it works and how to configure it.

recommendation-more-help

3d58f420-19b5-47a0-a122-5c9dab55ec7f