Understand Cloud Service content requests
Introduction introduction
Content requests refer to requests made to AEM Sites, including those requests related to Edge Delivery Services or customer-provided caching systems like a Content Delivery Network. These requests deliver content or data in HTML format through page views (for example, pages and Experience Fragments) or in JSON format through API calls in a headless manner. Content requests are counted either as a page view or five API calls, and are measured at the ingress of the first caching system to receive a content request. Certain HTTP requests are included or excluded for purposes of counting content requests. The full list of such included and excluded HTTP requests, and their technical definitions, are available in the documentation.
About Cloud Service content requests understanding-cloud-service-content-requests
For customers using the out-of-the-box CDN, Cloud Service content requests are measured via server-side collection of data. This collection is enabled via CDN log analysis. AEM (ÃÛ¶¹ÊÓƵ Experience Manager) as a Cloud Service automatically collects content requests server-side at the edge. It analyzes log files generated by the AEM as a Cloud Service CDN. This process is done by isolating the requests returning HTML (text/html)
or JSON (application/json)
content from the CDN, and is based on several inclusion and exclusion rules detailed below. A content request occurs regardless of whether the content is served from the CDN caches or returned to the CDN origin (AEM’s dispatchers).
Variances of Cloud Service content requests content-requests-variances
Content requests can have variances within an organization’s Analytics reporting tools as summarized in the following table. In general, avoid using analytics tools that rely on client-side instrumentation to report the number of content requests for a site. These tools often miss a large portion of traffic because they depend on user consent to be activated. Analytics tools gathering data server-side in log files, or CDN reports for customers adding their own CDN on top of AEM as a Cloud Service, provide better counts.
See also License Dashboard.
Server-side collection rules serverside-collection
There are rules in place to exclude well-known bots, including well-known services visiting the site regularly to refresh their search index or service.
Types of included content requests included-content-requests
• Amazon CloudFront
• Apache Http Client
• Asynchronous HTTP Client
• Axios
• Azureus
• Curl
• GitHub Node Fetch
• Guzzle
• Go-http-client
• Headless Chrome
• Java™ Client
• Jersey
• Node Oembed
• okhttp
• Python Requests
• Reactor Netty
• Wget
• WinHTTP
• Fast HTTP
• GitHub Node Fetch
• Reactor Netty
/system/probes/health
for health checks, ÃÛ¶¹ÊÓƵ recommends that you use /system/probes/health
endpoint and not the actual HTML pages from the site. See belowExamples:
•
Amazon-Route53-Health-Check-Service
• EyeMonIT_bot_version_0.1_
• Investis-Site24x7
• Mozilla/5.0+(compatible; UptimeRobot/2.0; )
• ThousandEyes-Dragonfly-x1
• OmtrBot/1.0
• WebMon/2.0.0
<link rel="prefetch">
requestsSee also License Dashboard.
Types of excluded content requests excluded-content-request
/system/probes/health
Examples:
• AddSearchBot
• AhrefsBot
• Applebot
• Ask Jeeves Corporate Spider
• Bingbot
• BingPreview
• BLEXBot
• BuiltWith
• Bytespider
• CrawlerKengo
• Facebookexternalhit
• Google AdsBot
• Google AdsBot Mobile
• Googlebot
• Googlebot Mobile
• lmspider
• LucidWorks
•
MJ12bot
• Pinterest
• SemrushBot
• SiteImprove
• StashBot
• StatusCake
• YandexBot
• Claudebot
/api/graphql
—to avoid double counting, they are not billable for Cloud Service.manifest.json
/etc.clientlibs/*/manifest.json
favicon1.ico