Expand my Community achievements bar.

Automation of meta data around content

Avatar

Level 1

I'm managing a couple of thousand pages across 2 websites and I want to automate the processes around identifying pages that haven't been updated recently i.e. by published date. I can output as an excel etc but are there any automations inside of AEM to assist with tasks like this?

Topics

Topics help categorize Community content and increase your ability to discover relevant content.

3 Replies

Avatar

Level 5

Hi @MikeHa25 ,

AEM doesn’t do it automatically out of the box, but you can easily automate it using Query Builder + scheduled workflow or ACS Commons report to regularly identify and manage outdated content.

In AEM, there’s no built-in automation that directly flags or reports outdated pages, but you can achieve this efficiently using a few AEM features and tools together.

Use Query Builder or SQL2 query:

You can run a query in AEM to list pages where the “last modified date” is older than the last published date (or beyond a certain time).

Example: find pages not updated in the last 6 months.

The result can be exported to Excel or JSON for reporting.

Automate with a Scheduled Workflow or Sling Job:

Create a small automation (custom workflow or scheduler) that runs this query regularly.

It can generate a report and email it to authors or store it in AEM DAM as a CSV.

Leverage AEM Reports or ACS Commons tools:

AEM has Reports and ACS Commons Report Builder, which can help you visually list and filter pages based on last modified or published date without coding.

Optional-Add Metadata Rules:

You can add logic or workflow to flag content that’s “stale” (not updated for X days) and notify the content owner automatically.

Avatar

Employee Advisor

Hello @MikeHa25 ,

Yes, AEM provides multiple automated methods to identify pages that haven’t been updated or published recently. You can use OOTB reporting, QueryBuilder automation, or scheduled workflows depending on your AEM setup.

  1. AEM Sites List View
    Switch to List View in the Sites console and enable columns such as Last Modified, Published Date, and Last Replicated By.
    You can sort or filter by these fields and export the results as CSV or Excel.

  2. Automated Queries (QueryBuilder + Scheduler)
    Use QueryBuilder API to detect stale pages and run it on a scheduled job (Sling Scheduler, Workflow step, or Groovy script).

Example Query:
{
"path": "/content/your-site",
"type": "cq:Page",
"property": "jcr:content/cq:lastModified",
"property.operation": "<",
"property.value": "2024-01-01T00:00:00.000Z"
}

Avatar

Employee

Hello @MikeHa25 
A few OOTB Options :

1. Sites console filters allow manual checking by Last Modified/Last Published.
2. Each page has properties: cq:lastModified, cq:lastReplicated, createdBy, etc - Can be queried directly to find outdated pages.
3. Create a workflow model that checks page dates and flags or reports old content; Configure a workflow launcher to run automatically.
4. Export Metadata via Sites Console (CSV Report) - https://experienceleague.adobe.com/en/docs/experience-manager-learn/sites/page-authoring/page-operat...
5. Use Query Builder or API Automation - Automate searches for pages where cq:lastModified or cq:lastReplicated is older than a threshold.