Your achievements

Level 1

0% to

Level 2

Tip /
Sign in

Sign in to Community

to gain points, level up, and earn exciting badges like the new
BedrockMission!

Learn More

View all

Sign in to view all badges

Access or export old versions of sites content to file

Avatar

Avatar
Level 1
clintg6
Level 1

Likes

0 likes

Total Posts

6 posts

Correct Reply

0 solutions
View profile

Avatar
Level 1
clintg6
Level 1

Likes

0 likes

Total Posts

6 posts

Correct Reply

0 solutions
View profile
clintg6
Level 1

12-05-2021

Hello AEM team,

 

I have a collection of policies (thousands) published through AEM. Each of these policies has as many as 10 older versions of the current one.

I am in need of extracting the older versions text content due to litigation. Do you know of a way to export the content from older versions as text or HTML or PDF files?

Accepted Solutions (1)

Accepted Solutions (1)

Avatar

Avatar
Affirm 50
MVP
Vaibhavi
MVP

Likes

216 likes

Total Posts

181 posts

Correct Reply

61 solutions
Top badges earned
Affirm 50
Validate 1
Ignite 1
Give Back 5
Give Back 3
View profile

Avatar
Affirm 50
MVP
Vaibhavi
MVP

Likes

216 likes

Total Posts

181 posts

Correct Reply

61 solutions
Top badges earned
Affirm 50
Validate 1
Ignite 1
Give Back 5
Give Back 3
View profile
Vaibhavi
MVP

13-05-2021

Hi @clintg6 , 

As you have more than 1000 policies, do not suggest manual extraction. 

You can extract with a simple custom solution.

  • Fetch the older version of nodes using path and jcr:created identifier.
  • Once you get the list of node paths, appending path with .infinity.json should extract the content. /content/nodeName/jcr_content.infinity.json
  • Copy the required content to any text document. 

Answers (1)

Answers (1)

Avatar

Avatar
Boost 10
Level 2
ibishika
Level 2

Likes

15 likes

Total Posts

13 posts

Correct Reply

1 solution
Top badges earned
Boost 10
Boost 5
Give Back
Applaud 5
Boost 3
View profile

Avatar
Boost 10
Level 2
ibishika
Level 2

Likes

15 likes

Total Posts

13 posts

Correct Reply

1 solution
Top badges earned
Boost 10
Boost 5
Give Back
Applaud 5
Boost 3
View profile
ibishika
Level 2

13-05-2021

Although it will depend on what you want to do with the extracted content as html, but you can get the content as xml files by packaging up from the package manager or by pulling them into your projects content folder using some IDE plugin and then convert the extracted xml files to html.