Hi guys,
I am facing a strange issue here, Our production setup consists of One Author Instance and One dispatcher and two publishers 1 and 2.
Each AEM instance running on AWS x3 large servers individually.
But still with those high performance machines also AEM instances are taking 4 hours to restart.
Our Production Servers are running from last 6 months. have arounf 10GB crx-repository folder size. Around 4GB content.
Some of the analysis which we had observed with the debugging is , that AEM instance restart process gets more for a java process around the below mentioned java API.
we went to the system profiler @ http://<Publisher1>/system/console/profiler and ran quite a few traces. The Author instance starts OK (within a couple of minutes) and then loops around these three packages:
org.apache.jackrabbit.oak.plugins.segment
org.eclipse.jetty.io.nio
org.eclipse.jetty.server.nio
We have 13066lines of:
30.03.2015 09:41:05.157 *WARN* [FelixStartLevel] org.apache.jackrabbit.oak.plugins.segment.file.TarReader Invalid entry checksum at offset 0 in tar file /opt/aem/crx-quickstart/repository/segmentstore/data00204a.tar, skipping...
(the bits in red change with each message. These run for 25 seconds so that can’t be the direct cause of the delay.
We also have 129lines of the format:
30.03.2015 09:41:31.885 *WARN* [FelixStartLevel] org.apache.jackrabbit.oak.plugins.segment.file.TarReader Invalid graph metadata in tar file /opt/aem/crx-quickstart/repository/segmentstore/data00203b.tar
Is there any cleanup jobs we need to perform on our Production Publisher instances? Is something gets loaded over time in AEM instance, which we need to cleanup periodically?
Any help in this regard is really helpfull.
Thanks