SOLVED

Joining a node to a cluster fails

Our system consists of two nodes joined in a cluster. The server hosting one of the nodes ran out of disk space, and that node crashed. After deleting old backups we tried to restart the crashed node, but it did not start correctly. Finally, we deleted the entire crx-quickstart folder on the crashed node and performed a fresh installation. After startup we tried to join the node to the cluster again, but the join failed.

Can you help us? 

This is the log information:

19.08.2015 10:18:42.546 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] org.apache.jackrabbit.core.RepositoryImpl Starting repository...

19.08.2015 10:18:42.546 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] org.apache.jackrabbit.core.fs.local.LocalFileSystem LocalFileSystem initialized at path /cq561/authors/n1/crx-quickstart/crx.0002/repository

19.08.2015 10:18:42.566 *ERROR* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] org.apache.sling.commons.classloader.impl.ClassLoaderFacade Dynamic class loader has already been deactivated.

19.08.2015 10:18:42.566 *ERROR* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] org.apache.sling.commons.classloader.impl.ClassLoaderFacade Dynamic class loader has already been deactivated.

19.08.2015 10:18:42.566 *ERROR* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] org.apache.sling.commons.classloader.impl.ClassLoaderFacade Dynamic class loader has already been deactivated.

19.08.2015 10:18:42.606 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.core.cluster.ClusterController Trying to connect to a master, as the file clustered.txt exists.

19.08.2015 10:18:42.615 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.core.cluster.ClusterController Node 7fe8a4c7-c935-442b-a4fc-4eb5876e644a started as: slave, connected to address: /10.241.39.248:8088

19.08.2015 10:18:52.639 *INFO* [Shell Script Executor Thread for cpu.sh] com.adobe.granite.monitoring.impl.ScriptConfigImpl interval thread for cpu.sh finished

19.08.2015 10:18:52.640 *INFO* [Shell Script Executor Thread for diskusage.sh] com.adobe.granite.monitoring.impl.ScriptConfigImpl interval thread for diskusage.sh finished

19.08.2015 10:19:50.509 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.persistence.tar.ClusterTarSet All tar file are new: re-creating index*.tar files

19.08.2015 10:19:50.514 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.persistence.tar.index.IndexSet Deleting all index files in /cq561/authors/n1/crx-quickstart/crx.0002/tarJournal

19.08.2015 10:19:50.514 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.persistence.tar.OptimizeThread Wait for pending delete operations

19.08.2015 10:20:02.132 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.persistence.tar.index.IndexSet Deleting index file /cq561/authors/n1/crx-quickstart/crx.0002/tarJournal/index_1_0.tar

19.08.2015 10:20:02.150 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.persistence.tar.TarSet scanning index /cq561/authors/n1/crx-quickstart/crx.0002/tarJournal/data_00177.tar id:177 length:268438528 append:-1 2118979126

19.08.2015 10:20:03.171 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.persistence.tar.TarSet scanning index /cq561/authors/n1/crx-quickstart/crx.0002/tarJournal/data_00178.tar id:178 length:268439552 append:-1 763601305

19.08.2015 10:20:04.051 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.persistence.tar.TarSet scanning index /cq561/authors/n1/crx-quickstart/crx.0002/tarJournal/data_00179.tar id:179 length:268439552 append:-1 199872395

19.08.2015 10:20:04.907 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.persistence.tar.TarSet scanning index /cq561/authors/n1/crx-quickstart/crx.0002/tarJournal/data_00180.tar id:180 length:268439552 append:-1 1608845871

19.08.2015 10:20:05.743 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.persistence.tar.TarSet scanning index /cq561/authors/n1/crx-quickstart/crx.0002/tarJournal/data_00181.tar id:181 length:268437504 append:-1 905954520

19.08.2015 10:20:06.589 *INFO* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] com.day.crx.persistence.tar.TarSet scanning index /cq561/authors/n1/crx-quickstart/crx.0002/tarJournal/data_00182.tar id:182 length:97697280 append:-1 1544263789

19.08.2015 10:21:47.603 *ERROR* [127.0.0.1 [1440404507602] GET / HTTP/1.1] org.apache.sling.engine.impl.SlingHttpContext handleSecurity: AuthenticationSupport service missing. Cannot authenticate request.

19.08.2015 10:21:47.603 *ERROR* [127.0.0.1 [1440404507602] GET / HTTP/1.1] org.apache.sling.engine.impl.SlingHttpContext handleSecurity: Possible reason is missing Repository service. Check AuthenticationSupport dependencies.

19.08.2015 10:27:51.151 *ERROR* [127.0.0.1 [1440404871150] GET / HTTP/1.1] org.apache.sling.engine.impl.SlingHttpContext handleSecurity: AuthenticationSupport service missing. Cannot authenticate request.

19.08.2015 10:27:51.151 *ERROR* [127.0.0.1 [1440404871150] GET / HTTP/1.1] org.apache.sling.engine.impl.SlingHttpContext handleSecurity: Possible reason is missing Repository service. Check AuthenticationSupport dependencies.

19.08.2015 10:27:51.151 *ERROR* [127.0.0.1 [1440404871150] GET / HTTP/1.1] org.apache.sling.engine.impl.SlingHttpContext handleSecurity: AuthenticationSupport service missing. Cannot authenticate request.

19.08.2015 10:27:51.151 *ERROR* [127.0.0.1 [1440404871150] GET / HTTP/1.1] org.apache.sling.engine.impl.SlingHttpContext handleSecurity: Possible reason is missing Repository service. Check AuthenticationSupport dependencies.
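
For anyone reading a log like the one above, a small script can make the repeated errors easier to spot. The following is only a sketch: the line layout (`date time *LEVEL* [thread] logger message`) is inferred from this excerpt, and the names `LINE_RE`, `summarize` and `SAMPLE` are hypothetical helpers, not part of CQ or Sling. It counts identical ERROR/WARN messages per logger.

```python
import re
from collections import Counter

# Assumed layout, inferred from the excerpt above:
#   DD.MM.YYYY HH:MM:SS.mmm *LEVEL* [thread] logger message
# The thread part is matched greedily because it can contain nested
# brackets (e.g. "[10.117.10.220 [1440404313510] POST ... HTTP/1.1]").
# Limitation: a message that itself contains "] " could confuse the match.
LINE_RE = re.compile(
    r"^(?P<ts>\d{2}\.\d{2}\.\d{4} \d{2}:\d{2}:\d{2}\.\d{3}) "
    r"\*(?P<level>[A-Z]+)\* "
    r"\[(?P<thread>.*)\] "
    r"(?P<logger>[\w.$]+) "
    r"(?P<message>.*)$"
)


def summarize(lines, levels=("ERROR", "WARN")):
    """Count identical (level, logger, message) entries for the given levels."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match and match.group("level") in levels:
            key = (match.group("level"), match.group("logger"), match.group("message"))
            counts[key] += 1
    return counts


# A few lines copied from the log in this post.
SAMPLE = [
    "19.08.2015 10:18:42.566 *ERROR* [10.117.10.220 [1440404313510] POST /libs/granite/cluster/content/admin/cluster/ HTTP/1.1] org.apache.sling.commons.classloader.impl.ClassLoaderFacade Dynamic class loader has already been deactivated.",
    "19.08.2015 10:18:52.639 *INFO* [Shell Script Executor Thread for cpu.sh] com.adobe.granite.monitoring.impl.ScriptConfigImpl interval thread for cpu.sh finished",
    "19.08.2015 10:21:47.603 *ERROR* [127.0.0.1 [1440404507602] GET / HTTP/1.1] org.apache.sling.engine.impl.SlingHttpContext handleSecurity: AuthenticationSupport service missing. Cannot authenticate request.",
    "19.08.2015 10:27:51.151 *ERROR* [127.0.0.1 [1440404871150] GET / HTTP/1.1] org.apache.sling.engine.impl.SlingHttpContext handleSecurity: AuthenticationSupport service missing. Cannot authenticate request.",
]

if __name__ == "__main__":
    for (level, logger, message), n in summarize(SAMPLE).most_common():
        print(f"{n}x {level} {logger}: {message}")
```

To run it against a real file, pass an open file handle instead of `SAMPLE`, for example `summarize(open("error.log", encoding="utf-8"))`; the file name here is a placeholder.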

1 Accepted Solution

Correct answer by
Employee

Hi Juan,

I'd consider creating the slave from a clone, as described here [1]. This can save time, because you don't have to sync a brand-new instance.

Regards,

Opkar

[1] http://cq-ops.tumblr.com/post/42523140404/quickly-creating-a-cq-cluster-using-manual-slave
