Cold Standby for TarMK based AEM 6.5 Author (Does it work?)
I recently opened a ticket with Adobe support re: HA and disaster recovery options for AEM 6.5. Unfortunately, the engineer I spoke with didn't have much experience with it and referred me to some documentation, and said support could not assist with the configuration. I found this odd, especially given the official documentation. I imagine it's likely others could have questions about it too. Looking over the doc, I'm almost certain I will have questions.
This doc:
seems like it makes the most sense for our on-premises AEM 6.5 TarMK (file data store) based instance.
The instructions are somewhat complex/involved (I'm a little confused by the crx-quickstart/install config files for the standby instance instructions).
I recall earlier AEM 5.x clustering was not recommended and maybe did not work properly?
Does anyone know if the above cold standby config is supported / recommended by Adobe, and is anyone using it successfully? Have you promoted standby to primary and did it work well?
Is it better just to rely on backups? This might result in more downtime for authors (but would be simpler) vs having a cold standby available. It seems there may be some steps involved to make a standby primary in the event of primary failure. In this situation, does standby become primary and then a new standby would need to be created/cloned/configured to have HA/DR available again?
In the past we had a standby author instance by publishing to a separate author. Of course this isn't ideal as many things are not replicated, but it's was much easier to configure.
I'm curious to know what other on-premises AEM customers are using (or considering using) for author HA/DR? I'm also curious to know if anyone knows of any supplemental resources/instructions outlining cold standby configurations.