Problem with a .tokens explosion
Hi everyone. We're having trouble with login tokens in our prod CQ 5.6.1 SP1 environment.
We have a 1 author node / 2 publish node setup, TAR PM, no clustering. We have a caching dispatcher in front of each publish node and use a custom login module. Everything has been working great for a couple of years. The problem started occurring a couple of months ago and occurs almost daily at this point.
At arbitrary times during the day, the repo on one of the publish nodes starts to fill up, with a new tar file written almost every minute. When I look at the change history, I can see a user under /home/users/[s]/[so]/[som]/[some random user] who has hundreds of login tokens. Tokens keep being added until I restart the node. Sometimes, after I restart the exploding node, the other publish node starts exploding with the same user.
Has anyone ever seen something like this before?
When I look in the logs, the only thing I see is lines like the one below. I don't know whether they're related to the problem. Nothing stood out on Google.
error.log.2015-08-31:31.08.2015 21:20:10.801 *WARN* [pool-5-thread-5] org.apache.jackrabbit.core.ItemSaveOperation /home/users/[users different from the exploding .tokens user]/.tokens: failed to restore transient state
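To check whether these warnings line up with the times the repo fills up, a rough sketch like the following could tally them per .tokens path and per minute. The regex is based only on the single log line above, so it assumes every such warning has the same layout. The function name and the sample user paths are made up for illustration.

```python
import re
from collections import Counter

# Pattern derived from the one WARN line quoted above (assumed layout).
# Captures the timestamp truncated to the minute and the .tokens path.
WARN_RE = re.compile(
    r"(\d{2}\.\d{2}\.\d{4} \d{2}:\d{2}):\d{2}\.\d{3} \*WARN\* \[[^\]]+\] "
    r"org\.apache\.jackrabbit\.core\.ItemSaveOperation "
    r"(/home/users/\S+/\.tokens): failed to restore transient state"
)


def tally_token_warnings(lines):
    """Count matching warnings per .tokens path and per minute.

    `lines` is any iterable of log lines, e.g. an open error.log file.
    Lines that do not match the pattern are ignored.
    """
    by_path = Counter()
    by_minute = Counter()
    for line in lines:
        match = WARN_RE.search(line)
        if match:
            minute, path = match.groups()
            by_path[path] += 1
            by_minute[minute] += 1
    return by_path, by_minute


# Hypothetical sample lines; the user paths are invented for illustration.
SAMPLE = [
    "31.08.2015 21:20:10.801 *WARN* [pool-5-thread-5] "
    "org.apache.jackrabbit.core.ItemSaveOperation "
    "/home/users/a/ab/abc/abcuser/.tokens: failed to restore transient state",
    "31.08.2015 21:20:42.113 *WARN* [pool-5-thread-2] "
    "org.apache.jackrabbit.core.ItemSaveOperation "
    "/home/users/a/ab/abc/abcuser/.tokens: failed to restore transient state",
    "31.08.2015 21:21:03.004 *INFO* [pool-5-thread-1] some.other.Logger unrelated line",
]
```

With a real log, passing the open file works the same way, for example `tally_token_warnings(open("error.log"))`. A path or minute with an unusually high count would be a hint (not proof) that the warnings are connected to the token growth.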
I appreciate any guidance I can get on this one. Thanks.