In Jira 8.10, automatic stale nodes clean-up solution was presented -
In some scenarios two days time period to move node from Active, no heartbeat state to the Offline state is too long. This problem becomes more significant in the case of managing the cluster deployed with Ansible, AWS/K8s auto-scaling etc.
Add system property to change the default retention period for stale nodes - "jira.not.alive.active.nodes.retention.period.in.hours"
ACTIVE → <5 minutes> → NO HEARTBEAT → < → OFFLINE → <2 days> → GONE
Flow with new system property:
When you will add to your JVM flags on startup this value:
Then the flow should looks like:
ACTIVE → <5 minutes> → NO HEARTBEAT → < > → OFFLINE → <2 days> → GONE
Node will be moved to Offline state after 3 hours instead of 2 days.