  1. Jira Data Center
  2. JRASERVER-66392

Remote invalidation of caches is performed in get/reset lock

    • Type: Bug
    • Resolution: Fixed
    • Priority: Low
    • Fix Version/s: 7.2.12, 7.6.1
    • Affects Version/s: 7.5.1, 7.2.11
    • Component/s: Data Center - Other
    • Labels: None

      Summary

      Ehcache replicates invalidations synchronously over RMI during a cache reset. The replication happens inside the atlassian-caches lock, which does not allow any loads to proceed while the reset is being processed. As a result, gets are blocked for as long as the potentially slow RMI replication takes.
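
The pattern described above can be illustrated with a minimal Java sketch. It is not the atlassian-cache or Ehcache code: the class `BlockingResetSketch`, its methods, and the `remoteInvalidation` callback (standing in for the synchronous RMI call) are hypothetical names chosen for illustration, and a plain read/write lock is assumed in place of the real get/reset lock.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.locks.ReentrantReadWriteLock;
import java.util.function.Function;

/** Hypothetical sketch of the reported problem; not the real implementation. */
class BlockingResetSketch {
    // Assumption: gets take the read side, reset takes the write side.
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
    private final Map<String, String> values = new ConcurrentHashMap<>();
    private final Function<String, String> loader;
    // Stand-in for the synchronous RMI invalidation sent to other nodes.
    private final Runnable remoteInvalidation;

    BlockingResetSketch(Function<String, String> loader, Runnable remoteInvalidation) {
        this.loader = loader;
        this.remoteInvalidation = remoteInvalidation;
    }

    String get(String key) {
        lock.readLock().lock();
        try {
            // Load the value on a miss, as a loading cache would.
            return values.computeIfAbsent(key, loader);
        } finally {
            lock.readLock().unlock();
        }
    }

    void reset() {
        lock.writeLock().lock();
        try {
            values.clear();
            // Problem: the remote call runs while the write lock is held,
            // so every get() on other threads waits until the peer answers.
            remoteInvalidation.run();
        } finally {
            lock.writeLock().unlock();
        }
    }

    /** Lets a caller observe whether a reset currently holds the lock. */
    boolean isResetLockHeld() {
        return lock.isWriteLocked();
    }
}
```

If the callback takes 30 seconds because the remote node does not answer, every thread calling `get` in this sketch waits for those 30 seconds as well.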

      Environment

      • JIRA Data Center with multiple nodes
        Node A is unresponsive because of extremely high load, high memory pressure, or some other condition. In this state the node is not technically down: it is still registered as an 'Active' member of the cluster, but it is not processing requests.
        Node B still considers Node A 'Active', so it keeps sending cache synchronisation requests to Node A. Node A does not respond to them, which leaves Node B stalled.

      Expected behaviour

      Cache replication happens outside the get/reset lock and does not block cache loading.
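
One way to picture the expected behaviour is the following minimal Java sketch. It is an illustration under stated assumptions, not the fix that shipped: the class `NonBlockingResetSketch`, its methods, and the `remoteInvalidation` callback (standing in for the RMI replication) are hypothetical, and a plain read/write lock is assumed in place of the real get/reset lock.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.locks.ReentrantReadWriteLock;
import java.util.function.Function;

/** Hypothetical sketch of the expected behaviour; not the real implementation. */
class NonBlockingResetSketch {
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
    private final Map<String, String> values = new ConcurrentHashMap<>();
    private final Function<String, String> loader;
    // Stand-in for the RMI invalidation sent to other nodes.
    private final Runnable remoteInvalidation;

    NonBlockingResetSketch(Function<String, String> loader, Runnable remoteInvalidation) {
        this.loader = loader;
        this.remoteInvalidation = remoteInvalidation;
    }

    String get(String key) {
        lock.readLock().lock();
        try {
            return values.computeIfAbsent(key, loader);
        } finally {
            lock.readLock().unlock();
        }
    }

    void reset() {
        // Only the local clear is done under the lock.
        lock.writeLock().lock();
        try {
            values.clear();
        } finally {
            lock.writeLock().unlock();
        }
        // The remote call runs after the lock is released, so a slow or
        // unresponsive peer delays this caller only, not concurrent gets.
        remoteInvalidation.run();
    }

    /** Lets a caller observe whether a reset currently holds the lock. */
    boolean isResetLockHeld() {
        return lock.isWriteLocked();
    }
}
```

The trade-off in this sketch is a short window in which the local cache is already cleared while the remote nodes have not yet been told to invalidate. Whether that window is acceptable depends on the cache's consistency requirements, which the issue does not specify.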

              Assignee: Unassigned
              Reporter: Michal Orzechowski (Inactive)
              Votes: 0
              Watchers: 3