-
Bug
-
Resolution: Unresolved
-
Medium
-
None
-
7.6.4, 8.5.3, 8.16.0
-
7.06
-
30
-
Severity 2 - Major
-
4
-
We use AWS CloudFormation to run an internal JDC instance. When we upgrade, nodes are torn down and new ones are created. They are meant to recover the index from a snapshot in the shared-home directory.
We upgraded this morning, spinning up 1 node and waiting for it to serve on /status, then spinning up another node.
Timeline:
- 07:30: Node 1 spun up, upgrade commences
- 08:00: Node 2 spun up
- 08:01: Node 2 begins to recover the index, and completes in 30s (a successful recovery should take around 15 minutes for this instance)
- 08:35: We kick off a manual restore from index snapshot on Node 1
- 09:10: We realise Node 2's index is corrupt, despite a 'successful' recovery, and we kick off another restore from snapshot on Node 2
- 09:24: Index recovery completes and the indexes are both working now
As you can see from the attached log, Node 1 did not attempt index recovery after startup at all. Node 2 did, but it seems to have synced an empty index from Node 1.
NB: I confirmed that the snapshots were accessible by Jira, because we used them to recover the indexes manually.
- is related to
-
JRASERVER-66649 Index recovery should use index snapshots first if they are available
- Closed
- relates to
-
JRASERVER-66860 Index recovery can result in a corrupt index despite completing successfully
- Closed
-
JRASERVER-67261 As a JIRA Datacenter Administrator I want to do an automated cold recovery from index a snapshot
- Closed
-
JRASERVER-37896 Index Recovery should have an auto-recover functions when failure happen
- Gathering Interest
-
JRASERVER-68639 As an JIRA Administrator I want to see progress of index snapshot restore in UI
- Gathering Interest
-
JDEV-29925 Loading...
-
XPLN-802 Loading...
- is resolved by
-
ASCI-3 Loading...
- mentioned in
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...