Details
-
Bug
-
Resolution: Unresolved
-
Medium
-
None
-
7.6.4, 8.5.3, 8.16.0
-
7.06
-
30
-
Severity 2 - Major
-
3
-
Description
We use AWS CloudFormation to run an internal JDC instance. When we upgrade, nodes are torn down and new ones are created. They are meant to recover the index from a snapshot in the shared-home directory.
We upgraded this morning, spinning up 1 node and waiting for it to serve on /status, then spinning up another node.
Timeline:
- 07:30: Node 1 spun up, upgrade commences
- 08:00: Node 2 spun up
- 08:01: Node 2 begins to recover the index, and completes in 30s (a successful recovery should take around 15 minutes for this instance)
- 08:35: We kick off a manual restore from index snapshot on Node 1
- 09:10: We realise Node 2's index is corrupt, despite a 'successful' recovery, and we kick off another restore from snapshot on Node 2
- 09:24: Index recovery completes and the indexes are both working now
As you can see from the attached log, Node 1 did not attempt index recovery after startup at all. Node 2 did, but it seems to have synced an empty index from Node 1.
NB: I confirmed that the snapshots were accessible by Jira, because we used them to recover the indexes manually.
Attachments
Issue Links
- is related to
-
JRASERVER-66649 Index recovery should use index snapshots first if they are available
- Closed
- relates to
-
JRASERVER-66860 Index recovery can result in a corrupt index despite completing successfully
- Closed
-
JRASERVER-67261 As a JIRA Datacenter Administrator I want to do an automated cold recovery from index a snapshot
- Closed
-
JRASERVER-37896 Index Recovery should have an auto-recover functions when failure happen
- Gathering Interest
-
JRASERVER-68639 As an JIRA Administrator I want to see progress of index snapshot restore in UI
- Gathering Interest
-
JDEV-29925 Loading...
-
XPLN-802 Loading...
- is resolved by
-
ASCI-3 Loading...
- mentioned in
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...
-
Page Loading...