[JSDSERVER-6043] Issue update/create is slow due to SLA indexing takes a long time

Type: Bug
Resolution: Fixed
Priority: Highest
Fix Version/s: 4.2.0
Affects Version/s: 3.8.0, 3.9.6, 3.16.1
Component/s: SLA
Labels:
- cqt
- pse-request

Support reference count: 19
Symptom Severity: Severity 2 - Major
UIS: 59
Bug Fix Policy: View Atlassian Server bug fix policy

Summary

On large Jira Service Desk instances, JSD projects with a large number of SLAs significantly degrade issue reindex performance.
Foreground indexing becomes slow, and issue operations such as creation, adding a comment, or transitions are also affected because they require indexing.

Environment

JIRA 7.5.x or higher, JIRA Service Desk 3.8.x or higher
Large instance, more than 1 million issues.
Large number of SLAs defined on the instance, more than 100 SLAs.

Expected Results

The re-indexing process takes roughly the same time as it did on Service Desk 3.2.x

Actual Results

The re-indexing process can take more than 48 hours to finish

Notes

The problem is largely reduced by JSDSERVER-5681 and JSDSERVER-5685 (both closed).

Technical details

JIRA generates a large number of database queries to populate the index with SLA values. JIRA no longer uses the cache to update the index; the database is the source of truth.

Each query is fast, but the sheer number of them adds up.
On large instances, the re-indexing process can take more than 48 hours to finish, which no longer fits in a weekend.
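
To make the "adds up" argument concrete, here is a rough back-of-the-envelope sketch. The issue and SLA counts are the lower bounds from the Environment section; the 1 ms per query and the single indexing thread are assumptions for illustration only, not measurements from this ticket, and the function name is hypothetical.

```python
def estimated_reindex_hours(issues, slas_per_issue, seconds_per_query, indexing_threads=1):
    """Rough wall-clock hours spent on SLA queries during a full re-index.

    Assumes one database round trip per SLA per issue and perfectly
    parallel indexing threads. Both are simplifications.
    """
    total_queries = issues * slas_per_issue
    total_seconds = total_queries * seconds_per_query / indexing_threads
    return total_seconds / 3600


# 1,000,000 issues and 100 SLAs are taken from the Environment section.
# 0.001 s per query and a single thread are assumed values.
hours = estimated_reindex_hours(1_000_000, 100, 0.001)
print(f"{hours:.1f} hours")  # prints "27.8 hours"
```

Even with a fast query, 100 million round trips account for more than a day of work under these assumptions, which is in the same order of magnitude as the re-index times reported here.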

Enabling debug logging on the package com.querydsl.sql.AbstractSQLQuery shows many queries against AO_54307E_TIMEMETRIC during an issue comment operation.
The database is queried once for every SLA in the scope of the issue:

2018-08-16 09:22:23,527 SdOffThreadEventJobRunner:thread-5 DEBUG <user> 000x000x2 qqq9999 <IP_address> /rest/api/2/issue/PROJECTKEY-1111/comment [c.querydsl.sql.AbstractSQLQuery] select * from (   select "AO_54307E_TIMEMETRIC"."ID", "AO_54307E_TIMEMETRIC"."NAME", "AO_54307E_TIMEMETRIC"."CUSTOM_FIELD_ID", "AO_54307E_TIMEMETRIC"."DEFINITION_CHANGE_DATE", "AO_54307E_TIMEMETRIC"."DEFINITION_CHANGE_MS_EPOCH", "AO_54307E_TIMEMETRIC"."GOALS_CHANGE_DATE", "AO_54307E_TIMEMETRIC"."GOALS_CHANGE_MS_EPOCH", "AO_54307E_TIMEMETRIC"."THRESHOLDS_CONFIG_CHANGE_DATE", "AO_54307E_TIMEMETRIC"."THRESHOLDS_CHANGE_MS_EPOCH", "AO_54307E_TIMEMETRIC"."CREATED_DATE" from "AO_54307E_TIMEMETRIC" "AO_54307E_TIMEMETRIC" where "AO_54307E_TIMEMETRIC"."SERVICE_DESK_ID" = ? and "AO_54307E_TIMEMETRIC"."CUSTOM_FIELD_ID" = ? ) where rownum <= ?
2018-08-16 09:22:23,531 SdOffThreadEventJobRunner:thread-5 DEBUG <user> 000x000x2 qqq9999 <IP_address> /rest/api/2/issue/PROJECTKEY-1111/comment [c.querydsl.sql.AbstractSQLQuery] select * from (   select "AO_54307E_TIMEMETRIC"."ID", "AO_54307E_TIMEMETRIC"."NAME", "AO_54307E_TIMEMETRIC"."CUSTOM_FIELD_ID", "AO_54307E_TIMEMETRIC"."DEFINITION_CHANGE_DATE", "AO_54307E_TIMEMETRIC"."DEFINITION_CHANGE_MS_EPOCH", "AO_54307E_TIMEMETRIC"."GOALS_CHANGE_DATE", "AO_54307E_TIMEMETRIC"."GOALS_CHANGE_MS_EPOCH", "AO_54307E_TIMEMETRIC"."THRESHOLDS_CONFIG_CHANGE_DATE", "AO_54307E_TIMEMETRIC"."THRESHOLDS_CHANGE_MS_EPOCH", "AO_54307E_TIMEMETRIC"."CREATED_DATE" from "AO_54307E_TIMEMETRIC" "AO_54307E_TIMEMETRIC" where "AO_54307E_TIMEMETRIC"."SERVICE_DESK_ID" = ? and "AO_54307E_TIMEMETRIC"."CUSTOM_FIELD_ID" = ? ) where rownum <= ?


~/ $ grep "PROJECTKEY-1111/comment" atlassian-jira.log | grep AO_54307E_TIMEMETRIC | wc -l
256
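
The same count can be taken for every request at once rather than one issue key at a time. The following is a minimal sketch, assuming log lines shaped like the debug excerpt above (a /rest/... request path followed by the [c.querydsl.sql.AbstractSQLQuery] logger name); the function names and the regular expression are illustrative and would need adjusting for other log layouts.

```python
import re
from collections import Counter

# Matches the request path that precedes the QueryDSL logger name,
# as in the debug log excerpt in this ticket.
LINE_PATTERN = re.compile(r"(?P<path>/rest/\S+) \[c\.querydsl\.sql\.AbstractSQLQuery\]")


def count_timemetric_queries(lines):
    """Map each request path to its number of AO_54307E_TIMEMETRIC queries."""
    counts = Counter()
    for line in lines:
        if "AO_54307E_TIMEMETRIC" not in line:
            continue
        match = LINE_PATTERN.search(line)
        if match:
            counts[match.group("path")] += 1
    return counts


def top_offenders(log_path, limit=10):
    """Return the requests that issued the most TIMEMETRIC queries."""
    with open(log_path, encoding="utf-8", errors="replace") as log:
        return count_timemetric_queries(log).most_common(limit)
```

Calling top_offenders("atlassian-jira.log") returns (path, count) pairs sorted by count, which makes it easier to spot the requests that trigger the most SLA lookups.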

Thread dumps taken during foreground indexing show that all the indexing threads spend an extended amount of time doing SLA work:

"IssueIndexer:thread-20" #416026 prio=5 os_prio=0 tid=0x0000000008c02800 nid=0x151e runnable [0x00007fdd8f46d000]
   java.lang.Thread.State: RUNNABLE
...
        at com.atlassian.pocketknife.internal.querydsl.DatabaseAccessorImpl.runInTransaction(DatabaseAccessorImpl.java:43)
        at com.atlassian.servicedesk.internal.sla.customfield.JIRACustomFieldValueStore.getTextValues(JIRACustomFieldValueStore.java:29)
        at com.atlassian.servicedesk.internal.sla.customfield.SLACFType.getValuesFromDatabase(SLACFType.java:471)
        at com.atlassian.servicedesk.internal.sla.customfield.SLACFType.loadSLAValue(SLACFType.java:440)
        at com.atlassian.servicedesk.internal.sla.customfield.SLACFType.lambda$getValueFromIssue$4(SLACFType.java:435)
        at com.atlassian.servicedesk.internal.sla.customfield.SLACFType$$Lambda$1107/517974037.get(Unknown Source)

It would be useful to reduce the number of times we need to go to the database while working with Service Desk SLAs, as database I/O is quite expensive.
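
As an illustration of what fewer round trips could look like, the sketch below contrasts one query per SLA with a single query per service desk. It is not the fix that shipped in 4.2.0, and it is not Service Desk code: the timemetric table, its columns, and all function names are simplified stand-ins modelled on the query in the debug log, run against an in-memory SQLite database.

```python
import sqlite3


def make_db(sla_count):
    """Create an in-memory stand-in table with one row per SLA metric."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE timemetric (id INTEGER PRIMARY KEY, name TEXT, "
        "service_desk_id INTEGER, custom_field_id INTEGER)"
    )
    conn.executemany(
        "INSERT INTO timemetric (name, service_desk_id, custom_field_id) VALUES (?, ?, ?)",
        [(f"SLA {i}", 1, 10000 + i) for i in range(sla_count)],
    )
    return conn


def load_one_by_one(conn, service_desk_id, custom_field_ids):
    """One round trip per SLA, mirroring the pattern seen in the debug log."""
    metrics, round_trips = {}, 0
    for field_id in custom_field_ids:
        row = conn.execute(
            "SELECT id, name FROM timemetric "
            "WHERE service_desk_id = ? AND custom_field_id = ? LIMIT 1",
            (service_desk_id, field_id),
        ).fetchone()
        round_trips += 1
        if row:
            metrics[field_id] = row
    return metrics, round_trips


def load_in_one_query(conn, service_desk_id, custom_field_ids):
    """A single round trip for the whole service desk, filtered in memory."""
    wanted = set(custom_field_ids)
    rows = conn.execute(
        "SELECT custom_field_id, id, name FROM timemetric WHERE service_desk_id = ?",
        (service_desk_id,),
    ).fetchall()
    metrics = {fid: (row_id, name) for fid, row_id, name in rows if fid in wanted}
    return metrics, 1
```

With 256 SLAs in scope, as in the grep count above, load_one_by_one performs 256 round trips while load_in_one_query returns the same mapping in one. Whether a single wider query is cheaper in practice depends on how many metrics a service desk has and on the database, so this should be read as a starting point rather than a recommendation.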

is related to

JRASERVER-66890 JIRA performance is impacted by slow queries pulling data from the customfieldvalue table (Closed)
JSDSERVER-5681 Non-optimal computation of SLA values in addDocumentFields() method (Closed)
JSDSERVER-5685 While loading values for SLA CustomField getValueFromIssue method flushes EagerLoadingOfBizCustomFieldPersister cache (Closed)

relates to

JSDSERVER-6238 Extremely slow SLA recalculation time when a Service Desk project contains a high number of issues (Closed)
JSMDC-3414 You do not have permission to view this issue

Aqqiela made changes - 14/Jul/2023 1:32 AM

Remote Link

Original: This issue links to "JSDS-3414 (JIRA Server (Bulldog))" [ 436513 ]

New: This issue links to "JSMDC-3414 (JIRA Server (Bulldog))" [ 436513 ]

Gonchik Tsymzhitov added a comment - 22/Jul/2021 5:23 AM - edited

Oops, we hit this as well. We once had more than 960 SLAs.

But I see it starting from 150 SLAs.

Adam Mason added a comment - 11/Nov/2020 4:25 PM

Echo Susan above, with JSD 4.12.1. We're running a cluster and the cluster index replication is going haywire.

Susan Hauth [Jira Queen] added a comment - 29/Sep/2020 4:30 PM

We are on JSD 4.12. I just updated an SLA and it's killing the indexing. I don't believe this is fixed

Archana Menon made changes - 06/May/2020 6:35 PM

Remote Link

New: This issue links to "Page (Confluence)" [ 482511 ]

Kevin Allen made changes - 17/Apr/2020 8:29 PM

Remote Link

New: This issue links to "Page (Confluence)" [ 479574 ]

Kevin Allen made changes - 10/Apr/2020 8:00 PM

Remote Link

New: This issue links to "Page (Confluence)" [ 478592 ]

Marko Filipan made changes - 25/Mar/2020 1:38 PM

Remote Link

New: This issue links to "Page (Confluence)" [ 476342 ]

ferrari made changes - 06/Mar/2020 4:56 PM

Remote Link

New: This issue links to "Page (Confluence)" [ 474140 ]

moofoo (Inactive) made changes - 25/Nov/2019 1:09 AM

Remote Link

New: This issue links to "Page (Confluence)" [ 460330 ]

Assignee: Markus Reil (Inactive)
Reporter: Sherif Abdelfattah (Inactive)
Affected customers: 8
Watchers: 25
Created: 26/Sep/2018 2:56 PM
Updated: 14/Jul/2023 1:32 AM
Resolved: 12/Aug/2019 5:34 AM
