  Jira Data Center / JRASERVER-76392

Attempts to index a Lucene field with an immense term still fail the issue index


Details

    Description

      Issue Summary

      According to the fixed bug JSWSERVER-20133 and the associated developer announcement, fields that exceed the limit of 32766 bytes should be removed before they are committed to Lucene. The intention appears to be to raise an ERROR on such attempts to warn the plugin developer, while leaving the index otherwise unaffected.
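
      For context, the behaviour described there amounts to stripping the oversized field and logging an error before the document reaches the IndexWriter. The following is a minimal sketch of such a guard, assuming plain Lucene and SLF4J; the class name, log wording, and byte-length check are illustrative assumptions, not Jira's actual implementation:

      import java.nio.charset.StandardCharsets;
      import java.util.ArrayList;
      import java.util.List;

      import org.apache.lucene.document.Document;
      import org.apache.lucene.index.IndexWriter;
      import org.apache.lucene.index.IndexableField;
      import org.slf4j.Logger;
      import org.slf4j.LoggerFactory;

      // Illustrative guard: strip any field whose UTF-8 encoded value exceeds Lucene's
      // hard term limit before the document reaches IndexWriter.addDocument, logging an
      // error instead of letting the whole document fail.
      public final class ImmenseTermGuard {
          private static final Logger log = LoggerFactory.getLogger(ImmenseTermGuard.class);

          public static void removeImmenseFields(Document doc) {
              List<String> immense = new ArrayList<>();
              for (IndexableField field : doc.getFields()) {
                  String value = field.stringValue();
                  // For untokenized fields the whole value is indexed as a single term,
                  // so its UTF-8 length must stay within IndexWriter.MAX_TERM_LENGTH (32766).
                  if (value != null
                          && value.getBytes(StandardCharsets.UTF_8).length > IndexWriter.MAX_TERM_LENGTH) {
                      immense.add(field.name());
                  }
              }
              for (String name : immense) {
                  doc.removeFields(name);
                  log.error("A document contained a potential immense term in field {}. "
                          + "The field has been removed from the document.", name);
              }
          }
      }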

      However, there is evidence that Jira still attempts to commit the oversized data to Lucene, causing the failure mode described below.

      (1)

      The large value still makes its way to the Lucene library:

      2023-10-17 02:07:51,592+0000 IssueIndexer:thread-1 WARN admin 127x701x1 tuce66 127.0.0.1 /secure/admin/IndexReIndex!reindex.jspa [c.a.jira.index.AccumulatingResultBuilder] Document contains at least one immense term in field="customfield_10300" (whose UTF8 encoding is longer than the max length 32766), all of which were skipped.  Please correct the analyzer to not produce such terms.  The prefix of the first immense term is: '[88, 95, 116, 99, 114, 100, 95, 119, 115, 100, 95, 112, 101, 114, 105, 109, 101, 116, 101, 114, 95, 115, 99, 97, 95, 118, 51, 32, 104, 97]...', original message: bytes can be at most 32766 in length; got 192093
      java.lang.IllegalArgumentException: Document contains at least one immense term in field="customfield_10300" (whose UTF8 encoding is longer than the max length 32766), all of which were skipped.  Please correct the analyzer to not produce such terms.  The prefix of the first immense term is: '[88, 95, 116, 99, 114, 100, 95, 119, 115, 100, 95, 112, 101, 114, 105, 109, 101, 116, 101, 114, 95, 115, 99, 97, 95, 118, 51, 32, 104, 97]...', original message: bytes can be at most 32766 in length; got 192093
      	at org.apache.lucene.index.DefaultIndexingChain$PerField.invert(DefaultIndexingChain.java:796)
      	at org.apache.lucene.index.DefaultIndexingChain.processField(DefaultIndexingChain.java:430)
      	at org.apache.lucene.index.DefaultIndexingChain.processDocument(DefaultIndexingChain.java:392)
      	at org.apache.lucene.index.DocumentsWriterPerThread.updateDocument(DocumentsWriterPerThread.java:240)
      	at org.apache.lucene.index.DocumentsWriter.updateDocument(DocumentsWriter.java:497)
      	at org.apache.lucene.index.IndexWriter.updateDocument(IndexWriter.java:1729)
      	at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:1464)
      	at com.atlassian.jira.index.MonitoringIndexWriter.addDocument(MonitoringIndexWriter.java:65)
      	at com.atlassian.jira.index.WriterWrapper.addDocuments(WriterWrapper.java:115)
      	at com.atlassian.jira.index.WriterWithStats.addDocuments(WriterWithStats.java:133)
      	at com.atlassian.jira.index.Operations$Create.perform(Operations.java:166)
      	at com.atlassian.jira.index.Operations$Completion.perform(Operations.java:346)
      	at com.atlassian.jira.index.DefaultIndexEngine$FlushPolicy.perform(DefaultIndexEngine.java:86)
      	at com.atlassian.jira.index.DefaultIndexEngine.write(DefaultIndexEngine.java:150)
      	at com.atlassian.jira.index.DefaultIndex.perform(DefaultIndex.java:28)
      	at com.atlassian.jira.issue.index.DefaultIssueIndexer$IndexIssueOperation.createResult(DefaultIssueIndexer.java:1078)
      	at com.atlassian.jira.issue.index.DefaultIssueIndexer$IndexIssueOperation.createResult(DefaultIssueIndexer.java:1052)
      	at com.atlassian.jira.issue.index.DefaultIssueIndexer$EntityOperation.lambda$null$7(DefaultIssueIndexer.java:910)
      	at java.base/java.util.Optional.map(Optional.java:265)
      	at com.atlassian.jira.issue.index.DefaultIssueIndexer$EntityOperation.lambda$createResult$8(DefaultIssueIndexer.java:910)
      	at java.base/java.util.HashMap.forEach(HashMap.java:1337)
      	at com.atlassian.jira.issue.index.DefaultIssueIndexer$EntityOperation.createResult(DefaultIssueIndexer.java:909)
      	at com.atlassian.jira.issue.index.DefaultIssueIndexer$EntityOperation.perform(DefaultIssueIndexer.java:884)
      	at com.atlassian.jira.issue.index.DefaultIssueIndexer.lambda$processInnerBatch$6(DefaultIssueIndexer.java:338)
      	at com.atlassian.jira.index.SimpleIndexingStrategy.apply(SimpleIndexingStrategy.java:7)
      	at com.atlassian.jira.index.SimpleIndexingStrategy.apply(SimpleIndexingStrategy.java:5)
      	at com.atlassian.jira.index.MultiThreadedIndexingStrategy$1.call(MultiThreadedIndexingStrategy.java:47)
      	at com.atlassian.jira.index.MultiThreadedIndexingStrategy$1.call(MultiThreadedIndexingStrategy.java:43)
      	at com.atlassian.jira.util.concurrent.BoundedExecutor$2.call(BoundedExecutor.java:68)
      	at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
      	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
      	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
      	at java.base/java.lang.Thread.run(Thread.java:829)
      Caused by: org.apache.lucene.util.BytesRefHash$MaxBytesLengthExceededException: bytes can be at most 32766 in length; got 192093
      	at org.apache.lucene.util.BytesRefHash.add(BytesRefHash.java:263)
      	at org.apache.lucene.index.TermsHashPerField.add(TermsHashPerField.java:151)
      	at org.apache.lucene.index.DefaultIndexingChain$PerField.invert(DefaultIndexingChain.java:786)
      	... 32 more
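
      This exception comes from Lucene's hard per-term limit and can be reproduced outside Jira. Below is a minimal standalone sketch, assuming a plain Lucene setup; the field name and the 192093-byte size are copied from the log above purely for illustration:

      import java.nio.file.Files;
      import java.util.Arrays;

      import org.apache.lucene.analysis.standard.StandardAnalyzer;
      import org.apache.lucene.document.Document;
      import org.apache.lucene.document.Field;
      import org.apache.lucene.document.StringField;
      import org.apache.lucene.index.IndexWriter;
      import org.apache.lucene.index.IndexWriterConfig;
      import org.apache.lucene.store.Directory;
      import org.apache.lucene.store.FSDirectory;

      // Standalone reproduction of the underlying Lucene behaviour (not Jira code):
      // a single untokenized value longer than 32766 bytes makes addDocument throw an
      // IllegalArgumentException wrapping MaxBytesLengthExceededException, and the
      // whole document is rejected rather than just the offending field.
      public class ImmenseTermRepro {
          public static void main(String[] args) throws Exception {
              try (Directory dir = FSDirectory.open(Files.createTempDirectory("immense-term"));
                   IndexWriter writer = new IndexWriter(dir, new IndexWriterConfig(new StandardAnalyzer()))) {
                  Document doc = new Document();
                  // 192093 bytes of ASCII, mirroring the size reported in the log above.
                  char[] chars = new char[192093];
                  Arrays.fill(chars, 'a');
                  doc.add(new StringField("customfield_10300", new String(chars), Field.Store.NO));
                  writer.addDocument(doc); // throws: "bytes can be at most 32766 in length; got 192093"
              }
          }
      }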
      
      (2)

      The failure then flows upward to the calling method:

      • The full reindex is marked as failed
      • The index recovery "catch up" phase is marked as failed, which in turn triggers a full reindex (see JRASERVER-76391)

      Steps to Reproduce

      Expected Results

      • The message "A document contained a potential immense term in field customfield_10220_timeline. The field has been removed from the document." is logged
      • No further errors are logged and indexing proceeds normally. Possibly, the issue is not marked as failed.

      Actual Results

      • org.apache.lucene.util.BytesRefHash$MaxBytesLengthExceededException is raised
      • The issue is marked as failed
      • The failure flows upward to the parent method (full reindex, index recovery, etc.)

      Workaround

      Currently there is no known workaround for this behavior. A workaround will be added here when available.
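
      That said, the developer announcement referenced above asks plugin developers not to produce such terms in the first place, so plugin authors who control the custom field indexer generating the oversized value can avoid triggering the failure at the source. The helper below is a hypothetical sketch for truncating a value to Lucene's 32766-byte term limit on a valid UTF-8 character boundary; it is not an official workaround, and the class and method names are assumptions:

      import java.nio.ByteBuffer;
      import java.nio.CharBuffer;
      import java.nio.charset.CharsetEncoder;
      import java.nio.charset.CodingErrorAction;
      import java.nio.charset.StandardCharsets;

      // Hypothetical helper a plugin's custom field indexer could call before adding a
      // value to the Lucene document, keeping any single indexed value under the
      // 32766-byte term limit without splitting a multi-byte character.
      public final class TermLimit {
          public static final int MAX_TERM_BYTES = 32766;

          public static String truncateToTermLimit(String value) {
              if (value == null
                      || value.getBytes(StandardCharsets.UTF_8).length <= MAX_TERM_BYTES) {
                  return value;
              }
              CharsetEncoder encoder = StandardCharsets.UTF_8.newEncoder()
                      .onMalformedInput(CodingErrorAction.IGNORE)
                      .onUnmappableCharacter(CodingErrorAction.IGNORE);
              ByteBuffer out = ByteBuffer.allocate(MAX_TERM_BYTES);
              // encode() fills the output buffer and stops at the last complete character
              // that fits, so the truncated prefix is still valid UTF-8.
              encoder.encode(CharBuffer.wrap(value), out, true);
              return new String(out.array(), 0, out.position(), StandardCharsets.UTF_8);
          }
      }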


      People

        Assignee: Unassigned
        Reporter: Alex [Atlassian,PSE] (allewellyn@atlassian.com)
        Votes: 4
        Watchers: 8
