[CONFSERVER-8580] Indexing unprintable/encrypted PDFs fails

Type: Bug
Resolution: Fixed
Priority: Medium
Fix Version/s: 2.5.3, 2.6.0
Affects Version/s: 2.5
Component/s: Page - Export / Import
Labels:
- affects-server
- pdf-generation

Bug Fix Policy:
View Atlassian Server bug fix policy

While reindexing, the pdf extractor can report this error:

java.lang.NoClassDefFoundError: org/bouncycastle/jce/provider/BouncyCastleProvider

at org.pdfbox.pdmodel.PDDocument.openProtection(PDDocument.java:905)

at org.pdfbox.pdmodel.PDDocument.decrypt(PDDocument.java:489)

at com.atlassian.bonnie.search.extractor.PdfContentExtractor.extractText(PdfContentExtractor.java:46)

at com.atlassian.bonnie.search.extractor.BaseAttachmentContentExtractor.addFields(BaseAttachmentContentExtractor.java:31)

at com.atlassian.bonnie.search.BaseDocumentBuilder.getDocument(BaseDocumentBuilder.java:28)

at com.atlassian.confluence.search.lucene.ConfluenceObjectToDocumentConverter.convert(ConfluenceObjectToDocumentConverter.java:20)

at com.atlassian.confluence.search.lucene.ConfluenceObjectQueue$1.indexCollection(ConfluenceObjectQueue.java:75)

at com.atlassian.bonnie.index.QueueProcessingRunnableImpl.run(QueueProcessingRunnableImpl.java:39)

at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)

at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)

at java.lang.reflect.Method.invoke(Method.java:585)

at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:284)

at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:155)

at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:122)

at org.springframework.transaction.interceptor.TransactionInterceptor.invoke(TransactionInterceptor.java:56)

at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:144)

at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:174)

at $Proxy62.run(Unknown Source)

at edu.emory.mathcs.backport.java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:987)

at edu.emory.mathcs.backport.java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:528)

at java.lang.Thread.run(Thread.java:595)

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List

pdfbox-0.7.2.jar
28/May/2007 5:23 AM
3.12 MB
Tom Davies

is related to

CONFSERVER-9871 Index Queue failing to flush automatically.

Closed

relates to

CONFSERVER-8598 Rev pdfbox to 0.7.4 when released

Closed

Tom Davies added a comment - 13/Jun/2007 11:17 PM

The root cause of this problem is that we were not catching the exception, this has been fixed under ~~CONF-8608~~

Tom Davies added a comment - 13/Jun/2007 11:17 PM The root cause of this problem is that we were not catching the exception, this has been fixed under CONF-8608

Scott Farquhar added a comment - 13/Jun/2007 12:09 PM

More importantly that this one issue - why does an error with indexing one document affect the whole indexing process?

Has this error been fixed?

Scott Farquhar added a comment - 13/Jun/2007 12:09 PM More importantly that this one issue - why does an error with indexing one document affect the whole indexing process? Has this error been fixed?

m@ (Inactive) added a comment - 28/May/2007 10:52 PM

This issue can silently stop the indexing process. Replacing the jar fixes that problem.

m@ (Inactive) added a comment - 28/May/2007 10:52 PM This issue can silently stop the indexing process. Replacing the jar fixes that problem.

m@ (Inactive) added a comment - 28/May/2007 5:39 AM

A couple of the Support Case's that exhibit this error suggest that this error maybe causing the indexing process to simply stop without feedback to the user at all.

m@ (Inactive) added a comment - 28/May/2007 5:39 AM A couple of the Support Case's that exhibit this error suggest that this error maybe causing the indexing process to simply stop without feedback to the user at all.

Tom Davies added a comment - 28/May/2007 5:22 AM

In fact the version of pdfbox in 2.5 doesn't correctly extract text from unprintable PDFs – we need to roll back to 0.7.2

The workaround for this bug is to replace pdfbox-0.7.3.jar in WEB-INF/lib with the pdfbox-0.7.2.jar attached to this issue.

Tom Davies added a comment - 28/May/2007 5:22 AM In fact the version of pdfbox in 2.5 doesn't correctly extract text from unprintable PDFs – we need to roll back to 0.7.2 The workaround for this bug is to replace pdfbox-0.7.3.jar in WEB-INF/lib with the pdfbox-0.7.2.jar attached to this issue.

m@ (Inactive) added a comment - 28/May/2007 12:36 AM

BouncyCastle is a dependency of PDFBox which is needed to open encrypted PDFs.

Until this issue is resolved you can download the jar from this page:
http://www.bouncycastle.org/latest_releases.html

m@ (Inactive) added a comment - 28/May/2007 12:36 AM BouncyCastle is a dependency of PDFBox which is needed to open encrypted PDFs. Until this issue is resolved you can download the jar from this page: http://www.bouncycastle.org/latest_releases.html

Details

Description

Attachments

Attachments

Issue Links

Forms

Activity

Collapse comment: Tom Davies added a comment - 13/Jun/2007 11:17 PM

Expand comment: Tom Davies added a comment - 13/Jun/2007 11:17 PM

Collapse comment: Scott Farquhar added a comment - 13/Jun/2007 12:09 PM

Expand comment: Scott Farquhar added a comment - 13/Jun/2007 12:09 PM

Collapse comment: m@ (Inactive) added a comment - 28/May/2007 10:52 PM

Expand comment: m@ (Inactive) added a comment - 28/May/2007 10:52 PM

Collapse comment: m@ (Inactive) added a comment - 28/May/2007 5:39 AM

Expand comment: m@ (Inactive) added a comment - 28/May/2007 5:39 AM

Collapse comment: Tom Davies added a comment - 28/May/2007 5:22 AM

Expand comment: Tom Davies added a comment - 28/May/2007 5:22 AM

Collapse comment: m@ (Inactive) added a comment - 28/May/2007 12:36 AM

Expand comment: m@ (Inactive) added a comment - 28/May/2007 12:36 AM

People

Dates