  1. Confluence Data Center
  2. CONFSERVER-17989

Unknown encoding for '90ms-RKSJ-V' when indexing for PDF

      Caused by: java.io.IOException: Unknown encoding for '90ms-RKSJ-V'
      at org.pdfbox.encoding.EncodingManager.getEncoding(EncodingManager.java:83)
      at org.pdfbox.pdmodel.font.PDFont.getEncoding(PDFont.java:627)
      at org.pdfbox.pdmodel.font.PDFont.encode(PDFont.java:476)
      at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:332)
      at org.pdfbox.util.operator.ShowText.process(ShowText.java:66)
      at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:494)
      at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:207)
      at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:160)
      at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:355)
      at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:268)
      at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:220)
      at com.atlassian.bonnie.search.extractor.PdfContentExtractor.extractText(PdfContentExtractor.java:49)
      ... 16 more
      

      This is a known PDFBox bug, tracked at http://issues.apache.org/jira/browse/PDFBOX-139

      I asked a Japanese partner whose customers had encountered this issue to upgrade the PDFBox jar to version 0.7.3. After the upgrade, the problem appeared to be fixed.
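      To check whether an installation already ships the upgraded jar, something like the following could be used. This is only a sketch: it assumes the jar is named after the pattern PDFBox-<version>.jar (for example PDFBox-0.7.3.jar), which may not match every distribution, and the class and method names are hypothetical helpers, not part of Confluence or PDFBox.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: infers the PDFBox version from a jar file name.
final class PdfBoxJarVersion {

    // Assumption: the jar is named "PDFBox-<version>.jar" (case-insensitive).
    private static final Pattern JAR_NAME =
            Pattern.compile("(?i)^pdfbox-(\\d+(?:\\.\\d+)*)\\.jar$");

    // Returns the dotted version string, or empty if the name does not match.
    static Optional<String> versionFromFileName(String fileName) {
        Matcher m = JAR_NAME.matcher(fileName);
        return m.matches() ? Optional.of(m.group(1)) : Optional.empty();
    }

    // Compares dotted numeric versions component by component;
    // missing components count as 0 (so "0.7" equals "0.7.0").
    static int compareVersions(String a, String b) {
        String[] x = a.split("\\.");
        String[] y = b.split("\\.");
        for (int i = 0; i < Math.max(x.length, y.length); i++) {
            int xi = i < x.length ? Integer.parseInt(x[i]) : 0;
            int yi = i < y.length ? Integer.parseInt(y[i]) : 0;
            if (xi != yi) {
                return Integer.compare(xi, yi);
            }
        }
        return 0;
    }

    // True if the file name looks like a PDFBox jar at version 0.7.3 or later,
    // the version reported above as resolving the problem.
    static boolean isAtLeast073(String fileName) {
        return versionFromFileName(fileName)
                .map(v -> compareVersions(v, "0.7.3") >= 0)
                .orElse(false);
    }

    public static void main(String[] args) {
        // Pass jar file names (for example, those found in WEB-INF/lib) as arguments.
        for (String name : args) {
            System.out.println(name + " -> "
                    + (isAtLeast073(name) ? "0.7.3 or later" : "older or not a PDFBox jar"));
        }
    }
}
```

      A file name check like this only indicates which jar is present. It does not confirm that a given PDF will index cleanly, so re-indexing an affected document remains the real test.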

              Assignee: Unassigned
              Reporter: Roy Hartono [Atlassian] (rhartono)