Type: Bug
Resolution: Obsolete
Priority: Low
Fix Version/s: None
Affects Version/s: 2.6.0, 2.7.1, 3.0.2
Component/s: Search - Core
Labels:
Environment: All

Search does not work for PDF files containing Japanese text. To reproduce, use the attached test file. Search was tested after rebuilding the Confluence index, so a stale index is not the cause.

This appears to be a problem with the text extractor used in Confluence 2.6.0 and 2.7.1.

Update: If you rename the Japanese PDF so that its file name uses roman characters, search finds the file by name, but the file contents shown in the results look corrupted (see the attached screenshot). This is expected: the extractor cannot parse the Japanese PDF, so the text it writes to the Lucene index is also corrupted.
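To illustrate why corrupted extractor output makes the contents unsearchable, here is a minimal sketch in Python. It is not Confluence's extractor or Lucene. The corruption is modelled, purely as an assumption, by reading UTF-8 bytes as Latin-1, and all function names are hypothetical. The sample text is taken from the attached file's name.

```python
def contains_japanese(text: str) -> bool:
    """Return True if text has any hiragana, katakana, or CJK ideograph."""
    for ch in text:
        cp = ord(ch)
        if (0x3040 <= cp <= 0x309F      # hiragana
                or 0x30A0 <= cp <= 0x30FF   # katakana
                or 0x4E00 <= cp <= 0x9FFF): # CJK unified ideographs
            return True
    return False


def simulate_bad_extraction(text: str) -> str:
    # Assumption: stand-in for the real failure. UTF-8 bytes are
    # misread as Latin-1, which always decodes but yields garbage.
    return text.encode("utf-8").decode("latin-1")


def naive_search(indexed_text: str, query: str) -> bool:
    # Hypothetical substring match standing in for an index lookup.
    return query in indexed_text


original = "グリーンシップ募集要項"   # text as it appears in the PDF
query = "募集要項"                    # what a user would search for
garbled = simulate_bad_extraction(original)

hit_on_clean_text = naive_search(original, query)
hit_on_garbled_text = naive_search(garbled, query)
```

In this sketch the query matches the correctly extracted text but not the garbled text, and `contains_japanese(garbled)` is false, which could serve as a quick check on extractor output before it is indexed.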


Attachments:
PDF_Search.png, 83 kB, 06/Jun/2008 10:44 AM, Neeraj Jhanji
グリーンシップ募集要項.pdf, 26 kB, 06/Jun/2008 10:44 AM, Neeraj Jhanji

relates to

CONFSERVER-9833 Search not working for Powerpoint or PDF files containing Japanese text

Closed

Assignee: Steve Haffenden (Inactive)
Reporter: Neeraj Jhanji
Votes: 6
Watchers: 5

Created: 06/Jun/2008 10:43 AM
Updated: 11/Oct/2018 8:53 AM
Resolved: 02/Jan/2014 10:32 PM
