Details
-
Bug
-
Resolution: Fixed
-
Low
-
2.7, 2.7.1
Description
Confluence's Lucene cannot search for Chinese characters (both traditional and simplified) in PDF file.
The same characters can be indexed fine in Word DOC file.
It appears that Confluence PDF Extractor fails to extract the chinese characters (See picture). Alphabets can be searched without any problem.
Attachments
Issue Links
- duplicates
-
CONFSERVER-4747 Not all Chinese PDFs are indexing correctly
- Closed
- is caused by
-
CONFSERVER-4747 Not all Chinese PDFs are indexing correctly
- Closed
- is incorporated by
-
CONFSERVER-16525 Errors indexing PDF documents
- Closed
- is related to
-
CONFSERVER-4747 Not all Chinese PDFs are indexing correctly
- Closed