Details
-
Bug
-
Resolution: Duplicate
-
Medium
-
None
-
3.0
-
Confluence 3.0, Apache Tomcat/6.0.14, JDK 1.6.0_13, HSQL/MySQL, Windows XP/Vista/7
Description
I created two sample files as attached to reproduce this issue.
Test1.doc contains less characters and it can be searched/indexed properly.
Test2.doc contains more characters (add a new line) so that its content cannot be searched/indexed.
If a document contains ASCII characters only, there would be no problem.
Attachments
Issue Links
- duplicates
-
CONFSERVER-6888 Some word docs don't get correctly indexed
- Closed
- relates to
-
CONFSERVER-4747 Not all Chinese PDFs are indexing correctly
- Closed