Neeraj Jhanji [Atlassian] added a comment - 19/Sep/07 06:25 PM
Could you decide the fix release for this bug?

I've flagged this issue with our product management team. We'll update it as soon as we have it scheduled for a release.

Matt,
This is a critical issue for Japanese users - I would say the highest priority issue among open issues related to Japanese internationalization. Can we target it for a fix in 2.6.1 if possible?
Thanks,
Neeraj

Hi Neeraj,
This is classed as an improvement, so it doesn't qualify for fixing in a stable release like 2.6.1. I believe the problem with searching for single-byte characters is related to Lucene's internal encoding, so it's not a trivial fix either. I'll make sure your comment is flagged with our product manager.
Regards,

Hi,
Although I do understand this is a big problem, it is certainly not a blocker in the sense that the application crashes, lots of data is lost, and so on, so I am reducing the priority to "critical". Please see http://confluence.atlassian.com/display/Support/JIRA+usage+guidelines
Hi Per,
Appreciate your guidance. I rated this as a blocker from a sales and market development perspective, since we cannot be the best enterprise wiki in its class if our search simply does not find matching documents. For Japanese customers, this is the same as if search does not work. I am fine as long as this gets addressed within a reasonable time-frame, because we are getting a lot of heat from our customers on this issue.
cheers,
Neeraj

Attached a file
Hi Neeraj,
Thanks for providing the example documents. I am afraid I am not sure if I understand the desired behavior completely.
Steps:
1) Create a page containing the full-width character カ.
Please advise me if this is the behavior you are expecting. Unfortunately I have little experience with the Japanese language, so I may be misinterpreting your request.
Thanks,

Hi Andrew,
Thanks for looking into this. Let me explain as below:
PART 1 (Critical, applies to half-width katakana characters):
PART 2 (High, applies to both half-width and full-width katakana characters):
PART 3 (High, applies to both half-width and full-width numeric characters):
Please let me know if you have further questions.
regards,

Thanks Neeraj,
I am unable to produce the behavior you have indicated for part 1. I have attached a screenshot of correct results being returned for a page containing a half-width katakana character "ka". For part 3, when entering characters in both full width and half width, I was unable to retrieve any results. How exactly were you entering these characters?
Thanks,

Hi Andrew,
Attached is the system information of the machine the client tested this problem on. I have not verified this for all cases, and assume the client was searching via the search tab. Regarding the half- and full-width numeric characters, you may well be right. I guess now at least you are as aware of the problem as I am. We basically need to ensure that no matter which search box is used, Confluence is able to search for half- and full-width katakana and numeric characters correctly, i.e. it should pass the tests for parts 1-3. Let me know how I can help further.
Neeraj

Hi Neeraj,
I realized that I was using the English indexer when performing my searches. After switching to CJK indexing, I was able to produce results consistent with your findings. I will investigate further.

A quick update Neeraj,
Lucene's CJKAnalyzer is definitely not indexing half-width characters correctly. I have raised an issue (http://issues.apache.org/jira/browse/LUCENE-1032). Customers who are experiencing problems such as the ones you outlined should use this Analyzer in place of CJKAnalyzer until the issue with Lucene is resolved, assuming they have a Sun JDK.

Fixed by use of custom Analyzer.
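
For readers unfamiliar with the width problem discussed above, here is a minimal sketch of one common way to fold half-width katakana and full-width digits into a single form before indexing and querying. It is not the custom analyzer referred to in this issue, whose implementation is not shown in the thread. The class name WidthNormalizer is hypothetical, and the sketch assumes a JDK that provides java.text.Normalizer (Java 6 or later) and that Unicode NFKC folding is acceptable for the content being indexed.

```java
import java.text.Normalizer;

// Hypothetical helper for illustration only; not the analyzer shipped with Confluence.
final class WidthNormalizer {
    private WidthNormalizer() {}

    // Applies Unicode NFKC normalization, which maps half-width katakana
    // to full-width katakana and full-width digits/letters to ASCII.
    static String normalize(String text) {
        return Normalizer.normalize(text, Normalizer.Form.NFKC);
    }

    public static void main(String[] args) {
        // Half-width katakana KA (U+FF76)
        System.out.println(normalize("\uFF76"));
        // Half-width KA followed by the half-width voiced sound mark (U+FF9E)
        System.out.println(normalize("\uFF76\uFF9E"));
        // Full-width digits 1, 2, 3 (U+FF11 to U+FF13)
        System.out.println(normalize("\uFF11\uFF12\uFF13"));
    }
}
```

If the same normalization is applied to both the indexed text and the query string, a search typed in either width should reach the same terms. Note that NFKC also folds other compatibility characters (ligatures, circled digits and so on), which may or may not be wanted.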
Hi Andrew,
1. Where can I get the Custom Japanese Analyzer and what are the installation instructions for it?
2. Regarding the Lucene support issue you mention above, I did not see any mention of the problems with half-width Japanese characters. Please clarify.
regards,
Neeraj

Hi Neeraj,
My apologies, I provided the wrong issue number. It should be LUCENE-1032. The custom Japanese analyzer will be available as an option in the Content Indexing tab from 2.6.2 onwards.
Regards,
Andrew Lynch

We are focused on releasing Confluence 2.6 to Japanese customers. This is a major upgrade from the previous JP release, 2.4.3, since it fixes a major issue surrounding PDF export. If possible, we'd like to include the search fix in this release as well, since the next Japanese release will be further out (possibly next year). Is it possible to get a patch for 2.6?
Also, to confirm, will search for half-width katakana and numeric characters work properly from the top search bar as well as from the search tab?

Hi Neeraj,
The changes were not too numerous, so I should be able to provide a patch for 2.6.0 if essential. The search tab has not yet been fixed; I have created another issue (CONF-9834), which should be watched for updates.

Agnes, could you please review these changes prior to 2.6.2?
Thanks.