- Bug
- Resolution: Not a bug
- High
- None
- 2.0, 2.1.1, 4.3.7, 5.0, 5.1, 5.9.5
There is an issue with wildcard searching: if you search for part of a word and append a wildcard, Lucene doesn't find the word you intended.
For example, a search for "Management" (no quotes) on CAC returns a bunch of results. A search for "Managemen*", however, returns only one.
The reason is that "Managemen" is not a real English word, so it is not stemmed. The query term therefore does not match "manag", the stemmed form of "management" that we have in the index, and the correct results aren't returned. (Note: the one attachment returned by the wildcard query appears because its full filename is indexed, and that filename matches "managemen*".)
A possible solution is to store the original word, as well as the stemmed form, in a separate field in the index. When a wildcard search term comes through, search both the original and the stemmed fields. The cache may be bigger, and there may be a slight performance hit, but it will make searching more reliable in these edge cases.
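The mismatch and the proposed two-field fix can be illustrated with a small Python sketch. This is not Confluence's or Lucene's actual code: `toy_stem`, `build_index`, `search`, the `use_exact_field` flag and the sample documents are all hypothetical, and the stemmer is a crude suffix stripper standing in for a real one.

```python
import fnmatch


def toy_stem(word):
    """Hypothetical stand-in for a real stemmer: strips a few suffixes."""
    for suffix in ("ement", "ing", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def build_index(docs):
    """Store both the original and the stemmed terms for each document."""
    index = {}
    for doc_id, text in docs.items():
        words = text.lower().split()
        index[doc_id] = {
            "exact": set(words),
            "stemmed": {toy_stem(w) for w in words},
        }
    return index


def search(index, query, use_exact_field):
    """Return the ids of matching documents.

    Wildcard queries are not stemmed; they are matched as patterns against
    the stored terms. Plain queries are stemmed and looked up directly.
    """
    query = query.lower()
    hits = set()
    for doc_id, fields in index.items():
        if "*" in query:
            terms = set(fields["stemmed"])
            if use_exact_field:
                terms |= fields["exact"]  # proposed fix: also try original words
            if any(fnmatch.fnmatchcase(term, query) for term in terms):
                hits.add(doc_id)
        elif toy_stem(query) in fields["stemmed"]:
            hits.add(doc_id)
    return hits


# Made-up sample data; "attachment" mimics a filename indexed as one term.
docs = {
    "page1": "project management guide",
    "page2": "managing releases",
    "attachment": "management-plan.pdf",
}
index = build_index(docs)

print(sorted(search(index, "management", use_exact_field=False)))
print(sorted(search(index, "managemen*", use_exact_field=False)))
print(sorted(search(index, "managemen*", use_exact_field=True)))
```

With only the stemmed field, the wildcard query finds just the attachment, because "manag" does not match the pattern "managemen*". Once the original words are searched as well, "page1" is found too, at the cost of storing each term twice.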
- duplicates
  - CONFSERVER-37153 Confluence Search: Wildcards at the beginning of a word do not work (Gathering Interest)
- is duplicated by
  - CONFSERVER-41191 Words within long string not searchable in Confluence (Closed)
- is related to
  - CONFSERVER-7956 Livesearch macro doesn't find stemmed words (Closed)
  - CONFSERVER-16275 Appending asterisk to search terms causes problems... (Closed)
- relates to
  - CONFSERVER-34245 Wildcard search does not work when using in the beginning of a String. (Closed)
As per dloeng@atlassian.com's comments, the issues relating to stemming and the use of wildcards were fixed in 5.2, when we moved to using KStem. One limitation remains: a wildcard on the end of a full word will not match the root of the word, so authentication* will not match authenticate. However, authent* and authenticate* will be stemmed and will match authentication, authenticator and authenticate. This addresses the original ticket and most of the comments.
If there are other issues with search, they are probably best addressed in a separate ticket.