Loading...

XML

Word

Printable

Type: Suggestion
Resolution: Unresolved
Fix Version/s: None
Component/s: Search - Core
Labels:

UIS:
9
Support reference count:
39

NOTE: This suggestion is for Confluence Server. Using Confluence Cloud? See the corresponding suggestion.

At the moment, searching for "hello-to-the-world" in Confluence always returns the same results as "hello to the world". This situation also applies for symbols like plus, underscore, period, dollar sign, percent sign, and so on.

There's also no way to prevent asterisks being treated as wildcard characters in Lucene, so you can't search for a word like "plea" and match content with asterisks around the word.

Words are also not split on dots, so you can't search for "somefile" and find pages that contains "somefile.txt" or "somefile.doc" in the text.

Technical notes

This is due to how Confluence's search tokenises search requests. It splits the query up into words based on letter characters, and ignores all symbols in the request. We use Lucene's StandardTokenizer in our EnglishAnalyzer, and similar implementations for other languages.

Here is the description of the behaviour of StandardTokenizer from Lucene:

Splits words at punctuation characters, removing punctuation. However, a dot that's not followed by whitespace is considered part of a token.

Splits words at hyphens, unless there's a number in the token, in which case the whole token is interpreted as a product number and is not split.

Recognizes email addresses and internet hostnames as one token.

An example of the grammar for this tokenizer can be viewed here: StandardTokenizerImpl.jflex.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List

search-not-working-1.png
38 kB
07/Mar/2016 10:43 AM
search-not-working-2.png
18 kB
07/Mar/2016 10:43 AM
search-not-working-3.png
18 kB
07/Mar/2016 10:43 AM

causes

JSDSERVER-5826 Ability to search for URLs in Confluence KB

Closed

is duplicated by

CONFSERVER-28241 search impossible

Closed

CONFSERVER-1678 Allow search for terms which include non-letter symbols, like plus (+), hyphen (-), period (.), colon (:), etc.

Closed

CONFSERVER-14554 Change search so that searching for cat_dog returns only pages w/cat_dog, instead of pages with cat &/or dog

Closed

CONFSERVER-15383 Provide ability to escape dot (.) to search for a literal period in a string...

Closed

is related to

CONFSERVER-28957 Break search tokens on period/dot (".") as well as space (" ")

Closed

CONFSERVER-10412 Support substring (partial) searches for terms used in Confluence: page title, attachment names, etc.

Gathering Interest

relates to

AI-582 Allow search for words and phrases with non-letter symbols: plus (+), minus (-), period (.), dollar sign ($), asterisk (*), etc.

Reviewing

mentioned in: Page Loading...; Page Loading...; Page Loading...; Page Loading...; Page Loading...; Page Loading...; Page Loading...; Page Loading...; Wiki Page Loading...

(2 is related to, 1 relates to, 9 mentioned in)

Assignee:: Unassigned
Reporter:: Roberto Fdez.
Votes:: 222 Vote for this issue
Watchers:: 136 Start watching this issue

Created:: 08/Apr/2005 12:29 PM
Updated:: 04/Jul/2026 4:07 AM

Details

Description

Attachments

Attachments

Issue Links

Forms

Activity

People

Dates