Uploaded image for project: 'Atlassian Intelligence'
  1. Atlassian Intelligence
  2. AI-805

Break search tokens on period/dot (".") as well as space (" ")

    • 1
    • Our product teams collect and evaluate feedback from a number of different sources. To learn more about how we use customer feedback in the planning process, check out our new feature policy.

      NOTE: This suggestion is for Confluence Cloud. Using Confluence Server? See the corresponding suggestion.

      Example problematic use cases:

      • in a Confluence wiki if one searches for a hostname for example "appserver28nyc" and some pages on the wiki contain the string "appserver28nyc" in a context like "if the server your using is appserver28nyc then blah blah" whereas other pages on the wiki contain the string "appserver28nyc" in a context of like "make sure not to reboot appserver28nyc.dbpool.servers.domain.com since it will" only the former pages will be returned;
      • when searching for expressions in code, e.g. "MyClass" or "MyMethod", the search doesn't return pages that contain code like "MyClass.MyMethod()". To find the page, the user needs to search for the full expression "MyClass.Method".

      This happens because confluence (like other search engines like google) only searches for whole words, not substrings of words. Most search engines (like google) consider the period character "." to define a break in search tokens so that from an indexing perspective "appserver28nyc.dbpool.servers.domain.com" would be considered as 5 seperate tokens appserver28nyc, dbpool, servers, domain, and com. Confluence does not.

      Using wildcards could be a workaround only when searching for the first item in a dot-separated string: "MyClass.MyMethod()" can be found by querying MyClass*, but _*MyMethod()" won't work (as per CONF-9158).

            [AI-805] Break search tokens on period/dot (".") as well as space (" ")

            SET Analytics Bot made changes -
            Support reference count New: 1
            pqz made changes -
            Component/s Original: Search - Core [ 46383 ]
            Component/s New: Search - Core [ 75296 ]
            Key Original: CONFCLOUD-28957 New: AI-805
            Affects Version/s Original: 5.1-OD-4 [ 67671 ]
            Project Original: Confluence Cloud [ 18513 ] New: Atlassian Intelligence [ 23110 ]
            Katherine Yabut made changes -
            Workflow Original: JAC Suggestion Workflow [ 3410947 ] New: JAC Suggestion Workflow 3 [ 3626801 ]
            Monique Khairuliana (Inactive) made changes -
            Workflow Original: Confluence Workflow - Public Facing v3 [ 2246671 ] New: JAC Suggestion Workflow [ 3410947 ]
            Status Original: Open [ 1 ] New: Gathering Interest [ 11772 ]
            Katherine Yabut made changes -
            Workflow Original: Confluence Workflow - Public Facing v3 - TEMP [ 2142399 ] New: Confluence Workflow - Public Facing v3 [ 2246671 ]
            Katherine Yabut made changes -
            Workflow Original: Confluence Workflow - Public Facing v3 [ 1891782 ] New: Confluence Workflow - Public Facing v3 - TEMP [ 2142399 ]
            Katherine Yabut made changes -
            Workflow Original: Confluence Workflow - Public Facing v2 [ 1807125 ] New: Confluence Workflow - Public Facing v3 [ 1891782 ]
            jonah (Inactive) made changes -
            Description Original: *Example problematic use cases:*
            - in a Confluence wiki if one searches for a hostname for example "appserver28nyc" and some pages on the wiki contain the string "appserver28nyc" in a context like "if the server your using is appserver28nyc then blah blah" whereas other pages on the wiki contain the string "appserver28nyc" in a context of like "make sure not to reboot appserver28nyc.dbpool.servers.domain.com since it will" only the former pages will be returned;
            - when searching for expressions in code, e.g. "MyClass" or "MyMethod", the search doesn't return pages that contain code like "MyClass.MyMethod()". To find the page, the user needs to search for the full expression "MyClass.Method".

            This happens because confluence (like other search engines like google) only searches for whole words, not substrings of words. Most search engines (like google) consider the period character "." to define a break in search tokens so that from an indexing perspective "appserver28nyc.dbpool.servers.domain.com" would be considered as 5 seperate tokens appserver28nyc, dbpool, servers, domain, and com. Confluence does not.


            +Using wildcards could be a workaround+ only when searching for the first item in a dot-separated string: "MyClass.MyMethod()" can be found by querying _MyClass*_, but _*MyMethod()" won't work (as per CONF-9158).
            New: {panel:bgColor=#e7f4fa}
              *NOTE:* This suggestion is for *Confluence Cloud*. Using *Confluence Server*? [See the corresponding suggestion|http://jira.atlassian.com/browse/CONFSERVER-28957].
              {panel}

            *Example problematic use cases:*
            - in a Confluence wiki if one searches for a hostname for example "appserver28nyc" and some pages on the wiki contain the string "appserver28nyc" in a context like "if the server your using is appserver28nyc then blah blah" whereas other pages on the wiki contain the string "appserver28nyc" in a context of like "make sure not to reboot appserver28nyc.dbpool.servers.domain.com since it will" only the former pages will be returned;
            - when searching for expressions in code, e.g. "MyClass" or "MyMethod", the search doesn't return pages that contain code like "MyClass.MyMethod()". To find the page, the user needs to search for the full expression "MyClass.Method".

            This happens because confluence (like other search engines like google) only searches for whole words, not substrings of words. Most search engines (like google) consider the period character "." to define a break in search tokens so that from an indexing perspective "appserver28nyc.dbpool.servers.domain.com" would be considered as 5 seperate tokens appserver28nyc, dbpool, servers, domain, and com. Confluence does not.


            +Using wildcards could be a workaround+ only when searching for the first item in a dot-separated string: "MyClass.MyMethod()" can be found by querying _MyClass*_, but _*MyMethod()" won't work (as per CONF-9158).
            jonah (Inactive) made changes -
            Link New: This issue is related to CONFSERVER-28957 [ CONFSERVER-28957 ]
            vkharisma made changes -
            Link New: This issue is related to JRACLOUD-41053 [ JRACLOUD-41053 ]

              Unassigned Unassigned
              dwierzbicka Dobroslawa Wierzbicka (Inactive)
              Votes:
              12 Vote for this issue
              Watchers:
              10 Start watching this issue

                Created:
                Updated: