[CWD-2809] Use a single query for cross-directory searches when all directories are local

Type: Suggestion
Resolution: Unresolved
Fix Version/s: None
Component/s: Directory - Internal/Delegated
Labels:
- backlog

Feedback Policy:

Our product teams collect and evaluate feedback from a number of different sources. To learn more about how we use customer feedback in the planning process, check out our new feature policy.

When querying across directories, Crowd currently queries them individually and then aggregates in-memory.

When all directories are stored locally, either as local directories or cached remote, aggregating across directories can happen directly in the database for a performance increase.

is related to

CWD-3225 Improve performance for paginated queries across multiple directories

Closed

relates to

CWD-2804 ApplicationServiceGeneric.searchUsers is slow with large numbers of users

Closed

Mark Lassau (Inactive) added a comment - 16/May/2012 2:56 AM

Aggregation in this case means two different things: sorting and removing shadowed users.
Both of these have problems if you try to do them directly in the DB.
Removing shadowed users I believe is too complex for direct DB manipulation and sorting could produce different results to today based on the collation.

Furthermore some DAO implementations (JIRA) include in memory caches, so directly calling the DB would lead to a massive decrease in performance.

Instead there are other inefficiencies that we could explore:

ApplicationServiceGeneric lowercases the usernames for duplicate detection
We already store the lowercase name in the DB but we don't expose it through the User object
ApplicationServiceGeneric sorts Users even when there are is no "maxResults" in the query
Sorting is only needed when paging.
An ALL_RESULTS query is going to be big and is most in need of optimisation.
ApplicationServiceGeneric sorts all Users
We only need to sort up to the end of the current page.
Users that don't fall into that subset can be discarded without full sorting

Mark Lassau (Inactive) added a comment - 16/May/2012 2:56 AM Aggregation in this case means two different things: sorting and removing shadowed users. Both of these have problems if you try to do them directly in the DB. Removing shadowed users I believe is too complex for direct DB manipulation and sorting could produce different results to today based on the collation. Furthermore some DAO implementations (JIRA) include in memory caches, so directly calling the DB would lead to a massive decrease in performance. Instead there are other inefficiencies that we could explore: ApplicationServiceGeneric lowercases the usernames for duplicate detection We already store the lowercase name in the DB but we don't expose it through the User object ApplicationServiceGeneric sorts Users even when there are is no "maxResults" in the query Sorting is only needed when paging. An ALL_RESULTS query is going to be big and is most in need of optimisation. ApplicationServiceGeneric sorts all Users We only need to sort up to the end of the current page. Users that don't fall into that subset can be discarded without full sorting

Assignee:: Unassigned

Reporter:: joe

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Created:: 10/Apr/2012 12:40 AM

Updated:: 19/Aug/2019 11:54 PM

Details

Description

Attachments

Issue Links

Forms

Activity

Collapse comment: Mark Lassau (Inactive) added a comment - 16/May/2012 2:56 AM

Expand comment: Mark Lassau (Inactive) added a comment - 16/May/2012 2:56 AM

People

Dates