Uploaded image for project: 'Confluence Data Center'
  1. Confluence Data Center
  2. CONFSERVER-87298

Soft and Hard Retention cleanup jobs take long time to run

    XMLWordPrintable

Details

    Description

      Issue Summary

      This is reproducible on Data Center: yes

      Steps to Reproduce

      1. Create an instance with many content items with many versions (multiple spaces)
      2. Set a retention rule on a single space and run the job

      Expected Results

      The rules applied and the retention job cleans up content every 10 minutes, efficiently clearing all content that applies to the rules

      Actual Results

      The job can run upwards of 30 minutes per cycle, causing the cleanup to to take a long time. Note the time difference

      2023-05-04 13:06:22,683 DEBUG [Caesium-1-4] [confluence.impl.retention.DefaultHistoricalVersionService] find Finding Historical ATTACHMENT Content for content IDs starting with 73652368, result limit [3000]
      2023-05-04 13:35:58,861 DEBUG [Caesium-1-4] [impl.retention.manager.DefaultSpaceRetentionPolicyManager] getPolicy No SpaceRetentionPolicies found for space with key TEST

      This is because the find operation scans the entire content table. This is run twice (for page and attachments) for each job run

      Workaround

      Currently, there is no known workaround for this behaviour. A workaround will be added here when available.

      Atlassian Update - 16th January 2024

      Hi everyone,

      Thank you for sharing your interest and feedback on this issue.

      We are aware of some scalability limitations in the way the retention rules are currently operating. Resolving these issues is on our backlog and is something we plan on addressing later this calendar year, following our current focus on security.

      Please continue to vote on and watch this ticket to share the importance of this bug with us and to receive updates on our progress.

      To learn more about our recent investments in Confluence Data Center, please check our public roadmap and our dashboards containing recently resolved issues, and current work and future plans.

      Kind regards,
      Confluence Data Center

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              ephillips@atlassian.com Edward
              Votes:
              34 Vote for this issue
              Watchers:
              44 Start watching this issue

              Dates

                Created:
                Updated: