Type: Suggestion
Resolution: Unresolved
Component/s: Enterprise Insights - Data File Retrieval
Labels: None

Support reference count: 1

User Problem

The Customer is transitioning from an API-based data extraction model to a more secure and efficient S3 Parquet-based replication system for data storage and disaster recovery.

The following challenges were identified:

The customer's requested replication model is non-standard for Atlassian and requires internal review, testing, and automation updates.
AWS replication delays for large data volumes risk disrupting sequential file processing and data accuracy.
Versioning requirements increase costs and complexity.
The Parquet schema must remain compatible with the Customer's incremental updates.
Security and setup configurations require significant customization and approval.
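
To illustrate the sequential-processing risk noted above, here is a minimal sketch of a consumer that only processes files in order and stops at the first gap. It assumes each replicated file carries an integer sequence number; that numbering scheme and the function name `ready_in_order` are hypothetical and are not described in this issue.

```python
from typing import Iterable, List


def ready_in_order(arrived: Iterable[int], next_expected: int) -> List[int]:
    """Return the sequence numbers that can be processed now, in order.

    Processing stops at the first gap, so a file that replicated late
    is never skipped over by files that arrived after it.
    """
    available = set(arrived)
    ready = []
    seq = next_expected
    while seq in available:
        ready.append(seq)
        seq += 1
    return ready


# Files 1, 2, 4 and 5 have replicated; file 3 is still in flight.
print(ready_in_order([4, 1, 5, 2], next_expected=1))  # [1, 2]
```

Files 4 and 5 stay queued until file 3 arrives, which trades some latency for the ordering that incremental updates depend on.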

Current Workarounds

Continue API-Based Incremental Updates: Use the existing hourly API process until the S3 replication model is fully implemented.
Schedule Transfers Strategically: Perform exports during off-peak hours to minimize AWS replication delays.
Small-Scale Parquet Testing: Validate the Parquet schema with limited data before scaling up.
Manual Monitoring: Assign oversight for critical data transfers to address delays or issues.
Temporary Non-Versioned Buckets: Use non-versioned buckets for initial testing to reduce costs.
Iterative Template Refinement: Adjust provided templates to fit Atlassian’s standards and expedite security approval.
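
The small-scale Parquet testing workaround could be supported by a simple schema comparison such as the sketch below. It assumes the schema of a sample file has already been read into a plain mapping of column name to type name (for example with a Parquet library); the column names and the helper `schema_problems` are hypothetical and do not come from this issue.

```python
from typing import Dict, List


def schema_problems(expected: Dict[str, str], actual: Dict[str, str]) -> List[str]:
    """Compare an expected schema with the schema of a sample file.

    Both arguments map column name to type name. The result lists
    every difference found; an empty list means the schemas match.
    """
    problems = []
    for column, dtype in expected.items():
        if column not in actual:
            problems.append(f"missing column: {column}")
        elif actual[column] != dtype:
            problems.append(
                f"type mismatch for {column}: expected {dtype}, got {actual[column]}"
            )
    for column in actual:
        if column not in expected:
            problems.append(f"unexpected column: {column}")
    return problems


# Hypothetical schemas, for illustration only.
expected = {"id": "int64", "updated_at": "timestamp", "title": "string"}
sample = {"id": "int64", "updated_at": "string", "owner": "string"}

for problem in schema_problems(expected, sample):
    print(problem)
```

Running a check like this against a limited export before scaling up would surface missing or retyped columns while they are still cheap to fix.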

relates to: ALIGNSP-28721

Assignee: Melissa Hartsock
Reporter: Rodrigo San Vicente
Votes: 1
Watchers: 8

Created: 26/May/2025 1:23 PM
Updated: 01/Jul/2025 8:19 PM

Enhancing data transfer with S3 Parquet files and replication
