[BSERV-4057] Handle database/filesystem conflicts (due to incorrect backup/restore) more gracefully

Type: Suggestion
Resolution: Duplicate
Fix Version/s: None
Component/s: Maintenance - Backup / Restore
Labels:
None
Environment:
RHEL Linux.

Feedback Policy:

We collect Bitbucket feedback from various sources, and we evaluate what we've collected when planning our product roadmap. To understand how this piece of feedback will be reviewed, see our Implementation of New Features Policy.

Original summary: "Add check on creating/deleting a Repository"
Hi.
On my Backup/restore tests I wanted to find out what happens if I do this backup "online" (hot-backup).
I found out that always the database has the main knowledge of the system configuration and it seems only the filesystem (stash_home) knows some extra informations about plugins and repositories.

In a scenario where you create a database backup followed by the backup of stash_home within a huge (unnatuaral) period of time you can describe following case:
if you create a repository within this time, it comes that if you restore that backup - this repository is gone. This is okay as at the time of the database backup the repository was unknown. But sadly this repository exists in the stash_home filesystem. First nobody knows BUT:
If you now try to create a new Repository on this restored system you get an error: stash says the reppsitory "repoID" (it's the repo-id-number) would exist... Looking into the filesystem you will see this repo existing...

I came to the conclusion that Stash iterates by 1 when creating a new repository...
I'd like to ask you to improve that by checking first if a repo with the next iteration already exists and suggest to create a new repository with one more iteration. Also put a big warning that it seems there is a "ghost-repository"
this would not paralyze our users from creatign repositories when having a huge stash-instance later

You can imageine that there is obviously the same counter when deleting a repository within that timespan.

Maybe as a spanning view I'd suggest to store some metainformation (which stash-project, wich repo-name) inside the bulk-repository on the filesystem. (I'll look for an open issue for that or open another one later)

Thanks!

duplicates

BSERV-4255 Backup Stash with zero downtime

Closed

Katherine Yabut made changes - 19/Sep/2019 5:50 AM

Workflow	Original: JAC Suggestion Workflow [ 3396353 ]	New: JAC Suggestion Workflow 3 [ 3624451 ]
Status	Original: RESOLVED [ 5 ]	New: Closed [ 6 ]

Monique Khairuliana (Inactive) made changes - 21/Aug/2019 12:46 AM

Workflow

Original: BSERV Suggestions Workflow [ 2687605 ]

New: JAC Suggestion Workflow [ 3396353 ]

Owen made changes - 04/Jun/2018 4:14 AM

Workflow	Original: Stash Workflow [ 579781 ]	New: BSERV Suggestions Workflow [ 2687605 ]
Status	Original: Closed [ 6 ]	New: Resolved [ 5 ]

Roger Barnes (Inactive) made changes - 18/May/2016 7:15 AM

Resolution		New: Duplicate [ 3 ]
Status	Original: Open [ 1 ]	New: Closed [ 6 ]

Roger Barnes (Inactive) made changes - 18/May/2016 7:15 AM

Link

New: This issue duplicates ~~BSERV-4255~~ [ ~~BSERV-4255~~ ]

Roger Barnes (Inactive) added a comment - 18/May/2016 7:15 AM

We'll track this suggestion in ~~BSERV-4255~~ which we believe incorporates this.

Roger Barnes (Inactive) added a comment - 18/May/2016 7:15 AM We'll track this suggestion in BSERV-4255 which we believe incorporates this.

Stefan Saasen (Inactive) made changes - 04/Aug/2014 5:19 AM

Affects Version/s	Original: 2.8.3 [ 37296 ]
Issue Type	Original: Improvement [ 4 ]	New: Suggestion [ 10000 ]
Labels	Original: backup repo-mgmt restore
Priority	Original: Minor [ 4 ]

herzog@t-systems.com_match added a comment - 26/Nov/2013 1:01 PM

Hi thank you for fiddling with that.
Well, your question is a good one. I can't really answer it but think of it.

As I know from other applications we run in our company they have some "consistency checks". Some are running on demand, others can be set on application start up. They compare the Data held in Database with the one that are stored in the filesystem. How they solve the conflicts differs from application to application and I don't really know what they internally really do if there are conflicts...
But as there is an unwritten general concept of how to do application backups: database first, filesystem later - it make sense to follow this by checking the consistency of DB vs FS by helding the DB as "base line information".

herzog@t-systems.com_match added a comment - 26/Nov/2013 1:01 PM Hi thank you for fiddling with that. Well, your question is a good one. I can't really answer it but think of it. As I know from other applications we run in our company they have some "consistency checks". Some are running on demand, others can be set on application start up. They compare the Data held in Database with the one that are stored in the filesystem. How they solve the conflicts differs from application to application and I don't really know what they internally really do if there are conflicts... But as there is an unwritten general concept of how to do application backups: database first, filesystem later - it make sense to follow this by checking the consistency of DB vs FS by helding the DB as "base line information".

Roger Barnes (Inactive) made changes - 26/Nov/2013 12:44 PM

Status

Original: Needs Triage [ 10030 ]

New: Open [ 1 ]

Roger Barnes (Inactive) made changes - 26/Nov/2013 12:44 PM

Description

Original: Hi.
On my Backup/restore tests I wanted to find out what happens if I do this backup "online" (hot-backup).
I found out that always the database has the main knowledge of the system configuration and it seems only the filesystem (stash_home) knows some extra informations about plugins and repositories.

In a scenario where you create a database backup followed by the backup of stash_home within a huge (unnatuaral) period of time you can describe following case:
if you create a repository within this time, it comes that if you restore that backup - this repository is gone. This is okay as at the time of the database backup the repository was unknown. But sadly this repository exists in the stash_home filesystem. First nobody knows BUT:
If you now try to create a new Repository on this restored system you get an error: stash says the reppsitory "repoID" (it's the repo-id-number) would exist... Looking into the filesystem you will see this repo existing...

I came to the conclusion that Stash iterates by 1 when creating a new repository...
I'd like to ask you to improve that by checking first if a repo with the next iteration already exists and suggest to create a new repository with one more iteration. Also put a big warning that it seems there is a "ghost-repository"
this would not paralyze our users from creatign repositories when having a huge stash-instance later :)

You can imageine that there is obviously the same counter when deleting a repository within that timespan.

Maybe as a spanning view I'd suggest to store some metainformation (which stash-project, wich repo-name) inside the bulk-repository on the filesystem. (I'll look for an open issue for that or open another one later)

Thanks!

New: Original summary: "Add check on creating/deleting a Repository"
Hi.
On my Backup/restore tests I wanted to find out what happens if I do this backup "online" (hot-backup).
I found out that always the database has the main knowledge of the system configuration and it seems only the filesystem (stash_home) knows some extra informations about plugins and repositories.

In a scenario where you create a database backup followed by the backup of stash_home within a huge (unnatuaral) period of time you can describe following case:
if you create a repository within this time, it comes that if you restore that backup - this repository is gone. This is okay as at the time of the database backup the repository was unknown. But sadly this repository exists in the stash_home filesystem. First nobody knows BUT:
If you now try to create a new Repository on this restored system you get an error: stash says the reppsitory "repoID" (it's the repo-id-number) would exist... Looking into the filesystem you will see this repo existing...

I came to the conclusion that Stash iterates by 1 when creating a new repository...
I'd like to ask you to improve that by checking first if a repo with the next iteration already exists and suggest to create a new repository with one more iteration. Also put a big warning that it seems there is a "ghost-repository"
this would not paralyze our users from creatign repositories when having a huge stash-instance later :)

You can imageine that there is obviously the same counter when deleting a repository within that timespan.

Maybe as a spanning view I'd suggest to store some metainformation (which stash-project, wich repo-name) inside the bulk-repository on the filesystem. (I'll look for an open issue for that or open another one later)

Thanks!

Assignee:: Unassigned

Reporter:: herzog@t-systems.com_match

Votes:: 1 Vote for this issue

Watchers:: 7 Start watching this issue

Created:: 06/Nov/2013 11:41 AM

Updated:: 19/Sep/2019 5:50 AM

Resolved:: 18/May/2016 7:15 AM

Details

Description

Attachments

Issue Links

Forms

Activity

Collapse comment: Roger Barnes (Inactive) added a comment - 18/May/2016 7:15 AM

Expand comment: Roger Barnes (Inactive) added a comment - 18/May/2016 7:15 AM

Collapse comment: herzog@t-systems.com_match added a comment - 26/Nov/2013 1:01 PM

Expand comment: herzog@t-systems.com_match added a comment - 26/Nov/2013 1:01 PM

People

Dates