Questions: deduplication, compression, extraction #84

Closed
opened 2018-08-01 21:08:17 +00:00 by patricko · 5 comments

Again, some noob questions

  1. Borg is deduplicating files, so does it mean that one data corruption can lead to all the other backups being corrupted?

Is there a backup of backups with borgmatic? like a "full backup" every 7 days or a "base backup".

  1. To use compression, we set either:
none
lz4
zlib,1-9
lzma,1-9

Is that right? I tried but I saw 0 difference in the result size though

  1. List of one or more consistency checks to run: "repository", "archives", and/or

    "extract". Defaults to "repository" and "archives". Set to "disabled" to disable

    all consistency checks. "repository" checks the consistency of the repository,

    "archive" checks all of the archives, and "extract" does an extraction dry-run

    of just the most recent archive.

    checks:
    - repository
    - archives
    - extract

I added "-extract" maybe you should add it to the conf too.
Anyway, this will verify if the last archive is corrupted or not? or what does it do exactly?

Again, some noob questions 1) Borg is deduplicating files, so does it mean that one data corruption can lead to all the other backups being corrupted? Is there a backup of backups with borgmatic? like a "full backup" every 7 days or a "base backup". 2) To use compression, we set either: ``` none lz4 zlib,1-9 lzma,1-9 ``` Is that right? I tried but I saw 0 difference in the result size though 3) # List of one or more consistency checks to run: "repository", "archives", and/or # "extract". Defaults to "repository" and "archives". Set to "disabled" to disable # all consistency checks. "repository" checks the consistency of the repository, # "archive" checks all of the archives, and "extract" does an extraction dry-run # of just the most recent archive. checks: - repository - archives - extract I added "-extract" maybe you should add it to the conf too. Anyway, this will verify if the last archive is corrupted or not? or what does it do exactly?
witten added the
question / support
label 2018-08-02 05:08:07 +00:00
Owner
  1. As far as I know, data corruption of a de-duplicated block can indeed screw up multiple different backup archives. If this is a concern, I recommend backing up to multiple different locations. There's not a "full backup" feature like you're describing with borgmatic or Borg. But it would be pretty easy to declare multiple repositories in your borgmatic config file, and borgmatic will backup to each repository separately, thereby getting you some of the redundancy you're looking for. (Note that unless these are stored on separate disks or servers, it won't necessarily help redundancy much.)

Also, there are a couple of Borg FAQ entries that may be of interest on this topic.

  1. That's generally right, but there are a few more compression options as well. Run borg help compression for more info. Whether compression does anything depends entirely on the source data. If your source data is already pretty well compressed, extra compression won't help much. If you suspect there's a borgmatic bug here (e.g. not passing your compression option to Borg), feel free to run borgmatic with -v 2 to up the verbosity and see exactly what it's passing to Borg.

  2. extract actually tries to do a dry-run restore of the most recent archive to make sure that it's extractable. For instance, with sufficient repository corruption, an extract check would fail and you'd know that you couldn't actually restore that most recent archive. So it's just a sort of smoke test.

1) As far as I know, data corruption of a de-duplicated block can indeed screw up multiple different backup archives. If this is a concern, I recommend backing up to multiple different locations. There's not a "full backup" feature like you're describing with borgmatic or Borg. But it would be pretty easy to declare multiple `repositories` in your borgmatic config file, and borgmatic will backup to each repository separately, thereby getting you some of the redundancy you're looking for. (Note that unless these are stored on separate disks or servers, it won't necessarily help redundancy much.) Also, there are a [couple of Borg FAQ entries](https://borgbackup.readthedocs.io/en/stable/faq.html?highlight=corruption#can-project-name-add-redundancy-to-the-backup-data-to-deal-with-hardware-malfunction) that may be of interest on this topic. 2) That's generally right, but there are a few more compression options as well. Run `borg help compression` for more info. Whether compression does anything depends entirely on the source data. If your source data is already pretty well compressed, extra compression won't help much. If you suspect there's a borgmatic bug here (e.g. not passing your compression option to Borg), feel free to run borgmatic with `-v 2` to up the verbosity and see exactly what it's passing to Borg. 3) `extract` actually tries to do a dry-run restore of the most recent archive to make sure that it's extractable. For instance, with sufficient repository corruption, an `extract` check would fail and you'd know that you couldn't actually restore that most recent archive. So it's just a sort of smoke test.
Author
  1. Fine!

  2. Ok thank you for that command!! So using "auto,lzma" seems like the best option to me.
    But what's the point of "auto"? to save time? isn't it risky though?

  3. Awesome! this tool is awesome
    My only concern is why not having it installable in a .deb archive too (in official distrib repos)?? too much work maybe

1) Fine! 2) Ok thank you for that command!! So using "auto,lzma" seems like the best option to me. But what's the point of "auto"? to save time? isn't it risky though? 3) Awesome! this tool is awesome My only concern is why not having it installable in a .deb archive too (in official distrib repos)?? too much work maybe
Owner

There are .deb packages of borgbackup, but not yet for borgmatic. See that thread for more information.

There are .deb packages of [borgbackup](https://packages.debian.org/sid/borgbackup), but not yet for [borgmatic](https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=887809). See that thread for more information.
Owner

Oh, as for auto compression, I don't know. My guess is that it might save a little CPU on decompression, because it's not unnecessarily (de-)compressing if compression won't actually save any space.

Oh, as for `auto` compression, I don't know. My guess is that it might save a little CPU on decompression, because it's not unnecessarily (de-)compressing if compression won't actually save any space.
Author

anyway thank you!!!!

anyway thank you!!!!
Sign in to join this conversation.
No Milestone
No Assignees
2 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: borgmatic-collective/borgmatic#84
No description provided.