Lxc copy --refresh workaround: efficient incremental ZFS snapshot sync with send/receive

usrflo · February 23, 2021, 1:25pm

I need to update/sync cold and hot standby containers on separate machines efficiently and I realized that lxc copy --refresh can’t help here because rsync is used in the refresh case and this can be very slow (slower than an initial full copy).

References: this requirement was mentioned in Lxc copy --refresh error and Future incremental copy - (lxc refresh) - ZFS backend already.

So I scripted the following files for an incremental ZFS snapshot send / receive to a backup server. In my tests the snapshot sync even worked with a running target container that finally is restored to the newly synchronized snapshot. This sync’ed container intentionally does not include further snapshots that are created on the source machine.

Now I’d like to discuss the completeness and relative stability of this approach regarding LXD updates as well as future simplifications e.g. remove the direct execution of zfs send on the remote source machine / switch over to lxc functionality.

gist.github.com

https://gist.github.com/usrflo/f4f62e886490b2efed2a1503aed3e228

lxc-sync-config

LXCBIN=/snap/bin/lxc
LXDBIN=/snap/bin/lxd
ZFSBIN=/sbin/zfs

SYNCSNAPNAME=bsync
LASTSYNCSNAPNAME=bsync-last

ZFSROOT=tank/containers/
MOUNTROOT=/var/snap/lxd/common/lxd/storage-pools/

lxc-sync-containers

# Array of containers to synchronize

declare -A SYNCCONT
declare -A SYNCCONTDETAIL

# sample container 1
SYNCCONTDETAIL["lxcremote"]='srcserver'
SYNCCONTDETAIL["sshcmd"]='ssh -p2222 -oStrictHostKeyChecking=no root@srcserver'
string=$(declare -p SYNCCONTDETAIL)
SYNCCONT['samplecont1']=${string}

This file has been truncated. show original

lxc-sync.sh

#!/bin/bash

# This script is meant to run on the sync target server that syncs remote container snapshots to local;
# remote and local containers may be running; a remote container is snapshotted and this
# snapshot will be restored on the local receiving container.
#
# IMPORTANT:
# - in the local lxc configuration all remotes needs to be configured (see `lxc remote list`)
# - ssh access to the remote/source servers needs to be available to execute remote zfs send
#

This file has been truncated. show original

usrflo · May 7, 2021, 7:11am

Update on this topic: the ZFS snapshot sync that is implemented with the scripts I posted in February is running stable in test setups since 2 months. After some minor changes I will use this in production for hot and cold standby containers on ZFS storage volumes.

bodleytunes · June 27, 2021, 11:37am

I do a similar thing but use Syncoid instead.

danboid · October 28, 2021, 1:02pm

Hi Florian

I have a couple of containers running under LXD using a ZFS storage pool so I too would like to see LXD integrate something like this so that we could take advantage of zfs send and receive to sync containers or VMs running on ZFS based LXD hosts.

How well does your script integrate with LXD’s (ZFS) snapshot support? I’m not very clear on how they link up yet. When I copy my container from one LXD server to another using this script, will the number of snapshots as output by lxc list match?

How would you recommend I do the initial copying of my containers from one LXD host to another, if I wanted to preserve the ZFS snapshots? These wouldn’t be preserved by lxc copy by the sounds of things but maybe combining it with zfs send will do the trick somehow?

usrflo · October 29, 2021, 7:55am

Hi Dan,

in the first run the script copies the state of an initial snapshot of the remote source container to the local target container.
On subsequent syncs it uses zfs send to sync the delta between a new snapshot and the last snapshot on the source container. So this sync works relatively fast compared to the rsync/copy --refresh approach of lxd. On one container I’m using this sync every 30 minutes to be up-to-date for a hot-standby failover.

So you won’t see all the snapshots that you created on the source container, i.e lxc list won’t match; you will just have the latest snapshot named ‘bsync’ that was created by the script.

Take a deeper look into the script to see what needed to be hacked so that this zfs send approach became usable. I don’t see that a match of snapshots of the source container is possible this way.

BTW: I just updated https://gist.github.com/usrflo/f4f62e886490b2efed2a1503aed3e228 so that a sync operation is executed on a single container only. This enables:

different sync source servers
different sync intervals (by specific cron jobs)

Regards,
Florian

danboid · November 1, 2021, 1:07pm

Hi Florian

Thanks for clarifying your script! On further consideration, I think I’d prefer to backup to a machine with a ZFS pool that isn’t running LXD as I don’t need a hot standby. Might you be able to answer my latest questions regarding LXD and ZFS in this thread?

Thanks!

danboid · November 2, 2021, 1:58pm

Hi again Florian

It looks like I’ll be better off basing my LXD backups around lxd recovery but it appears that hasn’t made it into the stable LXD branch yet.

Your script makes use of something called bsync and bsync-last. I presume these are some custom scripts you wrote to fetch the latest snapshot name? Could you share those too please?

stgraber · November 2, 2021, 2:47pm

4.0.8 has it and it’s being rolled out now (phased rollout should take around 48h to hit everyone).

danboid · November 2, 2021, 4:20pm

OK great! I don’t need to test it yet, that can wait a few days at least. I’d rather not update to the beta snap.

usrflo · November 5, 2021, 7:20am

Hi Dan,

“bsync” and “bsync-last” are constants only that are used as snapshot names to differ between the last sync’ed state and the new state (zfs snapshot diff).

george · January 9, 2022, 8:50am

@usrflo thank you so much for sharing this!

My two cents:

Once again about incremental zfs backups -- do we have them? Or can we have them? ))
- https://gist.github.com/gf-mse/d07069390c7a4fb6770bba01f81935a5

This assumes that the “backup” container was originally imported using lxc export --optimized-storage at the origin ( with a subsequent lxc import ... at destination ), and that the incremental deltas are sent via zfs send | zfs recv ; the script tries to go through the list of the new snapshots at destination, and to lxd sql add them to the “global” database, if they are missing.

PS. I definitely wouldn’t call it production quality, more like a (working) proof of concept thing.

One thing it misses in this version is that it uses the present date as a stub, when in fact it shall get the snapshot date from zfs properties, and convert it to an sqlite timestamp.

Not too hard to add, but at the moment I’m more concerned about the fragility of the whole lxd sql approach, as I mentioned here.

avidandrew · September 18, 2022, 4:13am

I’m also interested in this feature for situations where the destination server with the backup zpool doesn’t have LXD installed. I think if this PR can get merged, then it should be possibly to use syncoid to do this as follows:

syncoid -r --no-sync-snap --delete-target-snapshots sourcepool user@destination:destinationpool

The --no-sync-snap argument will ensure that syncoid doesn’t create its own temporary snapshot for the sync process (which it normally does) but instead just syncs the existing snapshots that LXD has created. Then, --delete-target-snapshots will ensure that no older (and therefore extraneous) snapshots are left on the destination, so the snapshot list on the destination should mirror the zpool on the source.

michacassola · November 28, 2022, 5:54pm

@usrflo What happens if you don’t lxd sql global "UPDATE...[]"?
Shouldn’t have lxc restore have either worked or not and done that stuff?

usrflo · November 29, 2022, 4:56pm

Without this SQL update the old snapshot creation date (the initial sync date) is kept.
The lxc restore is independent from the snapshot update via direct ZFS interaction. The restore assures the instance is started on the snapshot that was transferred directly with ZFS send/recv.

tomp · November 29, 2022, 5:04pm

LXD does support optimised refresh now when copying between ZFS pools.
It uses the snapshots that exist to avoid copying the whole instance again.

usrflo · December 4, 2022, 11:32am

Thanks @tomp , I missed that LXD 5.0 LTS already supports optimized refresh between ZFS pools like I now read at LXD 5.0 LTS has been released

But trying to switch over to the LXD-integrated approach revealed that a ‘copy --refresh’ requires the source container to be stopped. As I am using my own solution on productive, running containers this LXD-integrated approach is no working solution.

Is there a way to ‘copy --refresh’ running containers? Otherwise I should stay with my sync script.

tomp · December 5, 2022, 12:04pm

I just tried this on LXD 5.8 and it worked fine with running instance:

lxc list c1
+------+---------+--------------------+-----------------------------------------------+-----------+-----------+
| NAME |  STATE  |        IPV4        |                     IPV6                      |   TYPE    | SNAPSHOTS |
+------+---------+--------------------+-----------------------------------------------+-----------+-----------+
| c1   | RUNNING | 10.21.203.9 (eth0) | fd42:ffdb:caff:baf7:216:3eff:fe63:9a92 (eth0) | CONTAINER | 2         |
+------+---------+--------------------+-----------------------------------------------+-----------+-----------+

lxc copy c1 v1: --refresh

lxc list v1:
+------+---------+------+------+-----------+-----------+
| NAME |  STATE  | IPV4 | IPV6 |   TYPE    | SNAPSHOTS |
+------+---------+------+------+-----------+-----------+
| c1   | STOPPED |      |      | CONTAINER | 2         |
+------+---------+------+------+-----------+-----------+

usrflo · December 5, 2022, 12:38pm

Ok, it seems to me that the optimized copy support on running containers was added somewhere between LXD 5.0 and LXD 5.8.

I will check after an update.