pve-qemu-qoup

Author	SHA1	Message	Date
Wolfgang Bumiller	ed01236593	add patch: PVE Backup: allow passing max-workers performance setting Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-10-10 11:55:15 +02:00
Fiona Ebner	1976ca4607	savevm-async: set SAVE_STATE_DONE when closing state file was successful Without this change, it's necessary to send a second savevm-end QMP command after aborting a snaphsot, before a new savevm-start QMP command can succeed. In process_savevm_finalize(), no longer set an error in the abort scenario. If there already is another error, there's no need to override it. If canceling was done intentionally, qmp_savevm_end() is responsible for setting the state now. Reported-by: Mira Limbeck <m.limbeck@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-08-19 09:44:16 +02:00
Fiona Ebner	563c592898	savevm-async: avoid segfault when aborting snapshot Reported in the community forum[0]. For 6.1.0, there were a few changes to the coroutine-sleep API, but the adaptations in `f376b2b` ("update and rebase to QEMU v6.1.0") made a mistake. Currently, target_close_wait is NULL when passed to qemu_co_sleep_ns_wakeable(), which further passes it to qemu_co_sleep(), but there, it is dereferenced when trying to access the 'to_wake' member: > Thread 1 "kvm" received signal SIGSEGV, Segmentation fault. > qemu_co_sleep (w=0x0) at ../util/qemu-coroutine-sleep.c:57 To fix it, create a proper struct and pass its address instead. Also call qemu_co_sleep_wake unconditionally, because the NULL check (for the 'to_wake' member) is done inside the function itself. This patch is based on what the QEMU commits introducing the changes to the coroutine-sleep API did to the callers in QEMU: eaee072085 ("coroutine-sleep: allow qemu_co_sleep_wake that wakes nothing") 29a6ea24eb ("coroutine-sleep: replace QemuCoSleepState pointer with struct in the API") [0]: https://forum.proxmox.com/threads/112130/ Tested-by: Mira Limbeck <m.limbeck@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2022-08-19 09:44:14 +02:00
Fabian Ebner	0e88ec19db	add two more stable patches For the io_uring patch, it's not very clear which configurations can trigger it, but it should be rather uncommon. See qemu commit be6a166fde652589761cf70471bcde623e9bd72a for a bit more information. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-07-19 17:22:10 +02:00
Fabian Ebner	14ed554660	cherry-pick upstream fixes for 7.0.0 coming in via qemu-stable (except for the vdmk fix, which was tagged for-7.0 on the qemu-devel list, but didn't make it into the release). Also took the chance to switch the gluster fix to the version that made it into upstream. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-29 12:29:30 +02:00
Fabian Ebner	dc9827a6a4	update submodule and patches to 7.0.0 Only very minor changes needed: * Most patches in extra (or some version of them) are part of 7.0.0. * aio_set_fd_handler got an extra parameter, but can just pass NULL like we did for the related 'poll' parameter. See QEMU commit 826cc32423db2a99d184dbf4f507c737d7e7a4ae for more. * Add include for qemu/memalign.h in vma.c and vma-writer.c. * Add reverts for fixups of already reverted 0347a8fd4c ("block/rbd: implement bdrv_co_block_status") that came in with 7.0.0. Those fixups are not enough, see Proxmox bugzilla #4047. * Two trivial context changes for bitmap-mirror patches. * block_int.h got split up into multiple headers. * Some context changes in configure and meson.build. * Used the oppurtunity to squash fixup of bdrv_backuo_dump_create typo in a later patch into the patch introducing the function (had to move code to new header during rebase). Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-29 12:29:21 +02:00
Thomas Lamprecht	39e84ba82d	vma/alloc-track improvements Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-06-22 15:52:16 +02:00
Thomas Lamprecht	4fd0fa7fb3	re-export patches in normalized form iow. using: git format-patch --zero-commit --no-signature --no-numbered --diff-algorithm=myers ... Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-06-22 15:49:53 +02:00
Dominik Csapak	539e333eaa	add 'namespace' to BlockdevOptionsPbs so that we can use it for the -blockdev options (used for live-restore) Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>	2022-06-22 15:10:49 +02:00
Fabian Ebner	7bd4d8645a	fix #4101 : acquire job's aio context before calling job_unref Otherwise, we might run into an abort via bdrv_co_yield_to_drain() (can at least happen when a disk with iothread is used): > #0 0x00007fef4f5dece1 __GI_raise (libc.so.6 + 0x3bce1) > #1 0x00007fef4f5c8537 __GI_abort (libc.so.6 + 0x25537) > #2 0x00005641bce3c71f error_exit (qemu-system-x86_64 + 0x80371f) > #3 0x00005641bce3d02b qemu_mutex_unlock_impl (qemu-system-x86_64 + 0x80402b) > #4 0x00005641bcd51655 bdrv_co_yield_to_drain (qemu-system-x86_64 + 0x718655) > #5 0x00005641bcd52de8 bdrv_do_drained_begin (qemu-system-x86_64 + 0x719de8) > #6 0x00005641bcd47e07 blk_drain (qemu-system-x86_64 + 0x70ee07) > #7 0x00005641bcd498cd blk_unref (qemu-system-x86_64 + 0x7108cd) > #8 0x00005641bcd31e6f block_job_free (qemu-system-x86_64 + 0x6f8e6f) > #9 0x00005641bcd32d65 job_unref (qemu-system-x86_64 + 0x6f9d65) > #10 0x00005641bcd93b3d pvebackup_co_complete_stream (qemu-system-x86_64 + 0x75ab3d) > #11 0x00005641bce4e353 coroutine_trampoline (qemu-system-x86_64 + 0x815353) Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Acked-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-09 14:57:28 +02:00
Wolfgang Bumiller	7f4326d1dc	pbs cleanup fixes Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-08 13:10:51 +02:00
Wolfgang Bumiller	53bff441c5	delete patches which were dropped from the series file Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-06-08 13:07:04 +02:00
Fabian Ebner	dc265df350	add revert to work around performance regression when backing up large RBD disk resulting in QMP timeouts and very slow backups. The plan is to figure out (ideally together with upstream) a way to make the implementation of bdrv_co_block_status for RBD more efficient. But for now, revert the problematic change as a stop-gap measure. Upstream bug report: https://gitlab.com/qemu-project/qemu/-/issues/1026 Forum threads: https://forum.proxmox.com/threads/109272/ https://forum.proxmox.com/threads/109448/ https://forum.proxmox.com/threads/101334/ (partially) Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-05-19 09:23:38 +02:00
Wolfgang Bumiller	58a5492e9c	namespace support Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-05-12 13:49:35 +02:00
Thomas Lamprecht	309b5c1694	backport various fixes for gluster, qxl and vnc Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-05-11 10:40:14 +02:00
Thomas Lamprecht	f87d0523df	vma: allow partial restore Introduce a new map line for skipping a certain drive, of the form skip=drive-scsi0 Since in PVE, most archives are compressed and piped to vma for restore, it's not easily possible to skip reads. For the reader, a new skip flag for VmaRestoreState is added and the target is allowed to be NULL if skip is specified when registering. If the skip flag is set, no writes will be made as well as no check for duplicate clusters. Therefore, the flag is not set for verify. Originally-by: Fabian Ebner <f.ebner@proxmox.com> Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Acked-by: Wolfgang Bumiller <w.bumiller@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-25 10:07:37 +02:00
Thomas Lamprecht	2fd4ea2813	patches: update context Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-25 10:07:01 +02:00
Thomas Lamprecht	2653a5f029	vma: restore: call blk_unref for all opened block devices Originally-by: Fabian Ebner <f.ebner@proxmox.com> Link: https://lists.proxmox.com/pipermail/pve-devel/2022-April/052642.html Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-25 10:05:29 +02:00
Thomas Lamprecht	4de9440f87	various stable backports Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-22 10:22:39 +02:00
Thomas Lamprecht	c8ba14bed0	cherry-pick fix for passing some acpi slic tables Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-04-15 08:07:34 +02:00
Fabian Ebner	27199bd753	backup: add patch to initialize bcs bitmap early enough for PBS This is necessary for multi-disk backups where not all jobs are immediately started after they are created. QEMU commit 06e0a9c16405c0a4c1eca33cf286cc04c42066a2 did already part of the work, ensuring that new writes after job creation don't pass through to the backup, but not yet for the MIRROR_SYNC_MODE_BITMAP case which is used for PBS. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com> Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2022-03-03 11:37:17 +01:00
Fabian Ebner	f6d40bfdf4	add patch for loading a snapshot with qemu-img dd Will be used when cloning from a qcow2 efidisk. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-02-15 14:03:07 +01:00
Fabian Ebner	107132becc	fix getopt-string when introducing -n option for qemu-img dd The colon after U is wrong, because it doesn't take an argument. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-02-15 14:03:07 +01:00
Fabian Ebner	4567474e95	update submodule and patches to 6.2.0 Notable changes: * bdrv_co_p{discard,readv,writev,write_zeroes} function signatures changed, to using int64_t for offsets/bytes and some still had int rather than BrdvRequestFlags for the flags. * job_cancel_sync now has a force parameter. Commit messages in 73895f3838cd7fdaf185cf1dbc47be58844a966f 4cfb3f05627ad82af473e7f7ae113c3884cd04e3 sound like using force=true makes more sense. * Added 3 patches coming in via qemu-stable tag, most important one is to work around a librbd issue. * Added another 3 patches from qemu-devel to fix issue leading to crash when live migrating with iothread. * cluster_size calculation helper changed (see patch pve/0026). * QAPI's if conditionals now use 'CONFIG_FOO' rather than 'defined(CONFIG_FOO)' Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-02-15 14:03:07 +01:00
Fabian Ebner	2bf61c3eb6	vma: create: register all streams before entering coroutines Otherwise, the header might already get written by a coroutine and registering further streams will fail after that. Also adds a missing g_list_free call for the other GList that's used. Reported in the community forum: https://forum.proxmox.com/threads/104744/ Reproducer script (increase beyond 30 if the issue isn't triggered yet): > #!/usr/bin/perl > > my $dir = "./vma-create-bug"; > mkdir $dir; > > my $archive_path = "$dir/vzdump-qemu-104-2202_02_02-00_00_00.vma"; > unlink $archive_path; > > my $cmd = "vma create $archive_path -v"; > for (my $i = 0; $i < 30; $i++) { > system("truncate -s 1M $dir/drive-virtio$i.img"); > $cmd .= " drive-virtio$i=$dir/drive-virtio$i.img"; > } > system($cmd); Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2022-02-14 15:38:58 +01:00
Thomas Lamprecht	ddbf7a872d	update submodule and patches to 6.1.1 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2022-01-13 10:56:39 +01:00
Fabian Ebner	570d4ad51d	fix #3738 : cherry-pick "block: introduce max_hw_iov for use in scsi-generic" which fixes the bad commit 18473467d55a20d643b6c9b3a52de42f705b4d35 that was tracked down via bisecting, and has a Cc for qemu-stable as well. Issue was easy enough to reproduce with a single virtio-block disk using a few runs of dd if=/dev/urandom of=file bs=1M count=1000 Commit cc071629539dc1f303175a7e2d4ab854c0a8b20f upstream. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2021-12-01 15:34:27 +01:00
Dominik Csapak	c5e8e7c998	buildsys: fix build-dependencies on headers for 'vma' and 'pbs_restore' both of them depend on generated header files, so we have to specify them as sources. Otherwise, it happens (at least on some machines) that they will be compiled before the headers are generated, aborting the build. Signed-off-by: Dominik Csapak <d.csapak@proxmox.com>	2021-11-18 08:11:57 +01:00
Fabian Grünbichler	7cf6b60926	fix #3728 : handle machine without type libguestfs starts their helper VMs with `-machine accel=..` without a machine type, and our pve version suffix handling would segfault in that case. there might be other scripted use cases that are affected as well. this regression was introduced with the rebase of our patch set on top of 6.1.0 Fixes: `f376b2b9e2` Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2021-11-17 17:20:26 +01:00
Fabian Grünbichler	edbcc10a69	cherry-pick segfault fix this was reported multiple times in our forums[1 with backtraces, 2 & 3 with same log messages], fix is taken from upstream master. 1: https://forum.proxmox.com/threads/pve-7-0-14-1-vm-not-running-live-migration-kills-vm-post-ssd-move-pre-ram-move.99704/ 2: https://forum.proxmox.com/threads/proxmox-7-0-14-1-crashes-vm-during-migrate-to-other-host.99678 3: https://forum.proxmox.com/threads/cannot-migrate-between-zfs-and-ceph.99685/#post-430152 Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2021-11-16 09:23:43 +01:00
Stefan Reiter	af64ed13eb	add fixup patch for qxl migration logic Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-10-13 17:58:18 +02:00
Stefan Reiter	f376b2b9e2	update and rebase to QEMU v6.1.0 Very clean rebase, only the +pve version handling needed manual fixing. Drops two applied patches from extra/ and adds one new from upstream (extra/0001*, fixes VNC over unix sockets) as well as 3 of my own for allowing password changes on custom VNC displays again (as seen and reviewed upstream, but not yet applied). Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-10-11 15:13:26 +02:00
Stefan Reiter	26eee146bc	add temporary QMP race fix same as the initial version sent to qemu-devel, it won't be the final fix we plan to upstream but it should be enough band-aid to workaround how PVE uses the QMP. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com> [ Thomas: add a bit reasoning to commit message body ] Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-09-06 07:28:07 +02:00
Wolfgang Bumiller	277d33454f	drop patch force-disabling smm This drops debian/patches/pve/0005-PVE-Config-smm_available-false.patch (and renumbers the remaining patches) From what I could gather, this patch was originally added due to issues with old kernels. Now we have users which seem to run into issues with the patch. All this does is toggle an option, and it's available via a qemu CLI option anyway, so if dropping this patch causes issues for some people we can just add an option to qemu-server & UI control smm explicitly. Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com> Cc: Alexandre Derumier <aderumier@odiso.com> Tested-by: Stefan Reiter <s.reiter@proxmox.com>	2021-08-24 11:19:05 +02:00
Fabian Ebner	0114d3cd02	io_uring: resubmit when result is -EAGAIN Linux SCSI can throw spurious -EAGAIN in some corner cases in its completion path, which will end up being the result in the completed io_uring request. Resubmitting such requests should allow block jobs to complete, even if such spurious errors are encountered. Signed-off-by: Fabian Ebner <f.ebner@proxmox.com>	2021-07-29 11:51:57 +02:00
Stefan Reiter	8dca018b68	udpate and rebase to QEMU v6.0.0 Mostly minor changes, bigger ones summarized: * QEMU's internal backup code now uses a new async system, which allows parallel requests - the default max_workers settings is 64, I chose less, since 64 put enough stress on QEMU that the guest became practically unusable during the backup, and 16 still shows quite a nice measureable performance improvement. Little code changes for us though. * 'malformed' QAPI parameters/functions are now a build error (i.e. using '_' vs '-'), I chose to just whitelist our calls in the name of backwards compatibility. * monitor OOB race fix now uses the upstream variant, cherry-picked from origin/master since it's not in 6.0 by default * last patch fixes a bug with snapshot rollback related to the new yank system Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-05-28 11:29:44 +02:00
Thomas Lamprecht	0a88214b72	alloc track: use coroutine version of bdrv_pwrite_zeroes as we're in a coroutine here too Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-04-06 16:31:53 +02:00
Thomas Lamprecht	76e464784e	pbs block driver: run read in the AIO context of the bs Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-04-06 16:31:53 +02:00
Thomas Lamprecht	b36e8acc31	alloc track: acquire BS AIO context during dropping ran into this when live-restoring a backup configured for IO-threads, got the good ol': > qemu: qemu_mutex_unlock_impl: Operation not permitted error. Checking out the history of the related bdrv_backup_top_drop(*bs) method, we can see that it used to do the AIO context acquiring too, but in the backup path this was problematic and was changed to be higher up in the call path in a upstream series from Stefan[0]. That said, this is a completely different code path and it is safe to do so here. We always run from the main threads's AIO context here and we call it only indirectly once, guarded by checking for `s->drop_state == DropNone` and set `s->drop_state = DropRequested` shortly before we schedule the track_drop() in a bh. [0]: https://lists.gnu.org/archive/html/qemu-devel/2020-03/msg09139.html Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-04-06 16:27:48 +02:00
Thomas Lamprecht	aa42ea267e	alloc track: keep track_drop() closer to similar block drivers Reads just nicer with a drain begin and end call. Also clearing the backing link of the alloc track BDS makes it closer to bdrv_backup_top_drop() with which this driver has a bit in common. Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-04-06 16:27:37 +02:00
Stefan Reiter	e79be6c6c4	add upstream fixes for qmp_block_resize cherry-picked cleanly from 6.0 development tree, fixes an issue with resizing RBD drives (and reportedly also on krbd or potentially other storage backends) with iothreads. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-03-30 18:14:37 +02:00
Stefan Reiter	bb751cab32	Add tentative fix for QMP hang Not exactly as sent upstream[0] since we're missing a change in our v5.2.0 branch (irrelevant for us), but functionally works the same. [0] https://lists.gnu.org/archive/html/qemu-devel/2021-03/msg07590.html	2021-03-22 16:52:40 +01:00
Stefan Reiter	677d0d169f	add alloc-track block driver patch See added patches for more info, overview: 0044: slightly increase PBS performance by reducing allocations 0045: slightly increase block-stream performance for Ceph 0046: don't crash with block-stream on RBD 0047: add alloc-track driver for live restore Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-03-16 20:53:18 +01:00
Stefan Reiter	e9b36665c7	fix saving and loading dirty bitmaps in snapshots Saving dirty bitmaps from our savevm-async code didn't work, since we use a coroutine which holds the iothread mutex already (upstream savevm is sync, migration uses a thread). Release the mutex before calling the one function that (according to it's documentation) requires the lock to not be held: qemu_savevm_state_pending. Additionally, loading dirty bitmaps requires a call to dirty_bitmap_mig_before_vm_start after "loadvm", which the upstream savevm does explicitly afterwards - do that too. This is exposed via the query-proxmox-support property "pbs-dirty-bitmap-savevm". Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-03-16 20:44:06 +01:00
Stefan Reiter	40e6b6e5a5	add ACPI compat patch for 5.1 and older machine types Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-03-05 15:20:14 +01:00
Stefan Reiter	2413972b46	move bitmap-mirror patches to seperate folder ...instead of having them in the middle of the backup related patches. These might (hopefully) become upstream at some point as well. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-03-03 14:29:05 +01:00
Stefan Reiter	0c893fd820	clean up pve/ patches by squashing patches of patches No functional change intended. Signed-off-by: Stefan Reiter <s.reiter@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-03-03 14:29:05 +01:00
Stefan Reiter	4194124719	pbs-restore: unref/close target block backend Use blk_unref to drop the last reference, which will close the block backend and flush all caches and outstanding writes. This is especially important for restoring to Ceph, as the userspace librbd caches will not be flushed if the application exits immediately, leading to potentially incomplete restores. Reported-by: Eneko Lacunza <elacunza@binovo.es> Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-02-24 19:02:07 +01:00
Thomas Lamprecht	42a90c4e1c	d/patches: backport virtiofsd security fix Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2021-02-24 19:02:07 +01:00
Stefan Reiter	0b8da68824	add PBS master key support Signed-off-by: Stefan Reiter <s.reiter@proxmox.com>	2021-02-12 10:47:14 +01:00

1 2 3 4

155 Commits