pve-qemu-qoup

Author	SHA1	Message	Date
Fiona Ebner	f1eed34ac7	update submodule and patches to QEMU 8.2.2 This version includes both the AioContext lock and the block graph lock, so there might be some deadlocks lurking. It's not possible to disable the block graph lock like was done in QEMU 8.1, because there are no changes like the function bdrv_schedule_unref() that require it. QEMU 9.0 will finally get rid of the AioContext locking. During live-restore with a VirtIO SCSI drive with iothread there is a known racy deadlock related to the AioContext lock. Not new [1], but not sure if more likely now. Should be fixed in QEMU 9.0. The block graph lock comes with annotations that can be checked by clang's TSA. This required changes to the block drivers, i.e. alloc-track, pbs, zeroinit as well as taking the appropriate locks in pve-backup, savevm-async, vma-reader. Local variable shadowing is prohibited via a compiler flag now, required slight adaptation in vma.c. Major changes only affect alloc-track: * It is not possible to call a generated co-wrapper like bdrv_get_info() while holding the block graph lock exclusively [0], which does happen during initialization of alloc-track when the backing hd is set and the refresh_limits driver callback is invoked. The bdrv_get_info() call to get the cluster size is moved to directly after opening the file child in track_open(). The important thing is that at least the request alignment for the write target is used, because then the RMW cycle in bdrv_pwritev will gather enough data from the backing file. Partial cluster allocations in the target are not a fundamental issue, because the driver returns its allocation status based on the bitmap, so any other data that maps to the same cluster will still be copied later by a stream job (or during writes to that cluster). * Replacing the node cannot be done in the track_co_change_backing_file() callback, because it is a coroutine and cannot hold the block graph lock exclusively. So it is moved to the stream job itself with the auto-remove option not having an effect anymore (qemu-server would always set it anyways). In the future, there could either be a special option for the stream job, or maybe the upcoming blockdev-replace QMP command can be used. Replacing the backing child is actually already done in the stream job, so no need to do it in the track_co_change_backing_file() callback. It also cannot be called from a coroutine. Looking at the implementation in the qcow2 driver, it doesn't seem to be intended to change the backing child itself, just update driver-internal state. Other changes: * alloc-track: Error out early when used without auto-remove. Since replacing the node now happens in the stream job, where the option cannot be read from (it's internal to the driver), it will always be treated as 'on'. Makes sure to have users beside qemu-server notice the change (should they even exist). The option can be fully dropped in the future while adding a version guard in qemu-server. * alloc-track: Avoid seemingly superfluous child permission update. Doesn't seem necessary nowadays (maybe after commit "alloc-track: fix deadlock during drop" where the dropping is not rescheduled and delayed anymore or some upstream change). Replacing the block node will already update the permissions of the new node (which was the file child before). Should there really be some issue, instead of having a drop state, this could also be just based off the fact whether there is still a backing child. Dumping the cumulative (shared) permissions for the BDS with a debug print yields the same values after this patch and with QEMU 8.1, namely 3 and 5. * PBS block driver: compile unconditionally. Proxmox VE always needs it and something in the build process changed to make it not enabled by default. Probably would need to move the build option to meson otherwise. * backup: job unreferencing during cleanup needs to happen outside of coroutine, so it was moved to before invoking the clean * mirror: Cherry-pick stable fix to avoid potential deadlock. * savevm-async: migrate_init now can fail, so propagate potential error. * savevm-async: compression counters are not accessible outside migration/ram-compress now, so drop code that prophylactically set it to zero. [0]: https://lore.kernel.org/qemu-devel/220be383-3b0d-4938-b584-69ad214e5d5d@proxmox.com/ [1]: https://lore.kernel.org/qemu-devel/e13b488e-bf13-44f2-acca-e724d14f43fd@proxmox.com/ Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-04-26 14:14:06 +02:00
Thomas Lamprecht	59ab88deb6	bump version to 8.1.5-5 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2024-04-11 20:05:02 +02:00
Thomas Lamprecht	20209d8d73	implement support for backup fleecing Excerpt from Fiona's v3 cover-letter [0]: When a backup for a VM is started, QEMU will install a "copy-before-write" filter in its block layer. This filter ensures that upon new guest writes, old data still needed for the backup is sent to the backup target first. The guest write blocks until this operation is finished so guest IO to not-yet-backed-up sectors will be limited by the speed of the backup target. With backup fleecing, such old data is cached in a fleecing image rather than sent directly to the backup target. This can help guest IO performance and even prevent hangs in certain scenarios, at the cost of requiring more storage space. With this series it will be possible to enable backup-fleecing via e.g. `vzdump 123 --fleecing enabled=1,storage=local-lvm` with fleecing images created on the storage `local-lvm`. The fleecing storage should be a fast local storage which supports thin-provisioning and discard. If the storage supports qcow2, that is used as the fleecing image format. If the underlying file system does not support discard, with qcow2 and preallocation=off, at least already allocated parts of the image can be re-used later. Fleecing images are created by qemu-server via pve-storage and attached to QEMU before the backup starts, and cleaned up after the backup finished or failed. The naming schema for fleecing images is 'vm-ID-fleece-N(.FORMAT)'. The allocated images are recorded in the guest configuration, so that even after a hard failure, clean-up can be re-attempted. While not too bad, it's a non-trivial amount of code and I'm not 100% sure about the cost-benefit, so sending those as RFC. The fleecing image needs to be the exact same size as the source, but luckily, an explicit size can be specified when attaching a raw image to QEMU so there are no size issues when using storages that have coarser allocation/round up. For qcow2, it seems that virtual size can be nearly arbitrary (i.e. modulo 512 byte granularity) during allocation. [0]: https://lists.proxmox.com/pipermail/pve-devel/2024-April/062815.html Originally-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2024-04-11 20:05:02 +02:00
Thomas Lamprecht	47bdd04244	bump version to 8.1.5-4 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2024-03-12 14:08:48 +01:00
Thomas Lamprecht	8dd76cc52d	backup: factor out & clean up gathering device info into helper Squash the two original patches [0][1] from Fiona, which got send separate to be easier to review, into the big patch that adds the Proxmox backup integration. [0]: https://lists.proxmox.com/pipermail/pve-devel/2024-January/061479.html [1]: https://lists.proxmox.com/pipermail/pve-devel/2024-January/061478.html Originally-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2024-03-12 13:55:00 +01:00
Fiona Ebner	cd7676f3e6	backup: avoid bubbling up first ECANCELED error With pvebackup_propagate_error(), the first error wins. When one job in the transaction fails, it is expected that later jobs get the ECANCELED error. Those are not interesting and by skipping them a more interesting error, which is likely the actual root cause, can win. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-03-12 13:20:28 +01:00
Fiona Ebner	862b46e3e0	cleanup: squash backup dump driver change into patch introducing the driver Makes it simpler and shorter. Still results in the same code after applying both patches in question. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-03-12 13:19:30 +01:00
Fiona Ebner	061e9ceb36	fix patch for accepting NULL qiov when padding All callers of the function pass an address, so dereferencing once before checking for NULL is required. It's also necessary to update bytes and offset nevertheless, so the request will actually be aligned later and not trigger an assertion failure. Seems like this was accidentally broken in `8dca018` ("udpate and rebase to QEMU v6.0.0") and this is effectively a revert to the original version of the patch. The qiov functions changed back then, which might've been the reason Stefan tried to simplify the patch. Should fix live-import for certain kinds of VMDK images. Reported-by: Wolfgang Bumiller <w.bumiller@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-03-12 13:11:21 +01:00
Thomas Lamprecht	0d4462207b	bump version to 8.1.5-3 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2024-02-21 20:11:27 +01:00
Fiona Ebner	ed159bc32a	add patch to fix deadlock with VirtIO block and iothread during QMP stop Backported from commit bfa36802d1 ("virtio-blk: avoid using ioeventfd state in irqfd conditional") because the rework/rename dataplane -> ioeventfd didn't happen yet. Reported in the community forum [0] and reproduced doing a backup loop to PBS with suspend mode with fio doing heavy IO in the guest and using an RBD storage (with krbd). [0]: https://forum.proxmox.com/threads/141320 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-02-21 20:09:22 +01:00
Fiona Ebner	86460aef76	fix #4507 : add patch to automatically increase NOFILE soft limit In many configurations, e.g. multiple vNICs with multiple queues or with many Ceph OSDs, the default soft limit of 1024 is not enough. QEMU is supposed to work fine with file descriptors >= 1024 and does not use select() on POSIX. Bump the soft limit to the allowed hard limit to avoid issues with the aforementioned configurations. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-02-06 10:33:12 +01:00
Thomas Lamprecht	676adda3c6	bump version to 8.1.5-2 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2024-02-02 19:41:31 +01:00
Thomas Lamprecht	4ff04bdfa5	work around stuck guest IO with iothread and VirtIO block/SCSI This essentially repeats commit `6b7c181` ("add patch to work around stuck guest IO with iothread and VirtIO block/SCSI") with an added fix for the SCSI event virtqueue, which requires special handling. This is to avoid the issue [3] that made the revert `2a49e66` ("Revert "add patch to work around stuck guest IO with iothread and VirtIO block/SCSI"") necessary the first time around. When using iothread, after commits 1665d9326f ("virtio-blk: implement BlockDevOps->drained_begin()") 766aa2de0f ("virtio-scsi: implement BlockDevOps->drained_begin()") it can happen that polling gets stuck when draining. This would cause IO in the guest to get completely stuck. A workaround for users is stopping and resuming the vCPUs because that would also stop and resume the dataplanes which would kick the host notifiers. This can happen with block jobs like backup and drive mirror as well as with hotplug [2]. Reports in the community forum that might be about this issue[0][1] and there is also one in the enterprise support channel. As a workaround in the code, just re-enable notifications and kick the virt queue after draining. Draining is already costly and rare, so no need to worry about a performance penalty here. Take special care to attach the SCSI event virtqueue host notifier with the _no_poll() variant like in virtio_scsi_dataplane_start(). This avoids the issue from the first attempted fix where the iothread would suddenly loop with 100% CPU usage whenever some guest IO came in [3]. This is necessary because of commit 38738f7dbb ("virtio-scsi: don't waste CPU polling the event virtqueue"). See [4] for the relevant discussion. [0]: https://forum.proxmox.com/threads/137286/ [1]: https://forum.proxmox.com/threads/137536/ [2]: https://issues.redhat.com/browse/RHEL-3934 [3]: https://forum.proxmox.com/threads/138140/ [4]: https://lore.kernel.org/qemu-devel/bfc7b20c-2144-46e9-acbc-e726276c5a31@proxmox.com/ Link: https://lore.kernel.org/qemu-devel/20240202153158.788922-1-hreitz@redhat.com/ Originally-by: Fiona Ebner <f.ebner@proxmox.com> [ TL: Update to v2 and rebased patch series handling to v8.1.5 ] Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2024-02-02 19:35:34 +01:00
Thomas Lamprecht	12b69ed9c5	bump version to 8.1.5-1 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2024-02-02 19:08:16 +01:00
Fiona Ebner	5e8903f875	stable fixes for corner case in i386 emulation and crash with VNC clipboard Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-02-02 19:06:29 +01:00
Fiona Ebner	4b7975e75d	update submodule and patches to QEMU 8.1.5 Most notable fixes from a Proxmox VE perspective are: * "virtio-net: correctly copy vnet header when flushing TX" To prevent a stack overflow that could lead to leaking parts of the QEMU process's memory. * "hw/pflash: implement update buffer for block writes" To prevent an edge case for half-completed writes. This potentially affected EFI disks. * Fixes to i386 emulation and ARM emulation. No changes for patches were necessary (all are just automatic context changes). Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2024-02-02 19:06:29 +01:00
Fiona Ebner	f366bb97ae	bump version to 8.1.2-6 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-12-15 14:26:09 +01:00
Fiona Ebner	2a49e667ba	Revert "add patch to work around stuck guest IO with iothread and VirtIO block/SCSI" This reverts commit `6b7c1815e1`. The attempted fix has been reported to cause high CPU usage after backup [0]. Not difficult to reproduce and it's iothreads getting stuck in a loop. Downgrading to pve-qemu-kvm=8.1.2-4 helps which was also verified by Christian, thanks! The issue this was supposed to fix is much rarer, so revert for now, while upstream is still working on a proper fix. [0]: https://forum.proxmox.com/threads/138140/ Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-12-15 14:16:26 +01:00
Thomas Lamprecht	c6eb05a799	bump version to 8.1.2-5 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-12-11 16:59:16 +01:00
Fiona Ebner	dfac4f3593	pick fix for potential deadlock with QMP resize and iothread While the patch gives bdrv_graph_wrlock() as an example where the issue can manifest, something similar can happen even when that is disabled. Was able to reproduce the issue with while true; do qm resize 115 scsi0 +4M; sleep 1; done while running fio --name=make-mirror-work --size=100M --direct=1 --rw=randwrite \ --bs=4k --ioengine=psync --numjobs=5 --runtime=1200 --time_based in the VM. Fix picked up from: https://lists.nongnu.org/archive/html/qemu-devel/2023-12/msg01102.html Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-12-11 16:56:50 +01:00
Fiona Ebner	6b7c1815e1	add patch to work around stuck guest IO with iothread and VirtIO block/SCSI When using iothread, after commits 1665d9326f ("virtio-blk: implement BlockDevOps->drained_begin()") 766aa2de0f ("virtio-scsi: implement BlockDevOps->drained_begin()") it can happen that polling gets stuck when draining. This would cause IO in the guest to get completely stuck. A workaround for users is stopping and resuming the vCPUs because that would also stop and resume the dataplanes which would kick the host notifiers. This can happen with block jobs like backup and drive mirror as well as with hotplug [2]. Reports in the community forum that might be about this issue[0][1] and there is also one in the enterprise support channel. As a workaround in the code, just re-enable notifications and kick the virt queue after draining. Draining is already costly and rare, so no need to worry about a performance penalty here. This was taken from the following comment of a QEMU developer [3] (in my debugging, I had already found re-enabling notification to work around the issue, but also kicking the queue is more complete). [0]: https://forum.proxmox.com/threads/137286/ [1]: https://forum.proxmox.com/threads/137536/ [2]: https://issues.redhat.com/browse/RHEL-3934 [3]: https://issues.redhat.com/browse/RHEL-3934?focusedId=23562096&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-23562096 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-12-11 16:56:50 +01:00
Thomas Lamprecht	24d732ac0f	bump version to 8.1.2-4 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-11-22 14:28:25 +01:00
Fiona Ebner	df2cc786ee	add fix for vnc clipboard This fixes the host->guest direction with noNVC as a client (and likely others). Reported-by: Friedrich Weber <f.weber@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Tested-by: Friedrich Weber <f.weber@proxmox.com>	2023-11-22 14:19:45 +01:00
Thomas Lamprecht	38726d3473	bump version to 8.1.2-3 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-11-20 10:35:52 +01:00
Fiona Ebner	89b46e17ec	fix #5054 : backport fix for software reset with SATA The issue prevented FreeBSD 14 VMs with SATA disk from booting. The commit it fixes e2a5d9b3d9c3 ("hw/ide/ahci: simplify and document PxCI handling") is part of stable 8.1.2. The patch was already applied to the block branch upstream: https://lists.nongnu.org/archive/html/qemu-devel/2023-11/msg02711.html Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Tested-by: Friedrich Weber <f.weber@proxmox.com>	2023-11-20 10:35:00 +01:00
Thomas Lamprecht	33b22c3fe0	bump version to 8.1.2-2 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-11-17 11:55:26 +01:00
Fiona Ebner	c38e337f5d	revert commit breaking VirtIO network adapters for certain versions of Windows As reported in the community forum [0] and reproduced locally this breaks VirtIO network adapters in (at least) the German ISO of Windows Server 2022. The fix itself was for > Issue is not fatal but as result acpi-index/"PCI Label ID" property > is either not shown in device details page or shows incorrect value. so revert and tolerate that as a stop-gap, rather than have the devices not working at all. [0]: https://forum.proxmox.com/threads/92094/post-605684 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-11-17 11:52:52 +01:00
Fiona Ebner	763949965f	fix #4710 : vma create: don't use O_DIRECT for tmpfs The implementation of the helper is_path_tmpfs() is similar to the existing qemu_fd_getfs() function in util/mmap-alloc.c, which unfortunately only takes an existing fd. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-11-07 16:37:34 +01:00
Thomas Lamprecht	1807330a6f	bump version to 8.1.2-1 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-10-24 15:01:23 +02:00
Thomas Lamprecht	a31ab74058	d/control: add python3-venv as build-dependency Seems to be required since commit 81e2b198a8 ("configure: create a python venv unconditionally"). Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-10-24 15:01:23 +02:00
Fiona Ebner	b39f726f31	d/control: add versioned Breaks for qemu-server <= 8.0.6 Upstream QEMU commit 4271f40383 ("virtio-net: correctly report maximum tx_queue_size value") made setting an invalid tx_queue_size for a non-vDPA/vhost-user net device a hard error. Now, qemu-server before commit 089aed81 ("cfg2cmd: netdev: fix value for tx_queue_size") did just that, so the newer QEMU version would break start-up for most VMs (a default vNIC configuration would be affected). Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-10-24 15:01:23 +02:00
Fiona Ebner	a36bda146c	add patch to avoid huge snapshot performance regression Taking a snapshot became prohibitively slow because of the migration_transferred_bytes() call in migration_rate_exceeded() [0]. This also applied to the async snapshot taking in Proxmox VE, so work around the issue until it is fixed upstream. [0]: https://gitlab.com/qemu-project/qemu/-/issues/1821 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-10-24 15:01:23 +02:00
Fiona Ebner	03ff63aa61	add patch to disable graph locking There are still some issues with graph locking, e.g. deadlocks during backup canceling [0] and initial attempts to fix it didn't work [1]. Because the AioContext locks still exist, it should still be safe to disable graph locking. [0]: https://lists.nongnu.org/archive/html/qemu-devel/2023-09/msg00729.html [1]: https://lists.nongnu.org/archive/html/qemu-devel/2023-09/msg06905.html Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-10-24 15:01:23 +02:00
Fiona Ebner	10e1093325	update submodule and patches to QEMU 8.1.2 Bigger notable changes: * Commit 1a30b0f5d7 ("block: .bdrv_open is non-coroutine and unlocked") broke the PVE backup patches, in particular setting up the backup dump block driver, because bdrv_new_open_driver() cannot be called from a coroutine. To fix it, bdrv_co_open() is used instead, and while it's a much more involved function, the result should be essentially the same. The only difference I noticed is that the BDRV_O_ALLOW_RDWR flag is also set in the resulting bds (block driver state), but that shouldn't hurt. Smaller notable changes: * aio_set_fd_handler() dropped its 'is_external' parameter stating that all callers now pass false in 60f782b6b7 ("aio: remove aio_disable_external() API"). The calls in the PVE patches also passed false, so just drop the parameter too. * global_state_store() does not have a return value anymore, so the user in the PVE savevm-async patch was adapted. For context, see c33f1829f8 ("migration: never fail in global_state_store()"). * Renames affecting the PVE savevm-async patch: migrate_use_block() -> migrate_block() and ram_counters -> mig_stats 9d4b1e5f22 ("migration: Move migrate_use_block() to options.c") aff3f6606d ("migration: Rename ram_counters to mig_stats") Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-10-24 15:01:23 +02:00
Fiona Ebner	89520c1cd0	d/rules: use disable-download option instead of git-submodules=ignore See the following QEMU commits for reference: 0c5f3dcbb2 ("configure: add --enable-pypi and --disable-pypi") ac4ccac740 ("configure: rename --enable-pypi to --enable-download, control subprojects too") 6f3ae23b29 ("configure: remove --with-git-submodules=") removed The last one removed the option and the closest thing to git-submodule=ignore is using disable-download. Which will then just verify that the submodules are present. Building now will require running either * Running 'meson subprojects download' in the qemu submodule first. * Using --enable-download, but then the submodules would be downloaded for each build (if not already downloaded in the submodule first) and it's just a bit too surprising if downloads happen during build. The disable-download option will also disable automatic downloading of missing Python modules from PyPI. Hopefully, it's enough to add them as Debian build dependencies when required. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-10-24 15:01:23 +02:00
Thomas Lamprecht	eca4daeeed	bump version to 8.0.2-7 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-10-04 08:33:39 +02:00
Fiona Ebner	816077299c	fix #2874 : SATA: avoid unsolicited write to sector 0 during reset If there is a pending DMA operation during ide_bus_reset(), the fact that the IDEstate is already reset before the operation is canceled can be problematic. In particular, ide_dma_cb() might be called and then use the reset IDEstate which contains the signature after the reset. When used to construct the IO operation this leads to ide_get_sector() returning 0 and nsector being 1. This is particularly bad, because a write command will thus destroy the first sector which often contains a partition table or similar. Upstream discussion: https://lists.nongnu.org/archive/html/qemu-devel/2023-08/msg04239.html Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-09-26 11:30:22 +02:00
Fiona Ebner	ef3308db71	vma: avoid compiler warning about incompatible pointer type Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-09-08 11:18:30 +02:00
Filip Schauer	0ff45eb23e	backup: Fix spelling error in function name Signed-off-by: Filip Schauer <f.schauer@proxmox.com> [FE: fixup patch context] Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-09-08 11:13:04 +02:00
Thomas Lamprecht	6c5563e30b	bump version to 8.0.2-6 Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>	2023-09-06 17:04:04 +02:00
Fiona Ebner	9e0186f289	backup: drop broken BACKUP_FORMAT_DIR Since upstream QEMU 8.0, it's no longer possible to call bdrv_img_create() from a coroutine anymore, meaning a backup with the directory format would crash the QEMU instance. The feature is only exposed via the monitor and was intended to be experimental. There were no user reports about the breakage and it only was noticed during the rebase for QEMU 8.1, because other parts of the backup code needed adaptation and I decided to check the BACKUP_FORMAT_DIR case too. It should not stay in a broken state of course, but avoid the maintenance cost and just make it a removed feature for Proxmox VE 8 retroactively. Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-09-06 16:59:12 +02:00
Fiona Ebner	0cffb504e7	backup: create jobs in a drained section With the drive-backup QMP command, upstream QEMU uses a drained section for the source drive when creating the backup job. Do the same here to avoid subtle bugs. There, the drained section extends until after the job is started, but this cannot be done here for multi-disk backups (could at most start the first job). The important thing is that the cbw (copy-before-write) node is in place and the bcs (block-copy-state) bitmap is initialized, which both happen during job creation (ensured by the "block/backup: move bcs bitmap initialization to job creation" PVE patch). One such bug is one reported in the community forum [0], where using a drive with iothread can lead to an overlapping block-copy request and consequently an assertion failure. The block-copy code relies on the bcs bitmap to determine if a request for a certain range can be created. Each time a request is created, it resets the bcs bitmap at that range to indicate that it's being handled. The duplicate request can happen as follows: Thread A attaches the cbw node Thread B creates a request and resets the bitmap at that range Thread A clears the bitmap and merges it with the PBS bitmap The merging can lead to the bitmap being set again at the range of the previous request, so the block-copy code thinks it's fine to create a request there. Thread B creates another requests at an overlapping range before the other request is finished. The drained section ensures that nothing else can interfere with the bcs bitmap between attaching the copy-before-write block node and initialization of the bitmap. [0]: https://forum.proxmox.com/threads/133149/ Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-09-06 16:59:12 +02:00
Fiona Ebner	f7eed6caa1	regenerate patch stats Apparently wasn't correct in `0cff91a` ("fix #1534: vma: Add extract filter for disk images"). Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-09-06 16:59:12 +02:00
Filip Schauer	0cff91a000	fix #1534 : vma: Add extract filter for disk images Add a filter to the "vma extract" command. A comma seperated list of disk images that should be extracted can be passed with the "-d" option. Example to extract an IDE drive and an SCSI drive from vzdump.vma: vma extract vzdump.vma -d "drive-ide0,drive-scsi0" extractdir Signed-off-by: Filip Schauer <f.schauer@proxmox.com>	2023-08-30 10:40:51 +02:00
Fiona Ebner	6cadf3677d	bump version to 8.0.2-5 Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-08-16 11:56:49 +02:00
Fiona Ebner	5f9cb29c3a	backup: trim heap after finishing Reported in the community forum [0]. By default, there can be large amounts of memory left assigned to the QEMU process after backup. Likely because of fragmentation, it's necessary to explicitly call malloc_trim() to tell glibc that it shouldn't keep all that memory resident for the process. QEMU itself already does a malloc_trim() in the RCU thread, but that code path might not be reached (or not for a long time) under usual operation. The value of 4 MiB for the argument was also copied from there. Example with the following configuration: > agent: 1 > boot: order=scsi0 > cores: 4 > cpu: x86-64-v2-AES > ide2: none,media=cdrom > memory: 1024 > name: backup-mem > net0: virtio=DA:58:18:26:59:9F,bridge=vmbr0,firewall=1 > numa: 0 > ostype: l26 > scsi0: rbd:base-107-disk-0/vm-106-disk-1,size=4302M > scsihw: virtio-scsi-pci > smbios1: uuid=b2d4511e-8d01-44f1-afd6-9581b30c24a6 > sockets: 2 > startup: order=2 > virtio0: lvmthin:vm-106-disk-1,iothread=1,size=1G > virtio1: lvmthin:vm-106-disk-2,iothread=1,size=1G > virtio2: lvmthin:vm-106-disk-3,iothread=1,size=1G > vmgenid: 0a1d8751-5e02-449d-977e-c0160e900231 Before the change: > root@pve8a1 ~ # grep VmRSS /proc/$(cat /var/run/qemu-server/106.pid)/status > VmRSS: 370948 kB > root@pve8a1 ~ # vzdump 106 --storage pbs > (...) > INFO: Backup job finished successfully > root@pve8a1 ~ # grep VmRSS /proc/$(cat /var/run/qemu-server/106.pid)/status > VmRSS: 2114964 kB After the change: > root@pve8a1 ~ # grep VmRSS /proc/$(cat /var/run/qemu-server/106.pid)/status > VmRSS: 398788 kB > root@pve8a1 ~ # vzdump 106 --storage pbs > (...) > INFO: Backup job finished successfully > root@pve8a1 ~ # grep VmRSS /proc/$(cat /var/run/qemu-server/106.pid)/status > VmRSS: 424356 kB [0]: https://forum.proxmox.com/threads/131339/ Co-diagnosed-by: Friedrich Weber <f.weber@proxmox.com> Co-diagnosed-by: Dominik Csapak <d.csapak@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Acked-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2023-08-16 11:50:12 +02:00
Fiona Ebner	c36e3f9d17	refresh patch context Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Acked-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2023-08-16 11:50:08 +02:00
Filip Schauer	b8b4ce0480	Add format attributes to function candidates Add format attributes to functions that take printf-like arguments. This provides additional compile-time checking that the correct parameters are passed to the functions. This fixes compiler warnings generated by the -Wsuggest-attribute=format flag. Signed-off-by: Filip Schauer <f.schauer@proxmox.com>	2023-08-08 09:08:48 +02:00
Fiona Ebner	df47146afe	add patch fixing fd leak for vhost Each pause+resume operation (which is also done as part of taking a VM snapshot) would increase the number of open file descriptors by the number of vhost devices (e.g. network devices by default). This could lead to crashes during backup and surely other issues once the system limit (default 1024) was reached [0]. [0]: https://forum.proxmox.com/threads/131603/ Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>	2023-08-03 17:40:13 +02:00
Fabian Grünbichler	d9cbfafeeb	bump version to 8.0.2-4 Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>	2023-07-28 12:59:10 +02:00

1 2 3 4 5 ...

440 Commits