Commit Graph

1053 Commits

Author SHA1 Message Date
Thomas Lamprecht ac49c428da update sources to Ubuntu-6.5.0-27.27
(generated with debian/scripts/import-upstream-tag)

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2024-03-11 13:38:05 +01:00
Thomas Lamprecht bda1c40146 update ABI file for 6.5.13-1-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2024-02-14 12:02:47 +01:00
Thomas Lamprecht 46f7b8bf3f update fwlist for 6.5.13-1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2024-02-14 11:12:56 +01:00
Thomas Lamprecht 98d7a8dad9 bump version to 6.5.13-1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2024-02-14 11:12:56 +01:00
Friedrich Weber 29cb6fcbb7 cherry-pick scheduler fix to avoid temporary VM freezes on NUMA hosts
Users have been reporting [1] that VMs occasionally become
unresponsive with high CPU usage for some time (varying between ~1 and
more than 60 seconds). After that time, the guests come back and
continue running. Windows VMs seem most affected (not responding to
pings during the hang, RDP sessions time out), but we also got reports
about Linux VMs (reporting soft lockups). The issue was not present on
host kernel 5.15 and was first reported with kernel 6.2. Users
reported that the issue becomes easier to trigger the more memory is
assigned to the guests. Setting mitigations=off was reported to
alleviate (but not eliminate) the issue. For most users the issue
seems to disappear after (also) disabling KSM [2], but some users
reported freezes even with KSM disabled [3].

It turned out the reports concerned NUMA hosts only, and that the
freezes correlated with runs of the NUMA balancer [4]. Users reported
that disabling the NUMA balancer resolves the issue (even with KSM
enabled).

We put together a Linux VM reproducer, ran a git-bisect on the kernel
to find the commit introducing the issue and asked upstream for help
[5]. As it turned out, an upstream bugreport was recently opened [6]
and a preliminary fix to the KVM TDP MMU was proposed [7]. With that
patch [7] on top of kernel 6.7, the reproducer does not trigger
freezes anymore. As of now, the patch (or its v2 [8]) is not yet
merged in the mainline kernel, and backporting it may be difficult due
to dependencies on other KVM changes [9].

However, the bugreport [6] also prompted an upstream developer to
propose a patch to the kernel scheduler logic that decides whether a
contended spinlock/rwlock should be dropped [10]. Without the patch,
PREEMPT_DYNAMIC kernels (such as ours) would always drop contended
locks. With the patch, the kernel only drops contended locks if the
kernel is currently set to preempt=full. As noted in the commit
message [10], this can (counter-intuitively) improve KVM performance.
Our kernel defaults to preempt=voluntary (according to
/sys/kernel/debug/sched/preempt), so with the patch it does not drop
contended locks anymore, and the reproducer does not trigger freezes
anymore. Hence, backport [10] to our kernel.

[1] https://forum.proxmox.com/threads/130727/
[2] https://forum.proxmox.com/threads/130727/page-4#post-575886
[3] https://forum.proxmox.com/threads/130727/page-8#post-617587
[4] https://www.kernel.org/doc/html/latest/admin-guide/sysctl/kernel.html#numa-balancing
[5] https://lore.kernel.org/kvm/832697b9-3652-422d-a019-8c0574a188ac@proxmox.com/
[6] https://bugzilla.kernel.org/show_bug.cgi?id=218259
[7] https://lore.kernel.org/all/20230825020733.2849862-1-seanjc@google.com/
[8] https://lore.kernel.org/all/20240110012045.505046-1-seanjc@google.com/
[9] https://lore.kernel.org/kvm/Zaa654hwFKba_7pf@google.com/
[10] https://lore.kernel.org/all/20240110214723.695930-1-seanjc@google.com/

Signed-off-by: Friedrich Weber <f.weber@proxmox.com>
2024-02-14 11:10:25 +01:00
Thomas Lamprecht 5dde66b4fe update kernel and patches for Ubuntu-6.5.0-20.20
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2024-02-14 11:08:30 +01:00
Fabian Grünbichler 1b4116e1c8 fix malformed lintian overrides
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-01-30 14:59:51 +01:00
Fabian Grünbichler d3b49cbb02 update ABI file for 6.5.11-8-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-01-30 14:38:48 +01:00
Fabian Grünbichler 45cc7d1372 bump version to 6.5.11-8
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-01-30 13:28:01 +01:00
Fabian Grünbichler 0ec9138fc0 fix #5158: cherry-pick ext4 fix for high-CPU flush
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-01-30 13:26:35 +01:00
Fabian Grünbichler 53226238d9 fix #5077: cherry-pick revert for aacraid resets
reported both in our bug tracker and upstream to fix the affected hardware.

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-01-30 13:24:16 +01:00
Thomas Lamprecht 06fe6d5acf bump version to 6.5.11-7
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-12-05 10:44:17 +01:00
Thomas Lamprecht 4dec90b9bc update ZFS to 2.2.2
There shouldn't be any changes for us w.r.t. data integrity and the
recent uncovered dnode dirtiness, as we backported those patches
already.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-12-05 10:42:10 +01:00
Fabian Grünbichler c87a354095 update ABI file for 6.5.11-6-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-29 10:28:15 +01:00
Fabian Grünbichler e294ffe2e6 bump version to 6.5.11-6
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-29 09:33:09 +01:00
Fabian Grünbichler 82a3d01c72 update zfs to 2.2.0-pve4
which contains the fix for https://github.com/openzfs/zfs/issues/15526

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-29 09:31:50 +01:00
Fabian Grünbichler c604a765d5 update ABI file for 6.5.11-5-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-27 21:37:46 +01:00
Fabian Grünbichler 24c3901a4b bump version to 6.5.11-5
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-27 20:54:07 +01:00
Fiona Ebner cc99d7fd2f cherry-pick fix for RCU stall issue after VM live migration
caused by a lapic timer interrupt getting lost.

Already queued for 6.5.13:
https://lore.kernel.org/stable/20231124172031.920738810@linuxfoundation.org/

Reported in the community forum:
https://forum.proxmox.com/threads/136992/

Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>
2023-11-27 18:58:23 +01:00
Thomas Lamprecht f7fedc604d d/rules: VFIO_VIRQFD config is a boolean
since e2d55709398e ("vfio: Fold vfio_virqfd.ko into vfio.ko") this
config isn't a tristate anymore but a bool, so adapt to that.

Luckily the kconfig script did the right thing and set (or at least
kept) this to yes anyway

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-25 14:13:49 +01:00
Fabian Grünbichler bf577f5f83 update ABI file for 6.5.11-4-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 15:02:04 +01:00
Fabian Grünbichler 08f167c994 build: add signed-template to DEBS
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 15:02:04 +01:00
Fabian Grünbichler 4c8651aeb9 bump version to 6.5.11-4
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 12:59:50 +01:00
Fabian Grünbichler 331dac76a3 meta package: prefer signed kernel package
to ensure future versions are properly pulled in.

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 12:59:50 +01:00
Fabian Grünbichler 17ad9a3376 d/control: add missing python3-dev
required by perf's Makefile

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 12:28:56 +01:00
Fabian Grünbichler 2b3d5a2269 build: add proxmox-kernel-X.Y-pve-signed-template
the signed template together with the binary package(s) containing the unsigned
files form the input to our secure boot signing service.

the signed template consists of
- files.json (specifying which files are signed how and by which key)
- packaging template used to build the signed package(s)

the signing service
- extracts and checks the signed-template binary package
- extracts the unsigned package(s)
- signs the needed files
- packs up the signatures + the template contained in the signed-template
  package into the signed source package

the signed source package can then be built in the regular fashion (in case of
the kernel packages, it will copy the kernel image, modules and some helper
files from the unsigned package, attach the signature created by the signing
service, and re-pack the result as signed-kernel package).

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 12:28:56 +01:00
Fabian Grünbichler 03c65664fc add Proxmox UEFI certificates
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 11:18:07 +01:00
Thomas Lamprecht e486c89950 update ABI file for 6.5.11-3-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-17 18:09:41 +01:00
Thomas Lamprecht eecd83b751 bump version to 6.5.11-3
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-17 17:34:46 +01:00
Thomas Lamprecht 5111be4ff3 update ZFS to include important fixes staged for 2.2.1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-17 17:33:36 +01:00
Thomas Lamprecht 275449f512 update ABI file for 6.5.11-2-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-16 15:05:41 +01:00
Thomas Lamprecht bb9b21fa29 bump version to 6.5.11-2
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-16 13:29:45 +01:00
Thomas Lamprecht d7ddc69bea update submodule to Ubuntu-6.5.0-14.14
still with 6.5.3 to 6.5.10 cherry-picked

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-16 13:28:19 +01:00
Thomas Lamprecht 934b701c85 d/rules: temporarily disable UBSAN bound checks again
it's really not just ZFS and AMDGPU modules, but way more and
generating scary looking messages for these "issues" is just noise
that drown real issues. Disable this for now, maybe in another few
years.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-16 13:27:52 +01:00
Fiona Ebner dd086d18e3 backport UBSAN fixes for amdgpu
to silence array-index-out-of-bounds warnings for dynamically-sized
arrays. All commits applied cleanly and just replace array[1] with
array[].

Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>
2023-11-14 16:15:22 +01:00
Thomas Lamprecht de4af42cbd update ABI file for 6.5.11-1-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 18:02:30 +01:00
Thomas Lamprecht bcc95c3263 bump version to 6.5.11-1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 17:31:57 +01:00
Thomas Lamprecht 4a4ddffc89 cherry-pick 6.5.11 stable release
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 16:45:41 +01:00
Thomas Lamprecht 3fd758c529 d/rules: use olddefconfig to generate our config
makes it easier to cherry-pick newer stable release tags, that
sometimes contain new config values one must pick from.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 16:36:19 +01:00
Thomas Lamprecht 248716e28c d/rules: disable CONFIG_WQ_CPU_INTENSIVE_REPORT for now
it's mostly noise for users, and quiet some interpret this as real
problem and report it to us.

Ideally we'd either educate them, or take time ourself, to report this
upstream and see if the situation can be improved overall, but
currently that's not feasible. We should check this out a few releases
down, if the lower hanging fruits got fixed and noise got lower we
could enable it again to catch the more rare cases.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 16:33:20 +01:00
Thomas Lamprecht c0a8921e19 update ZFS to get better work-around for UBSAN bounds-checking
We have a slightly better fix where only a few targeted ZFS module
parts are added to the UBSAN ignore-list, so the rest of the kernel
still gets exposure.

Link: https://github.com/openzfs/zfs/pull/15510
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 16:20:25 +01:00
Thomas Lamprecht b0ac1e9734 Revert "UBUNTU: SAUCE: ceph: make sure all the files successfully put before unmounting"
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-07 09:37:12 +01:00
Thomas Lamprecht 8f06837c7c revert "memfd: improve userspace warnings for missing exec-related flags"
This is generating far too much noise in the logs, so keep it at once
per boot until we (and other user space tools) adapted to the kernel
wanting user space to chose memfd execution behavior very explicitly.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-06 10:21:03 +01:00
Thomas Lamprecht a2edb6600c update ABI file for 6.5.3-1-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:29:54 +02:00
Thomas Lamprecht 88f8395203 bump version to 6.5.3-1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:29:24 +02:00
Thomas Lamprecht 50a5490fa6 update to current ubuntu-mantic/master-next
to get some relevant fixes for our use cases, like hangs on reboots on
some x86_64 machines.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:28:56 +02:00
Thomas Lamprecht 43f8ddaae1 d/rules: temporarily disable UBSAN bound checks
until ZFS can cope with them:
https://github.com/openzfs/zfs/issues/15145

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:28:56 +02:00
Thomas Lamprecht 62d5fcabed bump version to 6.5.3-1~1
for initial testing and internal QA

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:28:52 +02:00
Thomas Lamprecht fc8577c9fa d/rules: adapt to merge of ZFS kernel modules
Following ZFS commit ad9e76765 ("linux: module: weld all but spl.ko
into zfs.ko") we only have two modules to care about.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:28:11 +02:00
Thomas Lamprecht 9b93b3b2f7 update ZFS module to 2.2.0
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:28:11 +02:00