Commit Graph

1102 Commits

Author SHA1 Message Date
Thomas Lamprecht
bda1c40146 update ABI file for 6.5.13-1-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2024-02-14 12:02:47 +01:00
Thomas Lamprecht
46f7b8bf3f update fwlist for 6.5.13-1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2024-02-14 11:12:56 +01:00
Thomas Lamprecht
98d7a8dad9 bump version to 6.5.13-1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2024-02-14 11:12:56 +01:00
Friedrich Weber
29cb6fcbb7 cherry-pick scheduler fix to avoid temporary VM freezes on NUMA hosts
Users have been reporting [1] that VMs occasionally become
unresponsive with high CPU usage for some time (varying between ~1 and
more than 60 seconds). After that time, the guests come back and
continue running. Windows VMs seem most affected (not responding to
pings during the hang, RDP sessions time out), but we also got reports
about Linux VMs (reporting soft lockups). The issue was not present on
host kernel 5.15 and was first reported with kernel 6.2. Users
reported that the issue becomes easier to trigger the more memory is
assigned to the guests. Setting mitigations=off was reported to
alleviate (but not eliminate) the issue. For most users the issue
seems to disappear after (also) disabling KSM [2], but some users
reported freezes even with KSM disabled [3].

It turned out the reports concerned NUMA hosts only, and that the
freezes correlated with runs of the NUMA balancer [4]. Users reported
that disabling the NUMA balancer resolves the issue (even with KSM
enabled).

We put together a Linux VM reproducer, ran a git-bisect on the kernel
to find the commit introducing the issue and asked upstream for help
[5]. As it turned out, an upstream bugreport was recently opened [6]
and a preliminary fix to the KVM TDP MMU was proposed [7]. With that
patch [7] on top of kernel 6.7, the reproducer does not trigger
freezes anymore. As of now, the patch (or its v2 [8]) is not yet
merged in the mainline kernel, and backporting it may be difficult due
to dependencies on other KVM changes [9].

However, the bugreport [6] also prompted an upstream developer to
propose a patch to the kernel scheduler logic that decides whether a
contended spinlock/rwlock should be dropped [10]. Without the patch,
PREEMPT_DYNAMIC kernels (such as ours) would always drop contended
locks. With the patch, the kernel only drops contended locks if the
kernel is currently set to preempt=full. As noted in the commit
message [10], this can (counter-intuitively) improve KVM performance.
Our kernel defaults to preempt=voluntary (according to
/sys/kernel/debug/sched/preempt), so with the patch it does not drop
contended locks anymore, and the reproducer does not trigger freezes
anymore. Hence, backport [10] to our kernel.

[1] https://forum.proxmox.com/threads/130727/
[2] https://forum.proxmox.com/threads/130727/page-4#post-575886
[3] https://forum.proxmox.com/threads/130727/page-8#post-617587
[4] https://www.kernel.org/doc/html/latest/admin-guide/sysctl/kernel.html#numa-balancing
[5] https://lore.kernel.org/kvm/832697b9-3652-422d-a019-8c0574a188ac@proxmox.com/
[6] https://bugzilla.kernel.org/show_bug.cgi?id=218259
[7] https://lore.kernel.org/all/20230825020733.2849862-1-seanjc@google.com/
[8] https://lore.kernel.org/all/20240110012045.505046-1-seanjc@google.com/
[9] https://lore.kernel.org/kvm/Zaa654hwFKba_7pf@google.com/
[10] https://lore.kernel.org/all/20240110214723.695930-1-seanjc@google.com/

Signed-off-by: Friedrich Weber <f.weber@proxmox.com>
2024-02-14 11:10:25 +01:00
Thomas Lamprecht
5dde66b4fe update kernel and patches for Ubuntu-6.5.0-20.20
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2024-02-14 11:08:30 +01:00
Fabian Grünbichler
1b4116e1c8 fix malformed lintian overrides
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-01-30 14:59:51 +01:00
Fabian Grünbichler
d3b49cbb02 update ABI file for 6.5.11-8-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-01-30 14:38:48 +01:00
Fabian Grünbichler
45cc7d1372 bump version to 6.5.11-8
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-01-30 13:28:01 +01:00
Fabian Grünbichler
0ec9138fc0 fix #5158: cherry-pick ext4 fix for high-CPU flush
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-01-30 13:26:35 +01:00
Fabian Grünbichler
53226238d9 fix #5077: cherry-pick revert for aacraid resets
reported both in our bug tracker and upstream to fix the affected hardware.

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2024-01-30 13:24:16 +01:00
Thomas Lamprecht
06fe6d5acf bump version to 6.5.11-7
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-12-05 10:44:17 +01:00
Thomas Lamprecht
4dec90b9bc update ZFS to 2.2.2
There shouldn't be any changes for us w.r.t. data integrity and the
recent uncovered dnode dirtiness, as we backported those patches
already.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-12-05 10:42:10 +01:00
Fabian Grünbichler
c87a354095 update ABI file for 6.5.11-6-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-29 10:28:15 +01:00
Fabian Grünbichler
e294ffe2e6 bump version to 6.5.11-6
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-29 09:33:09 +01:00
Fabian Grünbichler
82a3d01c72 update zfs to 2.2.0-pve4
which contains the fix for https://github.com/openzfs/zfs/issues/15526

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-29 09:31:50 +01:00
Fabian Grünbichler
c604a765d5 update ABI file for 6.5.11-5-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-27 21:37:46 +01:00
Fabian Grünbichler
24c3901a4b bump version to 6.5.11-5
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-27 20:54:07 +01:00
Fiona Ebner
cc99d7fd2f cherry-pick fix for RCU stall issue after VM live migration
caused by a lapic timer interrupt getting lost.

Already queued for 6.5.13:
https://lore.kernel.org/stable/20231124172031.920738810@linuxfoundation.org/

Reported in the community forum:
https://forum.proxmox.com/threads/136992/

Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>
2023-11-27 18:58:23 +01:00
Thomas Lamprecht
f7fedc604d d/rules: VFIO_VIRQFD config is a boolean
since e2d55709398e ("vfio: Fold vfio_virqfd.ko into vfio.ko") this
config isn't a tristate anymore but a bool, so adapt to that.

Luckily the kconfig script did the right thing and set (or at least
kept) this to yes anyway

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-25 14:13:49 +01:00
Fabian Grünbichler
bf577f5f83 update ABI file for 6.5.11-4-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 15:02:04 +01:00
Fabian Grünbichler
08f167c994 build: add signed-template to DEBS
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 15:02:04 +01:00
Fabian Grünbichler
4c8651aeb9 bump version to 6.5.11-4
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 12:59:50 +01:00
Fabian Grünbichler
331dac76a3 meta package: prefer signed kernel package
to ensure future versions are properly pulled in.

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 12:59:50 +01:00
Fabian Grünbichler
17ad9a3376 d/control: add missing python3-dev
required by perf's Makefile

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 12:28:56 +01:00
Fabian Grünbichler
2b3d5a2269 build: add proxmox-kernel-X.Y-pve-signed-template
the signed template together with the binary package(s) containing the unsigned
files form the input to our secure boot signing service.

the signed template consists of
- files.json (specifying which files are signed how and by which key)
- packaging template used to build the signed package(s)

the signing service
- extracts and checks the signed-template binary package
- extracts the unsigned package(s)
- signs the needed files
- packs up the signatures + the template contained in the signed-template
  package into the signed source package

the signed source package can then be built in the regular fashion (in case of
the kernel packages, it will copy the kernel image, modules and some helper
files from the unsigned package, attach the signature created by the signing
service, and re-pack the result as signed-kernel package).

Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 12:28:56 +01:00
Fabian Grünbichler
03c65664fc add Proxmox UEFI certificates
Signed-off-by: Fabian Grünbichler <f.gruenbichler@proxmox.com>
2023-11-20 11:18:07 +01:00
Thomas Lamprecht
e486c89950 update ABI file for 6.5.11-3-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-17 18:09:41 +01:00
Thomas Lamprecht
eecd83b751 bump version to 6.5.11-3
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-17 17:34:46 +01:00
Thomas Lamprecht
5111be4ff3 update ZFS to include important fixes staged for 2.2.1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-17 17:33:36 +01:00
Thomas Lamprecht
275449f512 update ABI file for 6.5.11-2-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-16 15:05:41 +01:00
Thomas Lamprecht
bb9b21fa29 bump version to 6.5.11-2
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-16 13:29:45 +01:00
Thomas Lamprecht
d7ddc69bea update submodule to Ubuntu-6.5.0-14.14
still with 6.5.3 to 6.5.10 cherry-picked

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-16 13:28:19 +01:00
Thomas Lamprecht
934b701c85 d/rules: temporarily disable UBSAN bound checks again
it's really not just ZFS and AMDGPU modules, but way more and
generating scary looking messages for these "issues" is just noise
that drown real issues. Disable this for now, maybe in another few
years.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-16 13:27:52 +01:00
Fiona Ebner
dd086d18e3 backport UBSAN fixes for amdgpu
to silence array-index-out-of-bounds warnings for dynamically-sized
arrays. All commits applied cleanly and just replace array[1] with
array[].

Signed-off-by: Fiona Ebner <f.ebner@proxmox.com>
2023-11-14 16:15:22 +01:00
Thomas Lamprecht
de4af42cbd update ABI file for 6.5.11-1-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 18:02:30 +01:00
Thomas Lamprecht
bcc95c3263 bump version to 6.5.11-1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 17:31:57 +01:00
Thomas Lamprecht
4a4ddffc89 cherry-pick 6.5.11 stable release
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 16:45:41 +01:00
Thomas Lamprecht
3fd758c529 d/rules: use olddefconfig to generate our config
makes it easier to cherry-pick newer stable release tags, that
sometimes contain new config values one must pick from.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 16:36:19 +01:00
Thomas Lamprecht
248716e28c d/rules: disable CONFIG_WQ_CPU_INTENSIVE_REPORT for now
it's mostly noise for users, and quiet some interpret this as real
problem and report it to us.

Ideally we'd either educate them, or take time ourself, to report this
upstream and see if the situation can be improved overall, but
currently that's not feasible. We should check this out a few releases
down, if the lower hanging fruits got fixed and noise got lower we
could enable it again to catch the more rare cases.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 16:33:20 +01:00
Thomas Lamprecht
c0a8921e19 update ZFS to get better work-around for UBSAN bounds-checking
We have a slightly better fix where only a few targeted ZFS module
parts are added to the UBSAN ignore-list, so the rest of the kernel
still gets exposure.

Link: https://github.com/openzfs/zfs/pull/15510
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-12 16:20:25 +01:00
Thomas Lamprecht
b0ac1e9734 Revert "UBUNTU: SAUCE: ceph: make sure all the files successfully put before unmounting"
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-07 09:37:12 +01:00
Thomas Lamprecht
8f06837c7c revert "memfd: improve userspace warnings for missing exec-related flags"
This is generating far too much noise in the logs, so keep it at once
per boot until we (and other user space tools) adapted to the kernel
wanting user space to chose memfd execution behavior very explicitly.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-11-06 10:21:03 +01:00
Thomas Lamprecht
a2edb6600c update ABI file for 6.5.3-1-pve
(generated with debian/scripts/abi-generate)

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:29:54 +02:00
Thomas Lamprecht
88f8395203 bump version to 6.5.3-1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:29:24 +02:00
Thomas Lamprecht
50a5490fa6 update to current ubuntu-mantic/master-next
to get some relevant fixes for our use cases, like hangs on reboots on
some x86_64 machines.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:28:56 +02:00
Thomas Lamprecht
43f8ddaae1 d/rules: temporarily disable UBSAN bound checks
until ZFS can cope with them:
https://github.com/openzfs/zfs/issues/15145

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:28:56 +02:00
Thomas Lamprecht
62d5fcabed bump version to 6.5.3-1~1
for initial testing and internal QA

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:28:52 +02:00
Thomas Lamprecht
fc8577c9fa d/rules: adapt to merge of ZFS kernel modules
Following ZFS commit ad9e76765 ("linux: module: weld all but spl.ko
into zfs.ko") we only have two modules to care about.

Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:28:11 +02:00
Thomas Lamprecht
9b93b3b2f7 update ZFS module to 2.2.0
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:28:11 +02:00
Thomas Lamprecht
3c4cb919f9 update fwlist for 6.5.3-1
Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com>
2023-10-28 14:28:11 +02:00