This happens once in ten years...
I've had now some very hot moments at a large scientific organization where I run a entire rack of Proxmox servers and a Ceph cluster.
Even the basic infrastructure was randomly down (Firewalls, DNS, DHCP ...).
But it's OK. Sh1t happens. That's life!
The...
Please help identify the common factors that cause this problem.
I can confirm, that I had exact the same issue as described by @Tim-AU and @Lephisto
Still looking for the code different or added in kernel 6.8 causing this issue.
* I'm using in the VM CPU host, is this the same in your case?
*...
Asrock and Asus are server vendors And we have a good relation with them.
And this Is not the case of RX570. As I wrote in my previous post, I'm aware of three different situations of 6.8 freezing - non of them applies to a epyc (non GPU) server.
The RX570 relates to the Destroy DC context...
solved!
it is definitely a kernel 6.8 bug
i need to know which kernel patch/commit is causing this regression
i'm looking in to ubuntu kernel and are the any proxmox specific patches?
i will share this kernel problem with asrock and asus vendor
we need to know, what changed in kernel 6.8
and when it would be save, to go upstream again
are the any proxmox specific kernel patches?
tried kernel 6.8 without success:
amd_iommu=off iommu=off
default - no parameters
and with ceph optimatizations amd_iommu=on iommu=pt pcie_aspm=off
with...
after many attempts
replacing PSU, CPU, RAM etc...
trying different kernel parameters etc...
checking ipmi and system for any error and logs
checking all power cables and upgrading UPS
updating BIOS, Firmware and BCM/IPMI
BIOS configuring different options
installing a new version of...
This thread is dedicated to the issue where the server just freezes.
If the kernel gives error messages when the server crashes
there is a thread https://forum.proxmox.com/threads/random-6-8-4-2-pve-kernel-crashes.145760
and not AMD GPU related as in...
Solved
there is no bonding involved
just the kernel (version) + bnxt_re module + Broadcom firmware + initializing RDMA RoCE
I had the same problem.
In my rack there are in total 25 pieces of Broadcom BCM57504 4x25G SFP28 PCIe network cards.
Most of them do not have this problem, but some of...
solved!
thanks fiona
pinning machine type to 8.0 definitely helps
i should have thought of this solution right away
PS: this should be the right solution for all the virtual appliances from Cisco...
Upgrading Proxmox to 8.1.3 and pve-qemu-kvm to 8.1.2-4 from Proxmox 8.0.4 and pve-qemu-kvm 8.0.2-6 did cause AsyncOS to stop working properly.
The VM is booting and starts, but this upgrade causes the newest AsyncOS to be non functional.
The network is not working (network virtio) and at the...
the bug is related to some powersave functions (EEE or green ethernet etc.) on the switch with the combination of linux kernel
found some kernel bug reports
-> solved
OK - the 6.2 kernel is a nice one - but it doesn't help
I'm running 20 proxmox ve or pbs and only some of them are affected by this bug.
Next i will fiddle around with the switch (all affected systems are on one switch)
the switch is fine and ok (i can achieve 1gbs on the same ports but with...
I have the same issue on proxmox ve and proxmox backup server
Intel nics do wrong auto negotiation, realtek and broadcom are ok
the cables are ok (they work in other scenarios)
the switch is cheap but ok (other servers do not have a problem to reach 1gbs)
the nics are ok (they do 1gbs under...
this problem is replicable
(tried 2 socket xeon E5645, 2 socket E5-2637 v2, 2 socket CPU E5-2620 v3, 2 socket CPU E5-2650 v4, 2 socket Xeon Silver 4216):
install proxmox (tried 6.4 and 7.2 +7.3)
create a vm with debian 8 or ubuntu 12.04 or lower on ceph or zfs
(because virtio scsi is not...
Thank you,
the most likely change is, that we started backing up virtuals using the Proxmox Backup Server. This could cause some stress on the IO.
And it is probably the scsi subsystem... I found some old kernel bug "CPU freezes in KVM guests during high IO load on host" and this could be...
Hi everyone,
I have an issue with VMs randomly freezing. IT happens multiple times a day at seemingly random times. Not all my VMs with old Linux kernel freeze at once but they are all affected at one time or another. I did not have issues with more recent Linux distributions.
I think that...
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.