Hard crash/hang (?) when uploading ISO to cephFS volume

scyto

Active Member
Aug 8, 2023
359
64
28
Symptoms:
When I upload something to the cephFS volume in my cluster the node i am uploading it on may hard hang at any point in that process, for example uploading an 4GB iso it didn't just after the copy operation seems to have complete.

I see no stdrr/stdout on the screen connected via KVM
The node cannot be reach on any network
I have to hard reboot.

When did this start happening?
Only after i did two things:
  1. when i rolled my own 6.5.2 linux kernel to use the thunderbolt patches
  2. moved to using IPv6 on ceph public/private
I am more than happy to revert 2 and even 1 to prove this. I knew the risks!

I am surprised there is nothing on the console to indicate the hang.

I would like to understand if the issue is:
1. the move to IPv6 (this is easy for me to test)
2. is it a general 6.5.2 kernel / ceph issue
3. is it specific to the code fixes I patched

Question
Beyond hooking up a debugger to a usb serial port is there anyway for me to capture dmesg/jouranlctl from the moment of the crash / is their a dump file on the system
(i am used to some basic windbg debugging on windows (aka just using !analyze... lol, but nothing beyond that).

If not we will do it the old fashioned way and just revert what i did.
 
Last edited:

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!