Four times in the last two days I have tried to upload an img file to the PVE local storage. Each time the copy seems to complete, then immediately afterwards, we get a no communication error on the browser. We also lose SSH and ping contact to the server. I have left it in this state for 12 hours, and it doesn't recover.
The file is 15 GB but the LAN is a 1Gbps and doesn't seem to be the issue. The drive is an NVMe, and IO performance during the copy does not seem to be the issue.
The syslog reports
Sep 29 22:25:33 pve pvedaemon[1241]: <root@pam> successful auth for user 'root@pam'
Sep 29 22:27:09 pve pvedaemon[1240]: <root@pam> successful auth for user 'root@pam'
Sep 29 22:29:01 pve kernel: usb 1-7: new full-speed USB device number 108 using xhci_hcd
Sep 29 22:29:01 pve kernel: usb 1-7: device descriptor read/64, error -71
Sep 29 22:29:32 pve pveproxy[10772]: multipart upload complete (size: 15032385536 time: 213s rate: 67.00MiB/s md5sum: f913e09caa5f3e6c82dd0d8496fe8514)
Sep 29 22:29:32 pve unknow:
-- Reboot --
Sep 29 22:54:06 pve kernel: Linux version 5.15.60-1-pve (build@proxmox) (gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU Binutils for Debian) 2.35.2) #1 SMP PVE 5.15.60-1 (Mon, 19 Sep 2022 17:53:17 +0200) ()
Sep 29 22:54:06 pve kernel: Command line: BOOT_IMAGE=/boot/vmlinuz-5.15.60-1-pve root=/dev/mapper/pve-root ro quiet
I do not believe the USB error is the issue, as this has been connected for several months without issue. This has only happened for the last two days.
As you can see form the syslog, the upload seems to complete, the pve get an unknown: status. Then I must perform a hardware reset to get the server online again.
Does anyone have any ideas regarding why this might me.
There were only two small VMs running at the time. CPU % < 5% Memory usage, 6Gb from 32Gb.
The file is 15 GB but the LAN is a 1Gbps and doesn't seem to be the issue. The drive is an NVMe, and IO performance during the copy does not seem to be the issue.
The syslog reports
Sep 29 22:25:33 pve pvedaemon[1241]: <root@pam> successful auth for user 'root@pam'
Sep 29 22:27:09 pve pvedaemon[1240]: <root@pam> successful auth for user 'root@pam'
Sep 29 22:29:01 pve kernel: usb 1-7: new full-speed USB device number 108 using xhci_hcd
Sep 29 22:29:01 pve kernel: usb 1-7: device descriptor read/64, error -71
Sep 29 22:29:32 pve pveproxy[10772]: multipart upload complete (size: 15032385536 time: 213s rate: 67.00MiB/s md5sum: f913e09caa5f3e6c82dd0d8496fe8514)
Sep 29 22:29:32 pve unknow:
-- Reboot --
Sep 29 22:54:06 pve kernel: Linux version 5.15.60-1-pve (build@proxmox) (gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU Binutils for Debian) 2.35.2) #1 SMP PVE 5.15.60-1 (Mon, 19 Sep 2022 17:53:17 +0200) ()
Sep 29 22:54:06 pve kernel: Command line: BOOT_IMAGE=/boot/vmlinuz-5.15.60-1-pve root=/dev/mapper/pve-root ro quiet
I do not believe the USB error is the issue, as this has been connected for several months without issue. This has only happened for the last two days.
As you can see form the syslog, the upload seems to complete, the pve get an unknown: status. Then I must perform a hardware reset to get the server online again.
Does anyone have any ideas regarding why this might me.
There were only two small VMs running at the time. CPU % < 5% Memory usage, 6Gb from 32Gb.