Hello everyone!
I'm having an issue with my "server" (it's just a PC built from old spare parts): whenever I shut it down or restart it, it takes agonizingly long.
Before posting here, I tried troubleshooting with some LLMs (Claude, Gemini and ChatGPT), to no avail. I have next to no experience with Proxmox besides some light usage many years ago on an OVH dedicated server.
I suspect this started when I added an HBA card to the setup (specifically, this one: https://www.ebay.it/itm/166376779814).
I pass this HBA card through to my Unraid VM (I used to run Unraid on bare metal, but I was never satisfied with some things, especially how it handles Docker and VMs, so I installed Proxmox and moved Unraid into a VM). The card itself seems to be working fine for my usage, but this issue is driving me crazy haha.
Claude had me run lspci so I'm reporting the output here:
Code:
lspci -vv -s 02:00.0
02:00.0 RAID bus controller: Broadcom / LSI SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon] (rev 03)
	Subsystem: Fujitsu Technology Solutions HBA Ctrl SAS 6G 0/1 [D2607]
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
	Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0, Cache Line Size: 64 bytes
	Interrupt: pin A routed to IRQ 18
	IOMMU group: 11
	Region 0: I/O ports at 3000 [size=256]
	Region 1: Memory at 3fcc0000 (64-bit, non-prefetchable) [size=16K]
	Region 3: Memory at 3f880000 (64-bit, non-prefetchable) [size=256K]
	Expansion ROM at 3f800000 [disabled] [size=512K]
	Capabilities: [50] Power Management version 3
		Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
		Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
	Capabilities: [68] Express (v2) Endpoint, IntMsgNum 0
		DevCap: MaxPayload 4096 bytes, PhantFunc 0, Latency L0s <64ns, L1 <1us
			ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset+ SlotPowerLimit 0W TEE-IO-
		DevCtl: CorrErr+ NonFatalErr+ FatalErr+ UnsupReq+
			RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+ FLReset-
			MaxPayload 256 bytes, MaxReadReq 512 bytes
		DevSta: CorrErr- NonFatalErr- FatalErr- UnsupReq- AuxPwr- TransPend-
		LnkCap: Port #0, Speed 5GT/s, Width x8, ASPM L0s, Exit Latency L0s <64ns
			ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp-
		LnkCtl: ASPM Disabled; RCB 64 bytes, LnkDisable- CommClk+
			ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
		LnkSta: Speed 5GT/s, Width x2 (downgraded)
			TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
		DevCap2: Completion Timeout: Range BC, TimeoutDis+ NROPrPrP- LTR-
			10BitTagComp- 10BitTagReq- OBFF Not Supported, ExtFmt- EETLPPrefix-
			EmergencyPowerReduction Not Supported, EmergencyPowerReductionInit-
			FRS- TPHComp- ExtTPHComp-
			AtomicOpsCap: 32bit- 64bit- 128bitCAS-
		DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-
			AtomicOpsCtl: ReqEn-
			IDOReq- IDOCompl- LTR- EmergencyPowerReductionReq-
			10BitTagReq- OBFF Disabled, EETLPPrefixBlk-
		LnkCtl2: Target Link Speed: 5GT/s, EnterCompliance- SpeedDis-
			Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
			Compliance Preset/De-emphasis: -6dB de-emphasis, 0dB preshoot
		LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete- EqualizationPhase1-
			EqualizationPhase2- EqualizationPhase3- LinkEqualizationRequest-
			Retimer- 2Retimers- CrosslinkRes: unsupported
	Capabilities: [d0] Vital Product Data
pcilib: sysfs_read_vpd: read failed: No such device
		Not readable
	Capabilities: [a8] MSI: Enable- Count=1/1 Maskable- 64bit+
		Address: 0000000000000000 Data: 0000
	Capabilities: [c0] MSI-X: Enable+ Count=15 Masked-
		Vector table: BAR=1 offset=00002000
		PBA: BAR=1 offset=00003800
	Capabilities: [100 v1] Advanced Error Reporting
		UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP-
			ECRC- UnsupReq- ACSViol- UncorrIntErr- BlockedTLP- AtomicOpBlocked- TLPBlockedErr-
			PoisonTLPBlocked- DMWrReqBlocked- IDECheck- MisIDETLP- PCRC_CHECK- TLPXlatBlocked-
		UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP-
			ECRC- UnsupReq- ACSViol- UncorrIntErr- BlockedTLP- AtomicOpBlocked- TLPBlockedErr-
			PoisonTLPBlocked- DMWrReqBlocked- IDECheck- MisIDETLP- PCRC_CHECK- TLPXlatBlocked-
		UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+
			ECRC- UnsupReq- ACSViol- UncorrIntErr- BlockedTLP- AtomicOpBlocked- TLPBlockedErr-
			PoisonTLPBlocked- DMWrReqBlocked- IDECheck- MisIDETLP- PCRC_CHECK- TLPXlatBlocked-
		CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr- CorrIntErr- HeaderOF-
		CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr+ CorrIntErr- HeaderOF-
		AERCap: First Error Pointer: 00, ECRCGenCap+ ECRCGenEn- ECRCChkCap+ ECRCChkEn-
			MultHdrRecCap- MultHdrRecEn- TLPPfxPres- HdrLogCap-
		HeaderLog: 00000000 00000000 00000000 00000000
	Capabilities: [138 v1] Power Budgeting <?>
	Capabilities: [150 v1] Single Root I/O Virtualization (SR-IOV)
		IOVCap: Migration- 10BitTagReq- IntMsgNum 0
		IOVCtl: Enable- Migration- Interrupt- MSE- ARIHierarchy+ 10BitTagReq-
		IOVSta: Migration-
		Initial VFs: 16, Total VFs: 16, Number of VFs: 0, Function Dependency Link: 00
		VF offset: 1, stride: 1, Device ID: 0072
		Supported Page Size: 00000553, System Page Size: 00000001
		Region 0: Memory at 000000003fcc4000 (64-bit, non-prefetchable)
		Region 2: Memory at 000000003f8c0000 (64-bit, non-prefetchable)
		VF Migration: offset: 00000000, BIR: 0
	Capabilities: [190 v1] Alternative Routing-ID Interpretation (ARI)
		ARICap: MFVC- ACS-, Next Function: 0
		ARICtl: MFVC- ACS-, Function Group: 0
	Kernel driver in use: vfio-pci
	Kernel modules: mpt3sas
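In case it helps anyone reading the dump: the two lines I keep looking at are LnkCap (what the card can do) and LnkSta (what was actually negotiated). Here is a rough Python sketch for pulling both widths out of `lspci -vv` text. The function name `link_widths` and the two-line sample are just mine for illustration, and the sample lines are copied from the output above.

```python
import re

def link_widths(lspci_text):
    """Return (capable_width, negotiated_width) from `lspci -vv` text, or None."""
    # "LnkCap:" / "LnkSta:" with the colon, so LnkCap2/LnkSta2 lines are not matched.
    cap = re.search(r"LnkCap:.*?Width x(\d+)", lspci_text)
    sta = re.search(r"LnkSta:.*?Width x(\d+)", lspci_text)
    if not cap or not sta:
        return None
    return int(cap.group(1)), int(sta.group(1))

# Two lines taken verbatim from the lspci output in this post.
sample = """\
LnkCap: Port #0, Speed 5GT/s, Width x8, ASPM L0s, Exit Latency L0s <64ns
LnkSta: Speed 5GT/s, Width x2 (downgraded)
"""

cap, sta = link_widths(sample)
print(f"capable x{cap}, negotiated x{sta}")  # capable x8, negotiated x2
```

To use it on a live system, the text could come from running `lspci -vv -s 02:00.0` and passing its output to the function.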
Claude also pointed out that I had put the card in the wrong PCIe slot, so I have since moved it to a more appropriate one (it's now in an x8 slot). Sadly, the move did not fix the issue.
journalctl also reports this:
Furthermore, while watching the shutdown over KVM over IP, after an exhaustingly long "blinking underscore" screen, I managed to grab this screenshot a minute or so before the machine actually powered off:
I also have another issue with network speed, which I noticed while using SMB from the Unraid VM to my Windows PC. On bare metal it was fine, with gigabit speeds; now it stays stable at gigabit for, I don't know, a day or a week, and then speeds plummet.
iperf3 then reports 13 Mbits/sec instead of the usual ~950. Rebooting the VM usually fixes it for a while.
I'm unsure if it's related but I'm reporting all I can haha.
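To pin down when the slowdown starts, I'm thinking of logging iperf3 results on a schedule. A minimal Python sketch, assuming a TCP test run with `iperf3 -c <host> -J` and that the JSON report contains `end.sum_received.bits_per_second`; the helper names, the sample report, and the 50% threshold are my own placeholders, not measured data.

```python
import json

def received_mbits(iperf_json):
    """Extract receiver-side throughput in Mbits/sec from an `iperf3 -J` report."""
    report = json.loads(iperf_json)
    return report["end"]["sum_received"]["bits_per_second"] / 1e6

def is_degraded(mbits, expected=950.0, fraction=0.5):
    """True when throughput falls below a fraction of the expected rate."""
    return mbits < expected * fraction

# Hand-written stand-in for a real report, using the 13 Mbits/sec figure from above.
sample = json.dumps({"end": {"sum_received": {"bits_per_second": 13_000_000.0}}})

mbits = received_mbits(sample)
print(f"{mbits:.0f} Mbits/sec, degraded: {is_degraded(mbits)}")
```

Run from cron with a timestamp next to each result, this should show whether the drop lines up with anything else in the logs.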
Any help in fixing this issue is very much appreciated, and sorry if I posted in the wrong place.
Please tell me if you need more logs or command outputs.