Hi,
First I thought that the pve no subscription channel has some kernel debugging enabled that filled my root device in less than two days of non continual usage, but even dmesg is listing some errors intead of the usual boot and peripheral information. My logs are bloated of repeated messages. dmesg returns a lot of ones similar to the following:
[34256.095328] pcieport 0000:00:1c.0: AER: Corrected error received: id=00e0
[34256.095333] pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, id=00e0(Receiver ID)
[34256.095337] pcieport 0000:00:1c.0: device [8086:a293] error status/mask=00000001/00002000
[34256.095354] pcieport 0000:00:1c.0: [ 0] Receiver Error (First)
[34256.097803] pcieport 0000:00:1c.0: AER: Multiple Corrected error received: id=00e0
[34256.097917] pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitter ID)
[34256.097921] pcieport 0000:00:1c.0: device [8086:a293] error status/mask=00001100/00002000
[34256.097923] pcieport 0000:00:1c.0: [ 8] RELAY_NUM Rollover
[34256.097925] pcieport 0000:00:1c.0: [12] Replay Timer Timeout
[34256.097929] pcieport 0000:00:1c.0: AER: Multiple Corrected error received: id=00e0
[34256.098090] pcieport 0000:00:1c.0: can't find device of ID00e0
[34256.098092] pcieport 0000:00:1c.0: AER: Multiple Corrected error received: id=00e0
[34256.098151] pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitter ID)
[34256.098155] pcieport 0000:00:1c.0: device [8086:a293] error status/mask=00003100/00002000
[34256.098159] pcieport 0000:00:1c.0: [ 8] RELAY_NUM Rollover
[34256.098162] pcieport 0000:00:1c.0: [12] Replay Timer Timeout
I am setting up this server for an unmanaged poc embedded on a train, which should be running before this weekend. When root partition filled, no vm, no container could start.
Could someone please help me debug this ?
P.S.: This is on a new optiplex 5050 with one intel onboard nic and one rtl8111 added nic.
TIA.
First I thought that the pve no subscription channel has some kernel debugging enabled that filled my root device in less than two days of non continual usage, but even dmesg is listing some errors intead of the usual boot and peripheral information. My logs are bloated of repeated messages. dmesg returns a lot of ones similar to the following:
[34256.095328] pcieport 0000:00:1c.0: AER: Corrected error received: id=00e0
[34256.095333] pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, id=00e0(Receiver ID)
[34256.095337] pcieport 0000:00:1c.0: device [8086:a293] error status/mask=00000001/00002000
[34256.095354] pcieport 0000:00:1c.0: [ 0] Receiver Error (First)
[34256.097803] pcieport 0000:00:1c.0: AER: Multiple Corrected error received: id=00e0
[34256.097917] pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitter ID)
[34256.097921] pcieport 0000:00:1c.0: device [8086:a293] error status/mask=00001100/00002000
[34256.097923] pcieport 0000:00:1c.0: [ 8] RELAY_NUM Rollover
[34256.097925] pcieport 0000:00:1c.0: [12] Replay Timer Timeout
[34256.097929] pcieport 0000:00:1c.0: AER: Multiple Corrected error received: id=00e0
[34256.098090] pcieport 0000:00:1c.0: can't find device of ID00e0
[34256.098092] pcieport 0000:00:1c.0: AER: Multiple Corrected error received: id=00e0
[34256.098151] pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitter ID)
[34256.098155] pcieport 0000:00:1c.0: device [8086:a293] error status/mask=00003100/00002000
[34256.098159] pcieport 0000:00:1c.0: [ 8] RELAY_NUM Rollover
[34256.098162] pcieport 0000:00:1c.0: [12] Replay Timer Timeout
I am setting up this server for an unmanaged poc embedded on a train, which should be running before this weekend. When root partition filled, no vm, no container could start.
Could someone please help me debug this ?
P.S.: This is on a new optiplex 5050 with one intel onboard nic and one rtl8111 added nic.
TIA.