Dear Proxmox Forum,
I am trying to setup a new cluster consisting of 3 Supermicro X13-SAE-F Mainboards. I also installed a common Intel X710 Network adapter for 10G networking (for Ceph):
02:00.0 Ethernet controller [0200]: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ [8086:1572] (rev
02)
Subsystem: Intel Corporation Ethernet Converged Network Adapter X710-2 [8086:0008]
Kernel modules: i40e
02:00.1 Ethernet controller [0200]: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ [8086:1572] (rev
02)
Subsystem: Intel Corporation Ethernet Converged Network Adapter X710 [8086:0000]
Kernel modules: i40e
When booting the Linux kernel (Debian / Proxmox) the system runs into some timeouts and networking is down. The reason lies in the loading of the kernel module for the Intel X710 network adapter (i40e). When loading this module it tells in the log:
[Sun Dec 29 23:54:15 2024] i40e 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible
[Sun Dec 29 23:56:26 2024] i40e 0000:02:00.0: PF reset failed: -5
[Sun Dec 29 23:56:26 2024] i40e 0000:02:00.0: Entering recovery mode due to repeated FW resets. This may take several minutes. Refer to the Intel(R) Ethernet Adapters and Devices User Guide.
[Sun Dec 29 23:56:26 2024] i40e: probe of 0000:02:00.0 failed with error -5
[Sun Dec 29 23:56:26 2024] i40e 0000:02:00.1: Unable to change power state from D3cold to D0, device inaccessible
[Sun Dec 29 23:58:37 2024] i40e 0000:02:00.1: PF reset failed: -5
[Sun Dec 29 23:58:37 2024] i40e 0000:02:00.1: Entering recovery mode due to repeated FW resets. This may take several minutes. Refer to the Intel(R) Ethernet Adapters and Devices User Guide.
[Sun Dec 29 23:58:37 2024] i40e: probe of 0000:02:00.1 failed with error -5
Our Linux kernel version is 6.8.12-4-pve
Some similar problems seem to have been fixed by adding the following as kernel boot parameter:
pcie_port_pm=off
Unfortunately in this case, it does not help.
Any clue how to solve this problem?
Regards,
Hermann
I am trying to setup a new cluster consisting of 3 Supermicro X13-SAE-F Mainboards. I also installed a common Intel X710 Network adapter for 10G networking (for Ceph):
02:00.0 Ethernet controller [0200]: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ [8086:1572] (rev
02)
Subsystem: Intel Corporation Ethernet Converged Network Adapter X710-2 [8086:0008]
Kernel modules: i40e
02:00.1 Ethernet controller [0200]: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ [8086:1572] (rev
02)
Subsystem: Intel Corporation Ethernet Converged Network Adapter X710 [8086:0000]
Kernel modules: i40e
When booting the Linux kernel (Debian / Proxmox) the system runs into some timeouts and networking is down. The reason lies in the loading of the kernel module for the Intel X710 network adapter (i40e). When loading this module it tells in the log:
[Sun Dec 29 23:54:15 2024] i40e 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible
[Sun Dec 29 23:56:26 2024] i40e 0000:02:00.0: PF reset failed: -5
[Sun Dec 29 23:56:26 2024] i40e 0000:02:00.0: Entering recovery mode due to repeated FW resets. This may take several minutes. Refer to the Intel(R) Ethernet Adapters and Devices User Guide.
[Sun Dec 29 23:56:26 2024] i40e: probe of 0000:02:00.0 failed with error -5
[Sun Dec 29 23:56:26 2024] i40e 0000:02:00.1: Unable to change power state from D3cold to D0, device inaccessible
[Sun Dec 29 23:58:37 2024] i40e 0000:02:00.1: PF reset failed: -5
[Sun Dec 29 23:58:37 2024] i40e 0000:02:00.1: Entering recovery mode due to repeated FW resets. This may take several minutes. Refer to the Intel(R) Ethernet Adapters and Devices User Guide.
[Sun Dec 29 23:58:37 2024] i40e: probe of 0000:02:00.1 failed with error -5
Our Linux kernel version is 6.8.12-4-pve
Some similar problems seem to have been fixed by adding the following as kernel boot parameter:
pcie_port_pm=off
Unfortunately in this case, it does not help.
Any clue how to solve this problem?
Regards,
Hermann