I have a 3 node NUC cluster where I do a thunderbolt-net mesh/routed network between the 3 nodes. (Routed 26Gbps is pretty awesome)
In one scenario it generates the error in the title :
To expand on the failure test scenarios that work vs don't work:
I find the following scenarios super reliable:
However, for this scenario alone things are not so pretty:
I know this is a very edge case scenario, i am hoping someone has seen this on other devices and found a fix.
I am running Proxmox 8
In one scenario it generates the error in the title :
pve1 kernel: thunderbolt 1-1: failed request link state change, aborting
To expand on the failure test scenarios that work vs don't work:
I find the following scenarios super reliable:
- pulling one of the three TB cables
- Rebooting any node
- hard failure of any node (pull the power cord)
However, for this scenario alone things are not so pretty:
- shutdown a node (gracefully) and power back on by pressing the front button
I know this is a very edge case scenario, i am hoping someone has seen this on other devices and found a fix.
I am running Proxmox 8
Code:
[ 1.585102] ACPI: bus type thunderbolt registered.
[ 3.532746] thunderbolt 0-0:1.1: new retimer found, vendor=0x8087 device=0x15ee
[ 5.471801] thunderbolt 1-0:1.1: new retimer found, vendor=0x8087 device=0x15ee
[ 17.035024] thunderbolt 0-1: new host found, vendor=0x8086 device=0x1
[ 17.035028] thunderbolt 0-1: Intel Corp. pve3
[ 17.038497] thunderbolt-net 0-1.0 en05: renamed from thunderbolt0
[ 18.230611] thunderbolt 1-1: failed request link state change, aborting
....
[ 83.895648] thunderbolt 1-1: failed request link state change, aborting
[ 84.919547] thunderbolt 1-1: failed request link state change, aborting
[ 85.943324] thunderbolt 1-1: failed request link state change, aborting
[ 86.899519] thunderbolt 1-0:1.1: retimer disconnected
[ 91.407058] thunderbolt 1-0:1.1: new retimer found, vendor=0x8087 device=0x15ee
[ 96.726934] thunderbolt 1-1: new host found, vendor=0x8086 device=0x1
[ 96.726938] thunderbolt 1-1: Intel Corp. pve2
[ 96.729412] thunderbolt-net 1-1.0 en06: renamed from thunderbolt0