Optiplex 7010 Proxmox and VMs unresponsive and inaccessible - tips to troubleshoot?

breadman

New Member
Jul 18, 2025
Hi, I'm having issues with a new Proxmox build. I would appreciate any tips or links to help troubleshoot, as I'm stumped...

I recently acquired a Dell OptiPlex 7010 Micro (13th gen i5-13500T) and upgraded the RAM to 64 GB and the NVMe SSD to 2 TB.

Installed Proxmox and passed through the iGPU (UHD 770) to one of the VMs (used for Plex). I think it was all done successfully, as CPU usage is very low when hardware transcoding. Not 100% sure here, but I think it's fine.

Also restored another VM from my other Proxmox PC, a simple Ubuntu VM used for various Docker containers I run. Again, all seems to work fine, and I was very impressed with using PBS to restore.

The OptiPlex isn't normally plugged into a monitor, but when it is there is no output. I think that's because I have passed through the iGPU to the VM as a PCI device?

After a period of time the VMs and the Proxmox IP become unresponsive. I haven't worked out the exact time, but it happens within 12-24 hours, and then I have to restart the PC.

It's like it's asleep or has shut down. The actual OptiPlex box has a green light and looks/sounds like it's on, but nothing responds. My router can't see the device or the VMs either.

Steps I've tried so far:
- Upgraded the BIOS
- Turned off all the BIOS settings I think might make it sleep, although I haven't really found a good guide or video that matches my BIOS
- Tried looking at various logs, but can't find half the logs that should exist

So can anyone point me in the right direction on how to fix this? Happy to paste any logs required (if I can find them).

Thanks
 
You're going to have to check the logs from around the time of the crash.

To troubleshoot, you should probably try running the node without the passed-through iGPU. So maybe disable that VM (the one with the passed-through iGPU) from starting at boot, then reboot the node & check whether it still crashes after a period of time.

Next you could test running the node without any VMs running, & see if it is stable.
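In case it helps, here is a rough sketch of those two steps on the command line. The VMID `100` is only a placeholder (check `qm list` for the real one), and the `run`, `disable_autostart` and `stop_vm` helpers are names made up for this example. It defaults to a dry run that only prints the `qm` commands, so nothing changes until you set `DRY_RUN=0`.

```shell
#!/usr/bin/env bash
# Sketch only. VMID 100 is a placeholder; look up the real ID with `qm list`.
# DRY_RUN=1 (the default here) prints the commands instead of running them.
DRY_RUN="${DRY_RUN:-1}"

# Hypothetical wrapper: print the command in dry-run mode, otherwise run it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Step 1: stop the iGPU passthrough VM from starting at boot.
disable_autostart() {
    run qm set "$1" --onboot 0
}

# Step 2 (later, to test the node with no VMs at all): shut down a running VM.
stop_vm() {
    run qm shutdown "$1"
}

disable_autostart 100
```

Once testing is done, `qm set 100 --onboot 1` should turn autostart back on for that VM.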

You should probably also check the RAM properly.

Good luck.
 
Thanks - which logs should I be checking? I've tried various logs I found by googling but can't find anything.
 
Assuming the crash happened today at 01:00:00, you would probably use something in the following format:
Code:
journalctl --since "2025-07-18 00:55:00" --until "2025-07-18 01:01:00"
This will give you a 6-minute window around the crash time. Adjust accordingly & widen or narrow the window as needed.
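If you end up adjusting the window a lot, a small helper can build the `--since`/`--until` pair for you. This is just a sketch: `crash_window` is a made-up function name, the 5 and 1 minute defaults are arbitrary, and it assumes GNU `date` (which a Debian-based Proxmox node should have).

```shell
#!/usr/bin/env bash
# crash_window is a hypothetical helper, not a Proxmox or systemd tool.
# It prints a journalctl command covering a window around a crash time.
# Assumes GNU date (for -d and the @epoch syntax).
crash_window() {
    local crash="$1"          # crash time, e.g. "2025-07-18 01:00:00"
    local before="${2:-5}"    # minutes to include before the crash
    local after="${3:-1}"     # minutes to include after the crash
    local epoch start end
    epoch=$(date -d "$crash" +%s) || return 1
    start=$(date -d "@$((epoch - before * 60))" '+%Y-%m-%d %H:%M:%S')
    end=$(date -d "@$((epoch + after * 60))" '+%Y-%m-%d %H:%M:%S')
    printf 'journalctl --since "%s" --until "%s"\n' "$start" "$end"
}

# Prints the same 6-minute window as the command above.
crash_window "2025-07-18 01:00:00" 5 1
```

If you don't know the crash time yet, `journalctl --list-boots` lists the boots the journal knows about, and `journalctl -b -1 -e` jumps to the end of the previous boot's log, i.e. the last messages written before the forced restart. That only works if the journal is stored persistently, which I believe is the case on a default Proxmox install.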
 
Thanks - did a clean install and ran into the same problem. Found this in the logs:


Code:
Jul 19 18:34:56 pve kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                              TDH                  <7a>
                              TDT                  <a1>
                              next_to_use          <a1>
                              next_to_clean        <79>
                            buffer_info[next_to_clean]:
                              time_stamp           <100086118>
                              next_to_watch        <7a>
                              jiffies              <100086740>
                              next_to_watch.status <0>
                            MAC Status             <40080083>
                            PHY Status             <796d>
                            PHY 1000BASE-T Status  <7800>
                            PHY Extended Status    <3000>
                            PCI Status             <10>
Jul 19 18:34:58 pve kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                              TDH                  <7a>
                              TDT                  <a1>
                              next_to_use          <a1>
                              next_to_clean        <79>
                            buffer_info[next_to_clean]:
                              time_stamp           <100086118>
                              next_to_watch        <7a>
                              jiffies              <100086f00>
                              next_to_watch.status <0>
                            MAC Status             <40080083>
                            PHY Status             <796d>
                            PHY 1000BASE-T Status  <7800>
                            PHY Extended Status    <3000>
                            PCI Status             <10>
Jul 19 18:35:00 pve kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                              TDH                  <7a>
                              TDT                  <a1>
                              next_to_use          <a1>
                              next_to_clean        <79>
                            buffer_info[next_to_clean]:
                              time_stamp           <100086118>
                              next_to_watch        <7a>
                              jiffies              <100087706>
                              next_to_watch.status <0>
                            MAC Status             <40080083>
                            PHY Status             <796d>
                            PHY 1000BASE-T Status  <7800>
                            PHY Extended Status    <3000>
                            PCI Status             <10>
Jul 19 18:35:02 pve kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                              TDH                  <7a>
                              TDT                  <a1>
                              next_to_use          <a1>
                              next_to_clean        <79>
                            buffer_info[next_to_clean]:
                              time_stamp           <100086118>
                              next_to_watch        <7a>
                              jiffies              <100087ec0>
                              next_to_watch.status <0>
                            MAC Status             <40080083>
                            PHY Status             <796d>
                            PHY 1000BASE-T Status  <7800>
                            PHY Extended Status    <3000>
                            PCI Status             <10>
Jul 19 18:35:04 pve kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                              TDH                  <7a>
                              TDT                  <a1>
                              next_to_use          <a1>
                              next_to_clean        <79>
                            buffer_info[next_to_clean]:
                              time_stamp           <100086118>
                              next_to_watch        <7a>
                              jiffies              <100088680>
                              next_to_watch.status <0>
                            MAC Status             <40080083>
                            PHY Status             <796d>
                            PHY 1000BASE-T Status  <7800>
                            PHY Extended Status    <3000>
                            PCI Status             <10>
Jul 19 18:35:06 pve kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                              TDH                  <7a>
                              TDT                  <a1>
                              next_to_use          <a1>
                              next_to_clean        <79>
                            buffer_info[next_to_clean]:
                              time_stamp           <100086118>
                              next_to_watch        <7a>
                              jiffies              <100088e40>
                              next_to_watch.status <0>
                            MAC Status             <40080083>
                            PHY Status             <796d>
                            PHY 1000BASE-T Status  <7800>
                            PHY Extended Status    <3000>
                            PCI Status             <10>
Jul 19 18:35:08 pve kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                              TDH                  <7a>
                              TDT                  <a1>
                              next_to_use          <a1>
                              next_to_clean        <79>
                            buffer_info[next_to_clean]:
                              time_stamp           <100086118>
                              next_to_watch        <7a>
                              jiffies              <100089640>
                              next_to_watch.status <0>
                            MAC Status             <40080083>
                            PHY Status             <796d>
                            PHY 1000BASE-T Status  <7800>
                            PHY Extended Status    <3000>
                            PCI Status             <10>
Jul 19 18:35:10 pve pvestatd[1147]: pbs: error fetching datastores - 500 Can't connect to 192.168.68.70:>
Jul 19 18:35:10 pve pvestatd[1147]: status update time (7.132 seconds)
Jul 19 18:35:10 pve kernel: e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                              TDH                  <7a>
                              TDT                  <a1>
                              next_to_use          <a1>
                              next_to_clean        <79>
                            buffer_info[next_to_clean]:
                              time_stamp           <100086118>
                              next_to_watch        <7a>
                              jiffies              <100089e00>
                              next_to_watch.status <0>
                            MAC Status             <40080083>