Hello all,
I will deploy a Proxmox host in the next few days and wanted to poll users here about the best hard drive strategy. I have had reasonably good experience (as a hobbyist) with Proxmox 6.4 in the past on a Dell Precision 7810 paired with two Xeon E5-2698 v4 CPUs.
Here is the host setup:
- Ryzen 9 7950X (16 cores)
- Motherboard MSI Pro B650-Wifi
- 128 GB of DDR5 memory split over 4 banks
- 2 x 2 TB NVMe SSD
- 1 x Nvidia 3090 Ti
- 1 x Nvidia Quadro T1000
- 1 x 16 TB magnetic hard drive (Exos Enterprise X18)
I was thinking about the following layout:
Machine 1: For machine learning workloads
- 10 cores
- Ubuntu LTS
- 80 GB RAM
- 500 GB disk space in ZFS partition (see host)
- 500 GB NVMe space passed directly to the VM, for performance
- Nvidia 3090 Ti passed through as a PCI device directly to the VM (so blacklisted on the host)
- This machine is likely to see heavy usage day and night. It could be reasonably I/O intensive at times.
Machine 2: For conventional office tasks, etc.
- 4 cores
- Windows 10
- 32 GB RAM
- 500 GB disk space in ZFS partition (see host)
- Nvidia Quadro T1000 passed through as a PCI device directly to the VM (so blacklisted on the host)
- This machine is likely to see light usage during the day (office-type workloads, SolidWorks, nothing crazy) and virtually none at night.
For the Proxmox host:
- 2 cores
- 20 GB RAM
- OS installed on the magnetic HD
- ZFS RAIDZ1 (mirrored) partition on the NVMe SSDs
- Use leftover space on the magnetic hard drive to perform daily backups
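As a quick sanity check on the numbers above, here is a rough Python tally of the planned allocations. It is only a sketch: the dictionary and variable names are made up for illustration, and treating the host's share as a fixed reservation is an assumption, since the host is not given cores or RAM the way a VM is.

```python
# Physical resources of the box, taken from the host setup above.
HOST_CORES = 16
HOST_RAM_GB = 128

# Planned allocations from the layout above.
# "proxmox_host" is treated as a fixed reservation (an assumption for
# this tally; the hypervisor itself is not sized like a VM).
plan = {
    "ml_vm": {"cores": 10, "ram_gb": 80},
    "office_vm": {"cores": 4, "ram_gb": 32},
    "proxmox_host": {"cores": 2, "ram_gb": 20},
}


def totals(allocations):
    """Return (total cores, total RAM in GB) across all entries."""
    cores = sum(a["cores"] for a in allocations.values())
    ram_gb = sum(a["ram_gb"] for a in allocations.values())
    return cores, ram_gb


total_cores, total_ram_gb = totals(plan)
print(f"cores: {total_cores} of {HOST_CORES}")
print(f"RAM:   {total_ram_gb} GB of {HOST_RAM_GB} GB")
```

The cores add up to exactly 16, but the RAM figures add up to 132 GB, which is 4 GB more than the 128 GB installed, so one of the three numbers would have to come down.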
Here are my questions:
1) Any use for a SLOG / ZIL with the NVMe SSDs? If so, can I do that off a partition on either NVMe SSD?
2) Any foreseeable risks of underprovisioning the host (RAM, CPU, or both)? I know ZFS can play hardball at times.
3) Any better way of accomplishing this?
4) Last time I used Proxmox I could not do live backups; I probably did something wrong. What are the gotchas for enabling live backups?
5) Do I need CPU pinning for the machine learning VM?
6) What about NUMA? Would either machine benefit from it? Do I need to allocate according to the hardware layout (e.g. 64 GB and only 8 cores)?
7) Do I need to disable the onboard graphics provided by the 7950X?
TIA!
PS: I will very likely be buying one of the support options with SSH, not sure which one though.