Hey people! First time poster here.
I'm a middle-aged, EU-based dude with background in software engineering. Nothing too fancy, but I can find my way around most of the "IT" problems I encounter in life. I have never, however, seriously ventured into neither virtualization, hypervisors, nor clusters.
My first motivation to get Proxmox set up was to migrate my Home Assistant away from Raspberry Pi, I was running it off an SD card and the secondary reason was to consolidate the rest of 24/7 household services to a unified hardware system: mainly the Roon server, Pi-hole and a torrent client - I had an old and loud desktop box for those so I wanted to get rid of that. I'll also need a windows VM at some point and I'm sure I'll find a bunch of services I'll want to run locally when the whole thing is up.
I realize I might be all over the place, please help me out.
I'm a middle-aged, EU-based dude with background in software engineering. Nothing too fancy, but I can find my way around most of the "IT" problems I encounter in life. I have never, however, seriously ventured into neither virtualization, hypervisors, nor clusters.
My first motivation to get Proxmox set up was to migrate my Home Assistant away from Raspberry Pi, I was running it off an SD card and the secondary reason was to consolidate the rest of 24/7 household services to a unified hardware system: mainly the Roon server, Pi-hole and a torrent client - I had an old and loud desktop box for those so I wanted to get rid of that. I'll also need a windows VM at some point and I'm sure I'll find a bunch of services I'll want to run locally when the whole thing is up.
So took the plunge and fast forward a month - I have a Dell Optiplex 7080 micro with a 10500t and 32gb ram running all 4 aforementioned services, another identical box on the way and an old Haswell era NUC, along with the RPi4 that used to previously run Home Assistant. Oh, and I also have an old Synology DS409 for a SMB/NFS NAS. Network is unifi everything (UDR, etc)
I'm not sleeping well though, because the 2 boxes I've ran Proxmox on, both started acting up at some point. Before I got the Dell, I was testing out PVE on the NUC (even upgraded RAM in it) and every couple of days it froze and it needed to be hard reset. I didn't manage to find out the root cause, but I'm suspecting the SSD. Thermals were okay, RAM was brand new, SSD was an old Crucial mSATA with a cryptic 4% wearout that I still don't know if it's counting up or down. In any case, the Dell arrived and that was going to be my new server! I'm planning on setting up Home Assistant to run in high availability, so either 2 or 3 node cluster is in the future. But first I migrate the 2 VM + 2 CT (from backups) to a fresh 8.3.4 Proxmox. Btw I'm running PBS off the RPi4, storage is NFS and a USB WD drive.
Everything is beautiful, until the exact same thing (apparently) started happening on the Dell - everything becomes unresponsive, web interface isn't serving, can't SSH into hypervisor, can't SSH into any of the VMs, can't ping anything, the only thing that was showing signs of life was Home Assistant performing some very simple functions off cached frontend. But no point in diagnosing it any further, because yesterday the nvme (root) drive just died, so I guess that was the reason. Kinda suspicious though, two different SSDs in two different systems acting up?
So if you read up to this point - thanks for doing so. I will now list some questions and if you can pitch in with a suggestion, advice, comment - anything at all, I'll be grateful.
I'm not sleeping well though, because the 2 boxes I've ran Proxmox on, both started acting up at some point. Before I got the Dell, I was testing out PVE on the NUC (even upgraded RAM in it) and every couple of days it froze and it needed to be hard reset. I didn't manage to find out the root cause, but I'm suspecting the SSD. Thermals were okay, RAM was brand new, SSD was an old Crucial mSATA with a cryptic 4% wearout that I still don't know if it's counting up or down. In any case, the Dell arrived and that was going to be my new server! I'm planning on setting up Home Assistant to run in high availability, so either 2 or 3 node cluster is in the future. But first I migrate the 2 VM + 2 CT (from backups) to a fresh 8.3.4 Proxmox. Btw I'm running PBS off the RPi4, storage is NFS and a USB WD drive.
Everything is beautiful, until the exact same thing (apparently) started happening on the Dell - everything becomes unresponsive, web interface isn't serving, can't SSH into hypervisor, can't SSH into any of the VMs, can't ping anything, the only thing that was showing signs of life was Home Assistant performing some very simple functions off cached frontend. But no point in diagnosing it any further, because yesterday the nvme (root) drive just died, so I guess that was the reason. Kinda suspicious though, two different SSDs in two different systems acting up?
So if you read up to this point - thanks for doing so. I will now list some questions and if you can pitch in with a suggestion, advice, comment - anything at all, I'll be grateful.
- Main requirement is rock solid Home Assistant (tricky, I know - but that's a separate story) and Roon Rock, which means some kind of redundancy - will PVE HA do the job?
- My plan is to have a 2 node cluster with identical(ish) Dell 7080 micro boxes and a qdevice running off the RPi4 that's also running PBS - it's not the worst idea for a homelab enviroment, right?
- Including the NUC doesn't really make a lot of sense, so it would probably be a cold spare node, is there a better use for it?
- The main thing I'm struggling with is, what to use for storage. I haven't dealt with SSDs a whole lot before, but apparently the differences are significant. This is what I really need help with, please!
- I'm running out of funds, so I'd like to keep it cheap.
- was it somehow my fault the drive(s?) are dying?
- i don't need a ton of volume, I'm thinking I'd probably be perfectly fine with 500 gb per node.
- I can fit 2x 2280 gen3 nvme, plus a 2.5" sata into the Dells.
- do I need a separate, small but "enterprise-grade" drive for root? or do I really need those for HA'd VM's shared storage?
- If I'll be getting into HA soon, does that affect the requirements of the SSDs?
- I can't wrap my head around: ZFS being hard on SSDs, best way to set up shared storage for HA machines (CEPH I guess?), and ultimately, what affordable drives to get.
- I don't think I need to run mirrored drives in nodes if I'll be doing HA, so I guess at least that means couple less drives.
I realize I might be all over the place, please help me out.