Hi all,
i have a problem with my proxmox setup.
I'm running two nodes each with a 512GB NVMe SSD, i5 8500T and 32GB Memory. On this two machines i'm running 4 VMs. A Docker Host with ~15 Containers, a Mailserver (Mailcow Dockerized), a VPN server and a Proxy. All those machines are replicated through ZFS to each other every minute.
One disk has 1700h and 67TB written, the other one on the other host has 3300h and 66TB written. What the hell is causing this excessive writes? i mean for the first drive this is about 40GB/h or 11MB/s continuous.
This is the output of iostat:
I don't trust this iostat because the MB_wrtn/s doesn't change anymore, it's stuck on 6.00 MB/s.
Even with replication disabled and no VM or container running on the host its still more than 2MB/s:
I read about write amplification here: https://forum.proxmox.com/threads/high-ssd-wear-after-a-few-days.24840/#post-124488 but i have no idea how i can check this and how to find the reason for this.
As a first try i set the replication schedule to every two hours but this doesn't solve my problem at all... Please help
i have a problem with my proxmox setup.
I'm running two nodes each with a 512GB NVMe SSD, i5 8500T and 32GB Memory. On this two machines i'm running 4 VMs. A Docker Host with ~15 Containers, a Mailserver (Mailcow Dockerized), a VPN server and a Proxy. All those machines are replicated through ZFS to each other every minute.
One disk has 1700h and 67TB written, the other one on the other host has 3300h and 66TB written. What the hell is causing this excessive writes? i mean for the first drive this is about 40GB/h or 11MB/s continuous.
This is the output of iostat:
Code:
Device tps MB_read/s MB_wrtn/s MB_read MB_wrtn
nvme0n1 267.57 0.50 6.00 643034 7728174
Even with replication disabled and no VM or container running on the host its still more than 2MB/s:
Code:
Device tps MB_read/s MB_wrtn/s MB_read MB_wrtn
nvme0n1 162.22 0.55 2.61 543 2598
I read about write amplification here: https://forum.proxmox.com/threads/high-ssd-wear-after-a-few-days.24840/#post-124488 but i have no idea how i can check this and how to find the reason for this.
As a first try i set the replication schedule to every two hours but this doesn't solve my problem at all... Please help