Hi everyone,
I have my test cluster setup with 4 PVE 5.1 nodes having their storage on ceph ssd pool.
I followed the wiki in order to setup HA.
I created a HA group, named cluster, including the 4 nodes with same priority.
I then enabled HA on a test VM, named test-ubuntu, with the following configuration :
Ceph SSD pool is fine, its size and replication rule can support the loss of one node.
I notice that this VM is well relocated on another node of the HA group cluster.
However, the test-ubuntu VM cannot be started due to a timeout.
I do not know where this issue could come from.
Can you please provide me some clues on this ?
Thank you in advance for your help.
Best regards,
Saiki
I have my test cluster setup with 4 PVE 5.1 nodes having their storage on ceph ssd pool.
I followed the wiki in order to setup HA.
I created a HA group, named cluster, including the 4 nodes with same priority.
I then enabled HA on a test VM, named test-ubuntu, with the following configuration :
- max restart : 4
- max relocate : 2
- group : cluster
- request state : started
Ceph SSD pool is fine, its size and replication rule can support the loss of one node.
I notice that this VM is well relocated on another node of the HA group cluster.
However, the test-ubuntu VM cannot be started due to a timeout.
Code:
task started by HA resource agent
TASK ERROR: start failed: command '/usr/bin/kvm -id 114 -chardev 'socket,id=qmp,path=/var/run/qemu-server/114.qmp,server,nowait' -mon 'chardev=qmp,mode=control' -pidfile /var/run/qemu-server/114.pid -daemonize -smbios 'type=1,uuid=08daf2dc-0689-4210-ae75-8021cac53e50' -name test-ubuntu -smp '2,sockets=1,cores=2,maxcpus=2' -nodefaults -boot 'menu=on,strict=on,reboot-timeout=1000,splash=/usr/share/qemu-server/bootsplash.jpg' -vga std -vnc unix:/var/run/qemu-server/114.vnc,x509,password -cpu kvm64,+lahf_lm,+sep,+kvm_pv_unhalt,+kvm_pv_eoi,enforce -m 4096 -k en-us -device 'pci-bridge,id=pci.1,chassis_nr=1,bus=pci.0,addr=0x1e' -device 'pci-bridge,id=pci.2,chassis_nr=2,bus=pci.0,addr=0x1f' -device 'piix3-usb-uhci,id=uhci,bus=pci.0,addr=0x1.0x2' -device 'usb-tablet,id=tablet,bus=uhci.0,port=1' -chardev 'socket,path=/var/run/qemu-server/114.qga,server,nowait,id=qga0' -device 'virtio-serial,id=qga0,bus=pci.0,addr=0x8' -device 'virtserialport,chardev=qga0,name=org.qemu.guest_agent.0' -device 'virtio-balloon-pci,id=balloon0,bus=pci.0,addr=0x3' -iscsi 'initiator-name=iqn.1993-08.org.debian:01:e3e7c3b43fe4' -drive 'if=none,id=drive-ide2,media=cdrom,aio=threads' -device 'ide-cd,bus=ide.1,unit=0,drive=drive-ide2,id=ide2,bootindex=200' -drive 'file=rbd:rbd-ssd/vm-114-disk-1:mon_host=192.168.2.1;192.168.2.2;192.168.2.3;192.168.2.4:auth_supported=cephx:id=admin:keyring=/etc/pve/priv/ceph/ceph-rbd-ssd.keyring,if=none,id=drive-virtio0,cache=writeback,format=raw,aio=threads,detect-zeroes=on' -device 'virtio-blk-pci,drive=drive-virtio0,id=virtio0,bus=pci.0,addr=0xa,bootindex=100' -netdev 'type=tap,id=net0,ifname=tap114i0,script=/var/lib/qemu-server/pve-bridge,downscript=/var/lib/qemu-server/pve-bridgedown,vhost=on' -device 'virtio-net-pci,mac=6E:33:C2:AD:0E:FD,netdev=net0,bus=pci.0,addr=0x12,id=net0,bootindex=300'' failed: got timeout
I do not know where this issue could come from.
Can you please provide me some clues on this ?
Thank you in advance for your help.
Best regards,
Saiki
Last edited: