So currently I have a 5 node cluster. 3 nodes are running ceph while the other two are running local storage currently. Ceph in the future.
I went to update the nodes to the current version to get them all on the same version. As before they were slightly different. the ceph cluster being on 8.0.4 and the other two being 8.1.2 I believe. I had some issues with host verification and thought updating may help since they were on different versions.
The newest of the two that are not on ceph work completely fine. And my verification issues resolved. The three nodes that run ceph don't allow me to access any VM's. I cannot change any configurations on any of the VM's or view them in the console. I can change node options and configurations, just not VM info. They show that they start but the task bar below shows failure. I have a list of what it says when I do different things with the VM's.
Starting shows this -
TASK ERROR: start failed: command '/usr/bin/kvm -id 205 -name 'PokehaanCraft2,debug-threads=on' -no-shutdown -chardev 'socket,id=qmp,path=/var/run/qemu-server/205.qmp,server=on,wait=off' -mon 'chardev=qmp,mode=control' -chardev 'socket,id=qmp-event,path=/var/run/qmeventd.sock,reconnect=5' -mon 'chardev=qmp-event,mode=control' -pidfile /var/run/qemu-server/205.pid -daemonize -smbios 'type=1,uuid=f4be94ab-a2e4-4a64-ae0e-22f608c45932' -smp '4,sockets=1,cores=4,maxcpus=4' -nodefaults -boot 'menu=on,strict=on,reboot-timeout=1000,splash=/usr/share/qemu-server/bootsplash.jpg' -vnc 'unix:/var/run/qemu-server/205.vnc,password=on' -cpu host,+kvm_pv_eoi,+kvm_pv_unhalt -m 16384 -object 'iothread,id=iothread-virtioscsi0' -device 'pci-bridge,id=pci.1,chassis_nr=1,bus=pci.0,addr=0x1e' -device 'pci-bridge,id=pci.2,chassis_nr=2,bus=pci.0,addr=0x1f' -device 'pci-bridge,id=pci.3,chassis_nr=3,bus=pci.0,addr=0x5' -device 'vmgenid,guid=48b412a3-6ff1-4421-8986-a830fee1050e' -device 'piix3-usb-uhci,id=uhci,bus=pci.0,addr=0x1.0x2' -device 'usb-tablet,id=tablet,bus=uhci.0,port=1' -device 'VGA,id=vga,bus=pci.0,addr=0x2' -device 'virtio-balloon-pci,id=balloon0,bus=pci.0,addr=0x3,free-page-reporting=on' -iscsi 'initiator-name=iqn.1993-08.org.debian:01:cbe9613f797c' -drive 'if=none,id=drive-ide2,media=cdrom,aio=io_uring' -device 'ide-cd,bus=ide.1,unit=0,drive=drive-ide2,id=ide2,bootindex=101' -device 'virtio-scsi-pci,id=virtioscsi0,bus=pci.3,addr=0x1,iothread=iothread-virtioscsi0' -drive 'file=rbd:ceph-pool/vm-205-disk-0:conf=/etc/pve/ceph.conf:id=admin:keyring=/etc/pve/priv/ceph/ceph-pool.keyring,if=none,id=drive-scsi0,format=raw,cache=none,aio=io_uring,detect-zeroes=on' -device 'scsi-hd,bus=virtioscsi0.0,channel=0,scsi-id=0,lun=0,drive=drive-scsi0,id=scsi0,rotation_rate=1,bootindex=100' -netdev 'type=tap,id=net0,ifname=tap205i0,script=/var/lib/qemu-server/pve-bridge,downscript=/var/lib/qemu-server/pve-bridgedown,vhost=on' -device 'virtio-net-pci,mac=CA:B7:59:7F0:83,netdev=net0,bus=pci.0,addr=0x12,id=net0,rx_queue_size=1024,tx_queue_size=256,bootindex=102' -machine 'type=pc+pve0'' failed: got timeout
Changing any options does not technically error. It just never changes anything and endlessly tries to process it.
Trying to open consoles gives this error -
VM 205 qmp command 'set_password' failed - unable to connect to VM 205 qmp socket - timeout after 51 retries
TASK ERROR: Failed to run vncproxy.
I am seeing that two of my OSD's are down and won't start for any reason. Although Proxmox says Done! but they never start. Is this an issue with a new version of Ceph? Is it something where the nodes upgraded but the VM's ar ebased on old versions?
I went to update the nodes to the current version to get them all on the same version. As before they were slightly different. the ceph cluster being on 8.0.4 and the other two being 8.1.2 I believe. I had some issues with host verification and thought updating may help since they were on different versions.
The newest of the two that are not on ceph work completely fine. And my verification issues resolved. The three nodes that run ceph don't allow me to access any VM's. I cannot change any configurations on any of the VM's or view them in the console. I can change node options and configurations, just not VM info. They show that they start but the task bar below shows failure. I have a list of what it says when I do different things with the VM's.
Starting shows this -
TASK ERROR: start failed: command '/usr/bin/kvm -id 205 -name 'PokehaanCraft2,debug-threads=on' -no-shutdown -chardev 'socket,id=qmp,path=/var/run/qemu-server/205.qmp,server=on,wait=off' -mon 'chardev=qmp,mode=control' -chardev 'socket,id=qmp-event,path=/var/run/qmeventd.sock,reconnect=5' -mon 'chardev=qmp-event,mode=control' -pidfile /var/run/qemu-server/205.pid -daemonize -smbios 'type=1,uuid=f4be94ab-a2e4-4a64-ae0e-22f608c45932' -smp '4,sockets=1,cores=4,maxcpus=4' -nodefaults -boot 'menu=on,strict=on,reboot-timeout=1000,splash=/usr/share/qemu-server/bootsplash.jpg' -vnc 'unix:/var/run/qemu-server/205.vnc,password=on' -cpu host,+kvm_pv_eoi,+kvm_pv_unhalt -m 16384 -object 'iothread,id=iothread-virtioscsi0' -device 'pci-bridge,id=pci.1,chassis_nr=1,bus=pci.0,addr=0x1e' -device 'pci-bridge,id=pci.2,chassis_nr=2,bus=pci.0,addr=0x1f' -device 'pci-bridge,id=pci.3,chassis_nr=3,bus=pci.0,addr=0x5' -device 'vmgenid,guid=48b412a3-6ff1-4421-8986-a830fee1050e' -device 'piix3-usb-uhci,id=uhci,bus=pci.0,addr=0x1.0x2' -device 'usb-tablet,id=tablet,bus=uhci.0,port=1' -device 'VGA,id=vga,bus=pci.0,addr=0x2' -device 'virtio-balloon-pci,id=balloon0,bus=pci.0,addr=0x3,free-page-reporting=on' -iscsi 'initiator-name=iqn.1993-08.org.debian:01:cbe9613f797c' -drive 'if=none,id=drive-ide2,media=cdrom,aio=io_uring' -device 'ide-cd,bus=ide.1,unit=0,drive=drive-ide2,id=ide2,bootindex=101' -device 'virtio-scsi-pci,id=virtioscsi0,bus=pci.3,addr=0x1,iothread=iothread-virtioscsi0' -drive 'file=rbd:ceph-pool/vm-205-disk-0:conf=/etc/pve/ceph.conf:id=admin:keyring=/etc/pve/priv/ceph/ceph-pool.keyring,if=none,id=drive-scsi0,format=raw,cache=none,aio=io_uring,detect-zeroes=on' -device 'scsi-hd,bus=virtioscsi0.0,channel=0,scsi-id=0,lun=0,drive=drive-scsi0,id=scsi0,rotation_rate=1,bootindex=100' -netdev 'type=tap,id=net0,ifname=tap205i0,script=/var/lib/qemu-server/pve-bridge,downscript=/var/lib/qemu-server/pve-bridgedown,vhost=on' -device 'virtio-net-pci,mac=CA:B7:59:7F0:83,netdev=net0,bus=pci.0,addr=0x12,id=net0,rx_queue_size=1024,tx_queue_size=256,bootindex=102' -machine 'type=pc+pve0'' failed: got timeout
Changing any options does not technically error. It just never changes anything and endlessly tries to process it.
Trying to open consoles gives this error -
VM 205 qmp command 'set_password' failed - unable to connect to VM 205 qmp socket - timeout after 51 retries
TASK ERROR: Failed to run vncproxy.
I am seeing that two of my OSD's are down and won't start for any reason. Although Proxmox says Done! but they never start. Is this an issue with a new version of Ceph? Is it something where the nodes upgraded but the VM's ar ebased on old versions?