I have a problem similar to this OP
http://forum.proxmox.com/threads/14515-Proxmox-VE-3-0-Clustering-setup-broke-permissions
but coming from different path:
cluster of 2 nodes, yesterday I had to (live) migrate all vms from one node to the other (why? see this thread)
I left the "empty" node stopped for maintenance
during the night on the other node, ALL backups, except the only openvz CT, failed for the same reason:
checking on the filesystem:
of course my cluster (not ha managed) sees only one node
# pvecm nodes
and
? why is this happening?
i guess answer is here
but I have no HA, and it's (for me) perfectly fine that 1 node is down...
what to do now, and when the other node will come up again?
I will have to do the reverse, moving all vms to the first node, and stopping the second...
Marco
http://forum.proxmox.com/threads/14515-Proxmox-VE-3-0-Clustering-setup-broke-permissions
but coming from different path:
cluster of 2 nodes, yesterday I had to (live) migrate all vms from one node to the other (why? see this thread)
I left the "empty" node stopped for maintenance
during the night on the other node, ALL backups, except the only openvz CT, failed for the same reason:
Code:
VMID NAME STATUS TIME SIZE FILENAME
102 VM 102 FAILED 00:00:00 command 'qm set 102 --lock backup' failed: exit code 2
108 openvzmwserve.proxmox OK 00:01:22 564MB /mnt/pve/iso_qnap/dump/vzdump-openvz-108-2013_07_12-01_00_02.tar.lzo
202 VM 202 FAILED 00:00:01 command 'qm set 202 --lock backup' failed: exit code 2
203 VM 203 FAILED 00:00:00 command 'qm set 203 --lock backup' failed: exit code 2
205 VM 205 FAILED 00:00:01 command 'qm set 205 --lock backup' failed: exit code 2
206 VM 206 FAILED 00:00:00 command 'qm set 206 --lock backup' failed: exit code 2
207 VM 207 FAILED 00:00:00 command 'qm set 207 --lock backup' failed: exit code 2
209 VM 209 FAILED 00:00:01 command 'qm set 209 --lock backup' failed: exit code 2
211 VM 211 FAILED 00:00:00 command 'qm set 211 --lock backup' failed: exit code 2
300 VM 300 FAILED 00:00:01 command 'qm set 300 --lock backup' failed: exit code 2
301 VM 301 FAILED 00:00:00 command 'qm set 301 --lock backup' failed: exit code 2
302 VM 302 FAILED 00:00:00 command 'qm set 302 --lock backup' failed: exit code 2
310 VM 310 FAILED 00:00:01 command 'qm set 310 --lock backup' failed: exit code 2
400 VM 400 FAILED 00:00:00 command 'qm set 400 --lock backup' failed: exit code 2
900 VM 900 FAILED 00:00:01 command 'qm set 900 --lock backup' failed: exit code 2
TOTAL 00:01:28 564MB
Detailed backup logs:
vzdump 108 202 203 204 205 206 207 209 211 300 301 302 310 400 101 900 102 --quiet 1 --mailto mail_address --mode snapshot --compress lzo --storage iso_qnap
102: Jul 12 01:00:02 INFO: Starting Backup of VM 102 (qemu)
102: Jul 12 01:00:02 INFO: status = running
102: Jul 12 01:00:02 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/102.conf.tmp.764986' - Permission denied
102: Jul 12 01:00:02 ERROR: Backup of VM 102 failed - command 'qm set 102 --lock backup' failed: exit code 2
108: Jul 12 01:00:02 INFO: Starting Backup of VM 108 (openvz)
108: Jul 12 01:00:02 INFO: CTID 108 exist mounted running
108: Jul 12 01:00:02 INFO: status = running
108: Jul 12 01:00:02 INFO: backup mode: snapshot
108: Jul 12 01:00:02 INFO: ionice priority: 7
108: Jul 12 01:00:02 INFO: creating lvm snapshot of /dev/mapper/pve-data ('/dev/pve/vzsnap-pve2-0')
108: Jul 12 01:00:03 INFO: Logical volume "vzsnap-pve2-0" created
108: Jul 12 01:00:03 INFO: creating archive '/mnt/pve/iso_qnap/dump/vzdump-openvz-108-2013_07_12-01_00_02.tar.lzo'
108: Jul 12 01:01:15 INFO: Total bytes written: 1015971840 (969MiB, 16MiB/s)
108: Jul 12 01:01:22 INFO: archive file size: 564MB
108: Jul 12 01:01:22 INFO: delete old backup '/mnt/pve/iso_qnap/dump/vzdump-openvz-108-2013_07_10-01_00_02.tar.lzo'
108: Jul 12 01:01:24 INFO: Finished Backup of VM 108 (00:01:22)
202: Jul 12 01:01:24 INFO: Starting Backup of VM 202 (qemu)
202: Jul 12 01:01:24 INFO: status = running
202: Jul 12 01:01:25 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/202.conf.tmp.765347' - Permission denied
202: Jul 12 01:01:25 ERROR: Backup of VM 202 failed - command 'qm set 202 --lock backup' failed: exit code 2
203: Jul 12 01:01:25 INFO: Starting Backup of VM 203 (qemu)
203: Jul 12 01:01:25 INFO: status = running
203: Jul 12 01:01:25 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/203.conf.tmp.765351' - Permission denied
203: Jul 12 01:01:25 ERROR: Backup of VM 203 failed - command 'qm set 203 --lock backup' failed: exit code 2
205: Jul 12 01:01:25 INFO: Starting Backup of VM 205 (qemu)
205: Jul 12 01:01:25 INFO: status = running
205: Jul 12 01:01:26 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/205.conf.tmp.765355' - Permission denied
205: Jul 12 01:01:26 ERROR: Backup of VM 205 failed - command 'qm set 205 --lock backup' failed: exit code 2
206: Jul 12 01:01:26 INFO: Starting Backup of VM 206 (qemu)
206: Jul 12 01:01:26 INFO: status = running
206: Jul 12 01:01:26 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/206.conf.tmp.765359' - Permission denied
206: Jul 12 01:01:26 ERROR: Backup of VM 206 failed - command 'qm set 206 --lock backup' failed: exit code 2
207: Jul 12 01:01:26 INFO: Starting Backup of VM 207 (qemu)
207: Jul 12 01:01:26 INFO: status = running
207: Jul 12 01:01:26 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/207.conf.tmp.765363' - Permission denied
207: Jul 12 01:01:26 ERROR: Backup of VM 207 failed - command 'qm set 207 --lock backup' failed: exit code 2
209: Jul 12 01:01:26 INFO: Starting Backup of VM 209 (qemu)
209: Jul 12 01:01:26 INFO: status = running
209: Jul 12 01:01:27 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/209.conf.tmp.765367' - Permission denied
209: Jul 12 01:01:27 ERROR: Backup of VM 209 failed - command 'qm set 209 --lock backup' failed: exit code 2
211: Jul 12 01:01:27 INFO: Starting Backup of VM 211 (qemu)
211: Jul 12 01:01:27 INFO: status = running
211: Jul 12 01:01:27 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/211.conf.tmp.765371' - Permission denied
211: Jul 12 01:01:27 ERROR: Backup of VM 211 failed - command 'qm set 211 --lock backup' failed: exit code 2
300: Jul 12 01:01:27 INFO: Starting Backup of VM 300 (qemu)
300: Jul 12 01:01:27 INFO: status = running
300: Jul 12 01:01:27 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/300.conf.tmp.765375' - Permission denied
300: Jul 12 01:01:28 ERROR: Backup of VM 300 failed - command 'qm set 300 --lock backup' failed: exit code 2
301: Jul 12 01:01:28 INFO: Starting Backup of VM 301 (qemu)
301: Jul 12 01:01:28 INFO: status = running
301: Jul 12 01:01:28 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/301.conf.tmp.765379' - Permission denied
301: Jul 12 01:01:28 ERROR: Backup of VM 301 failed - command 'qm set 301 --lock backup' failed: exit code 2
302: Jul 12 01:01:28 INFO: Starting Backup of VM 302 (qemu)
302: Jul 12 01:01:28 INFO: status = running
302: Jul 12 01:01:28 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/302.conf.tmp.765405' - Permission denied
302: Jul 12 01:01:28 ERROR: Backup of VM 302 failed - command 'qm set 302 --lock backup' failed: exit code 2
310: Jul 12 01:01:28 INFO: Starting Backup of VM 310 (qemu)
310: Jul 12 01:01:28 INFO: status = running
310: Jul 12 01:01:29 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/310.conf.tmp.765423' - Permission denied
310: Jul 12 01:01:29 ERROR: Backup of VM 310 failed - command 'qm set 310 --lock backup' failed: exit code 2
400: Jul 12 01:01:29 INFO: Starting Backup of VM 400 (qemu)
400: Jul 12 01:01:29 INFO: status = running
400: Jul 12 01:01:29 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/400.conf.tmp.765427' - Permission denied
400: Jul 12 01:01:29 ERROR: Backup of VM 400 failed - command 'qm set 400 --lock backup' failed: exit code 2
900: Jul 12 01:01:29 INFO: Starting Backup of VM 900 (qemu)
900: Jul 12 01:01:29 INFO: status = running
900: Jul 12 01:01:29 INFO: unable to open file '/etc/pve/nodes/pve2/qemu-server/900.conf.tmp.765431' - Permission denied
900: Jul 12 01:01:30 ERROR: Backup of VM 900 failed - command 'qm set 900 --lock backup' failed: exit code 2
checking on the filesystem:
Code:
ls -lah /etc/pve/nodes/pve2/qemu-server/
total 9.5K
dr-xr-x--- 2 root www-data 0 May 18 2012 .
dr-xr-x--- 2 root www-data 0 May 18 2012 ..
-r--r----- 1 root www-data 393 Jun 19 2012 100.conf
-r--r----- 1 root www-data 183 Jul 11 16:00 102.conf
-r--r----- 1 root www-data 202 Apr 17 11:28 200.conf
-r--r----- 1 root www-data 260 Jul 11 01:08 202.conf
-r--r----- 1 root www-data 262 Jul 11 01:14 203.conf
-r--r----- 1 root www-data 259 Jul 11 01:36 205.conf
-r--r----- 1 root www-data 312 Jul 11 01:53 206.conf
-r--r----- 1 root www-data 299 Jul 11 02:07 207.conf
-r--r----- 1 root www-data 342 Jul 11 02:25 209.conf
-r--r----- 1 root www-data 247 Jul 11 02:37 211.conf
-r--r----- 1 root www-data 180 Apr 12 16:55 216.conf
-r--r----- 1 root www-data 330 Jul 11 02:42 300.conf
-r--r----- 1 root www-data 281 Jul 11 03:02 301.conf
-r--r----- 1 root www-data 214 Jul 11 03:13 302.conf
-r--r----- 1 root www-data 333 Mar 27 02:41 306.conf
-r--r----- 1 root www-data 332 Mar 22 10:33 309.conf
-r--r----- 1 root www-data 288 Jul 11 03:41 310.conf
-r--r----- 1 root www-data 263 Jul 11 15:58 400.conf
-r--r----- 1 root www-data 224 Jul 11 15:57 900.conf
of course my cluster (not ha managed) sees only one node
# pvecm nodes
Code:
Node Sts Inc Joined Name
1 M 80 2013-07-05 16:15:25 pve2
2 X 88 pve1
and
Code:
# pvecm status
Version: 6.2.0
Config Version: 2
Cluster Name: pvecluster
Cluster Id: 48308
Cluster Member: Yes
Cluster Generation: 92
Membership state: Cluster-Member
Nodes: 1
Expected votes: 2
Total votes: 1
Node votes: 1
Quorum: 2 Activity blocked
Active subsystems: 5
Flags:
Ports Bound: 0
Node name: pve2
Node ID: 1
Multicast addresses: 239.192.188.113
Node addresses: xxx.xxx.xxx.xxx
? why is this happening?
i guess answer is here
Code:
Nodes: 1
Expected votes: 2
Total votes: 1
Node votes: 1
Quorum: 2 Activity blocked
but I have no HA, and it's (for me) perfectly fine that 1 node is down...
what to do now, and when the other node will come up again?
I will have to do the reverse, moving all vms to the first node, and stopping the second...
Code:
pveversion -v
pve-manager: 2.1-1 (pve-manager/2.1/f9b0f63a)
running kernel: 2.6.32-11-pve
proxmox-ve-2.6.32: 2.0-66
pve-kernel-2.6.32-11-pve: 2.6.32-66
lvm2: 2.02.95-1pve2
clvm: 2.02.95-1pve2
corosync-pve: 1.4.3-1
openais-pve: 1.1.4-2
libqb: 0.10.1-2
redhat-cluster-pve: 3.1.8-3
resource-agents-pve: 3.9.2-3
fence-agents-pve: 3.1.7-2
pve-cluster: 1.0-26
qemu-server: 2.0-39
pve-firmware: 1.0-15
libpve-common-perl: 1.0-27
libpve-access-control: 1.0-21
libpve-storage-perl: 2.0-18
vncterm: 1.0-2
vzctl: 3.0.30-2pve5
vzprocps: 2.0.11-2
vzquota: 3.0.12-3
pve-qemu-kvm: 1.0-9
ksm-control-daemon: 1.1-1
Marco
Last edited: