Hello together,
last night I had the following Situation:
3 Hosts
6 VM's
2 VM's stopped suddenly during the backup
HOST1
VM1 Ubuntu 12.04 Crash
INFO: Starting Backup of VM 100 (qemu)
INFO: status = running
INFO: update VM 100: -lock backup
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: creating archive '/mnt/backup-server/iBackOffice/dump/vzdump-qemu-100-2014_03_18-00_00_02.vma.gz'
INFO: started backup task '290961f4-516c-491a-aebc-09df718a6bbc'
INFO: status: 0% (111804416/64424509440), sparse 0% (8003584), duration 3, 37/34 MB/s
INFO: status: 1% (759824384/64424509440), sparse 0% (411312128), duration 11, 81/30 MB/s
INFO: status: 2% (1303773184/64424509440), sparse 1% (742797312), duration 20, 60/23 MB/s
INFO: status: 3% (1995571200/64424509440), sparse 2% (1302581248), duration 26, 115/22 MB/s
INFO: status: 4% (2717777920/64424509440), sparse 2% (1872957440), duration 29, 240/50 MB/s
INFO: status: 5% (3382968320/64424509440), sparse 3% (2430328832), duration 33, 166/26 MB/s
INFO: status: 7% (4903403520/64424509440), sparse 5% (3815112704), duration 36, 506/45 MB/s
INFO: status: 10% (6994001920/64424509440), sparse 9% (5804183552), duration 39, 696/33 MB/s
INFO: status: 11% (7222788096/64424509440), sparse 9% (5839634432), duration 42, 76/64 MB/s
INFO: status: 12% (7739015168/64424509440), sparse 9% (6120386560), duration 49, 73/33 MB/s
INFO: status: 13% (8382578688/64424509440), sparse 10% (6471688192), duration 56, 91/41 MB/s
INFO: status: 14% (9058451456/64424509440), sparse 10% (6769160192), duration 67, 61/34 MB/s
INFO: status: 15% (9711779840/64424509440), sparse 10% (7005376512), duration 80, 50/32 MB/s
INFO: status: 16% (10312351744/64424509440), sparse 10% (7055798272), duration 102, 27/25 MB/s
INFO: status: 17% (10973741056/64424509440), sparse 11% (7105454080), duration 123, 31/29 MB/s
INFO: status: 18% (11623727104/64424509440), sparse 11% (7225573376), duration 139, 40/33 MB/s
INFO: status: 19% (12262309888/64424509440), sparse 11% (7266762752), duration 158, 33/31 MB/s
INFO: status: 20% (12912295936/64424509440), sparse 11% (7333105664), duration 177, 34/30 MB/s
INFO: status: 21% (13562281984/64424509440), sparse 11% (7641001984), duration 189, 54/28 MB/s
INFO: status: 22% (14185660416/64424509440), sparse 11% (7703629824), duration 213, 25/23 MB/s
INFO: status: 23% (14828044288/64424509440), sparse 11% (7721385984), duration 244, 20/20 MB/s
INFO: status: 24% (15531245568/64424509440), sparse 12% (7832801280), duration 269, 28/23 MB/s
INFO: status: 25% (16116613120/64424509440), sparse 12% (7889600512), duration 288, 30/27 MB/s
INFO: status: 26% (16751984640/64424509440), sparse 12% (8004296704), duration 307, 33/27 MB/s
INFO: status: 27% (17404198912/64424509440), sparse 12% (8074612736), duration 327, 32/29 MB/s
INFO: status: 28% (18039963648/64424509440), sparse 12% (8162406400), duration 350, 27/23 MB/s
INFO: status: 29% (18697551872/64424509440), sparse 12% (8200192000), duration 375, 26/24 MB/s
INFO: status: 30% (19362742272/64424509440), sparse 12% (8273248256), duration 395, 33/29 MB/s
INFO: status: 31% (19977338880/64424509440), sparse 12% (8354426880), duration 419, 25/22 MB/s
INFO: status: 32% (20628504576/64424509440), sparse 12% (8366137344), duration 442, 28/27 MB/s
INFO: status: 33% (21263286272/64424509440), sparse 12% (8370499584), duration 468, 24/24 MB/s
INFO: status: 34% (21962686464/64424509440), sparse 13% (8533970944), duration 490, 31/24 MB/s
INFO: status: 35% (22574661632/64424509440), sparse 13% (8550928384), duration 516, 23/22 MB/s
INFO: status: 36% (23201841152/64424509440), sparse 13% (8610680832), duration 540, 26/23 MB/s
INFO: status: 37% (23870832640/64424509440), sparse 13% (8781332480), duration 559, 35/26 MB/s
INFO: status: 38% (24494211072/64424509440), sparse 13% (8922251264), duration 576, 36/28 MB/s
INFO: status: 39% (25140396032/64424509440), sparse 13% (8964857856), duration 599, 28/26 MB/s
INFO: status: 40% (25805586432/64424509440), sparse 14% (9084706816), duration 618, 35/28 MB/s
INFO: status: 41% (26444169216/64424509440), sparse 14% (9247731712), duration 634, 39/29 MB/s
ERROR: VM 100 not running
INFO: aborting backup job
ERROR: VM 100 not running
ERROR: Backup of VM 100 failed - VM 100 not running
VM2 WIN2012R2 OK
VM3 CentOS 5.6 OK
HOST2
VM1 Ubuntu 12.04 Crash
INFO: Starting Backup of VM 100 (qemu)
INFO: status = running
INFO: update VM 100: -lock backup
INFO: backup mode: snapshot
INFO: ionice priority: 7
INFO: creating archive '/mnt/backup-server/WD/dump/vzdump-qemu-100-2014_03_18-03_00_01.vma.gz'
INFO: started backup task '28869d90-4118-4340-a69d-956eb522536a'
INFO: status: 0% (97845248/64424509440), sparse 0% (6754304), duration 3, 32/30 MB/s
INFO: status: 1% (702808064/64424509440), sparse 0% (189382656), duration 18, 40/28 MB/s
ERROR: VM 100 not running
INFO: aborting backup job
ERROR: VM 100 not running
ERROR: Backup of VM 100 failed - VM 100 not running
VM2 WIN2008R2 OK
HOST3
VM1 Ubuntu 12.04 OK
The Problem does't belongs to a specific VM, the backup in snapshot mode with good compression runs fine one night, the other night the vm stops suddenly during the backup.
Best regards Stephan