Hi,
since I upgraded to Proxmox 3.0 my Backups of my OpenVZ machines are making me huge problems.
The nightly Snapshots are breaking the entire server. It seems like there is (also) a problem with backing up to an NFS storage.
It starts the snapshot but hangs with the first machine. All VMs are unresponsible afterwards and I have to restart the server.
It ends up giving me Out of memory errors in the OpenVZ machines when it runs.
Here is the part of the log file:
Jun 2 01:00:01 proxmox /USR/SBIN/CRON[636264]: (root) CMD (vzdump 100 101 102 103 104 105 106 107 108 109 110 111 112 113 115 --quiet 1 --mode snapshot --mailto sven@voleatech.com --compress gzip --storage backup)
Jun 2 01:00:02 proxmox vzdump[636265]: <root@pam> starting task UPIDroxmox:0009B56C:0032F5AE:51AA7CF2:vzdump::root@pam:
Jun 2 01:00:02 proxmox vzdump[636268]: INFO: starting new backup job: vzdump 100 101 102 103 104 105 106 107 108 109 110 111 112 113 115 --quiet 1 --mailto sven@voleatech.com --mode snapshot --compress gzip --storage backup
Jun 2 01:00:07 proxmox pvestatd[3277]: WARNING: command 'df -P -B 1 /media/backup/eku' failed: got timeout (THIS IS THE NFS DRIVE)
Jun 2 01:00:16 proxmox vzdump[636268]: INFO: Starting Backup of VM 100 (openvz)
Jun 2 01:00:16 proxmox pvestatd[3277]: status update time (10.948 seconds)
Jun 2 01:00:16 proxmox kernel: EXT3-fs: barriers disabled
Jun 2 01:00:16 proxmox kernel: kjournald starting. Commit interval 5 seconds
Jun 2 01:00:16 proxmox kernel: EXT3-fs (dm-4): using internal journal
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51004945
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51004703
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51004942
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51128245
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51128110
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1261584
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1261583
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1261582
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1261581
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1261580
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1354024
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020207
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020205
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020202
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020201
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020198
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020181
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51019929
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51019928
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51019927
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51019882
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 147491
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 147489
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 147488
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 147480
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 147477
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50889059
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50889050
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50889024
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50889015
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50888997
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50995205
Jun 2 01:00:16 proxmox kernel: EXT3-fs (dm-4): 32 orphan inodes deleted
Jun 2 01:00:16 proxmox kernel: EXT3-fs (dm-4): recovery complete
Jun 2 01:00:16 proxmox kernel: EXT3-fs (dm-4): mounted filesystem with ordered data mode
Jun 2 01:05:25 proxmox kernel: INFO: task flush-253:3:1863 blocked for more than 120 seconds.
Jun 2 01:05:25 proxmox kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jun 2 01:05:25 proxmox kernel: flush-253:3 D ffff88042c42f3a0 0 1863 2 0 0x00000000
Jun 2 01:05:25 proxmox kernel: ffff88042c431980 0000000000000046 0000000000000000 ffffffff8140b5ec
Jun 2 01:05:25 proxmox kernel: ffff88042c4319b0 ffffffff811c92f0 ffff88042c431d18 0000000000000282
Jun 2 01:05:25 proxmox kernel: ffff88042c431930 0000000101fb484b ffff88042c431fd8 ffff88042c431fd8
Jun 2 01:05:25 proxmox kernel: Call Trace:
Jun 2 02:47:17 proxmox kernel: ct0 nfs: server NAS not responding, still trying
Jun 2 03:02:43 proxmox kernel: >>> 100 oom generation 0 starts
Jun 2 03:02:43 proxmox kernel: 754037 (smbd) invoked loc oom-killer: gfp 0x200d2 order 0 oomkilladj=0
Jun 2 03:02:43 proxmox kernel: UB-100-Mem-Info:
Jun 2 03:02:43 proxmox kernel: Node 0 DMA prio:41 portion:486 scan:0 a_anon:0 40726492ms i_anon:0 40726492ms a_file:0 40726492ms i_file:0 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Node 0 DMA/shadow prio:41 portion:0 scan:0 a_anon:0 40726492ms i_anon:0 40726492ms a_file:0 40726492ms i_file:0 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Node 0 DMA32 prio:41 portion:104316 scan:0 a_anon:0 40726492ms i_anon:0 40726492ms a_file:0 40726492ms i_file:0 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Node 0 DMA32/shadow prio:41 portion:0 scan:0 a_anon:0 40726492ms i_anon:0 40726492ms a_file:0 40726492ms i_file:0 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Node 0 Normal prio:16 portion:418542 scan:91 a_anon:175408 1882835ms i_anon:91217 622835ms a_file:16 22835ms i_file:12 22835ms unevictable:0 reclaim_stat: 64669 41461 7908 2463
Jun 2 03:02:43 proxmox kernel: Node 0 Normal/shadow prio:41 portion:0 scan:0 a_anon:0 40726492ms i_anon:291902 40726492ms a_file:0 40726492ms i_file:61910 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Node 1 Normal prio:17 portion:525230 scan:46 a_anon:278093 2543437ms i_anon:49914 383437ms a_file:27 23437ms i_file:0 23437ms unevictable:0 reclaim_stat: 49585 33368 6403 1983
Jun 2 03:02:43 proxmox kernel: Node 1 Normal/shadow prio:41 portion:0 scan:0 a_anon:0 40726492ms i_anon:231572 40726492ms a_file:0 40726492ms i_file:27364 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Total 1207435 anon:1118106 file:89329 a_anon:453501 i_anon:664605 a_file:43 i_file:89286 unevictable:0
Jun 2 03:02:43 proxmox kernel: RAM: 1048576 / 1048576 [1] SWAP: 524288 / 524288 [1] KMEM: 1852403712 / 2147483648 [0] DCSZ: 43947787 / 1073741824 [0] OOMG: 911107 / inf [0] Dirty 0 Wback 0 Dche 46475 Prnd 13755
Jun 2 03:02:43 proxmox kernel: Out of memory in UB: Kill process 3668 (named) score -992 or sacrifice child
.....
Any suggestions?
Thanks
Sven
since I upgraded to Proxmox 3.0 my Backups of my OpenVZ machines are making me huge problems.
The nightly Snapshots are breaking the entire server. It seems like there is (also) a problem with backing up to an NFS storage.
It starts the snapshot but hangs with the first machine. All VMs are unresponsible afterwards and I have to restart the server.
It ends up giving me Out of memory errors in the OpenVZ machines when it runs.
Here is the part of the log file:
Jun 2 01:00:01 proxmox /USR/SBIN/CRON[636264]: (root) CMD (vzdump 100 101 102 103 104 105 106 107 108 109 110 111 112 113 115 --quiet 1 --mode snapshot --mailto sven@voleatech.com --compress gzip --storage backup)
Jun 2 01:00:02 proxmox vzdump[636265]: <root@pam> starting task UPIDroxmox:0009B56C:0032F5AE:51AA7CF2:vzdump::root@pam:
Jun 2 01:00:02 proxmox vzdump[636268]: INFO: starting new backup job: vzdump 100 101 102 103 104 105 106 107 108 109 110 111 112 113 115 --quiet 1 --mailto sven@voleatech.com --mode snapshot --compress gzip --storage backup
Jun 2 01:00:07 proxmox pvestatd[3277]: WARNING: command 'df -P -B 1 /media/backup/eku' failed: got timeout (THIS IS THE NFS DRIVE)
Jun 2 01:00:16 proxmox vzdump[636268]: INFO: Starting Backup of VM 100 (openvz)
Jun 2 01:00:16 proxmox pvestatd[3277]: status update time (10.948 seconds)
Jun 2 01:00:16 proxmox kernel: EXT3-fs: barriers disabled
Jun 2 01:00:16 proxmox kernel: kjournald starting. Commit interval 5 seconds
Jun 2 01:00:16 proxmox kernel: EXT3-fs (dm-4): using internal journal
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51004945
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51004703
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51004942
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51128245
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51128110
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1261584
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1261583
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1261582
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1261581
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1261580
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 1354024
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020207
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020205
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020202
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020201
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020198
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51020181
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51019929
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51019928
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51019927
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 51019882
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 147491
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 147489
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 147488
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 147480
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 147477
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50889059
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50889050
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50889024
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50889015
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50888997
Jun 2 01:00:16 proxmox kernel: ext3_orphan_cleanup: deleting unreferenced inode 50995205
Jun 2 01:00:16 proxmox kernel: EXT3-fs (dm-4): 32 orphan inodes deleted
Jun 2 01:00:16 proxmox kernel: EXT3-fs (dm-4): recovery complete
Jun 2 01:00:16 proxmox kernel: EXT3-fs (dm-4): mounted filesystem with ordered data mode
Jun 2 01:05:25 proxmox kernel: INFO: task flush-253:3:1863 blocked for more than 120 seconds.
Jun 2 01:05:25 proxmox kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jun 2 01:05:25 proxmox kernel: flush-253:3 D ffff88042c42f3a0 0 1863 2 0 0x00000000
Jun 2 01:05:25 proxmox kernel: ffff88042c431980 0000000000000046 0000000000000000 ffffffff8140b5ec
Jun 2 01:05:25 proxmox kernel: ffff88042c4319b0 ffffffff811c92f0 ffff88042c431d18 0000000000000282
Jun 2 01:05:25 proxmox kernel: ffff88042c431930 0000000101fb484b ffff88042c431fd8 ffff88042c431fd8
Jun 2 01:05:25 proxmox kernel: Call Trace:
Jun 2 02:47:17 proxmox kernel: ct0 nfs: server NAS not responding, still trying
Jun 2 03:02:43 proxmox kernel: >>> 100 oom generation 0 starts
Jun 2 03:02:43 proxmox kernel: 754037 (smbd) invoked loc oom-killer: gfp 0x200d2 order 0 oomkilladj=0
Jun 2 03:02:43 proxmox kernel: UB-100-Mem-Info:
Jun 2 03:02:43 proxmox kernel: Node 0 DMA prio:41 portion:486 scan:0 a_anon:0 40726492ms i_anon:0 40726492ms a_file:0 40726492ms i_file:0 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Node 0 DMA/shadow prio:41 portion:0 scan:0 a_anon:0 40726492ms i_anon:0 40726492ms a_file:0 40726492ms i_file:0 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Node 0 DMA32 prio:41 portion:104316 scan:0 a_anon:0 40726492ms i_anon:0 40726492ms a_file:0 40726492ms i_file:0 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Node 0 DMA32/shadow prio:41 portion:0 scan:0 a_anon:0 40726492ms i_anon:0 40726492ms a_file:0 40726492ms i_file:0 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Node 0 Normal prio:16 portion:418542 scan:91 a_anon:175408 1882835ms i_anon:91217 622835ms a_file:16 22835ms i_file:12 22835ms unevictable:0 reclaim_stat: 64669 41461 7908 2463
Jun 2 03:02:43 proxmox kernel: Node 0 Normal/shadow prio:41 portion:0 scan:0 a_anon:0 40726492ms i_anon:291902 40726492ms a_file:0 40726492ms i_file:61910 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Node 1 Normal prio:17 portion:525230 scan:46 a_anon:278093 2543437ms i_anon:49914 383437ms a_file:27 23437ms i_file:0 23437ms unevictable:0 reclaim_stat: 49585 33368 6403 1983
Jun 2 03:02:43 proxmox kernel: Node 1 Normal/shadow prio:41 portion:0 scan:0 a_anon:0 40726492ms i_anon:231572 40726492ms a_file:0 40726492ms i_file:27364 40726492ms unevictable:0 reclaim_stat: 0 0 0 0
Jun 2 03:02:43 proxmox kernel: Total 1207435 anon:1118106 file:89329 a_anon:453501 i_anon:664605 a_file:43 i_file:89286 unevictable:0
Jun 2 03:02:43 proxmox kernel: RAM: 1048576 / 1048576 [1] SWAP: 524288 / 524288 [1] KMEM: 1852403712 / 2147483648 [0] DCSZ: 43947787 / 1073741824 [0] OOMG: 911107 / inf [0] Dirty 0 Wback 0 Dche 46475 Prnd 13755
Jun 2 03:02:43 proxmox kernel: Out of memory in UB: Kill process 3668 (named) score -992 or sacrifice child
.....
Any suggestions?
Thanks
Sven