Hi Morbious, can you please post some more info?
- host HW configuration (CPU and memory type and amount, how many NVMe disks)
- host SW configuration (kernel versions on both machines, storage type and filesystem used), how many VMs
- VM configs of the VMs you get errors on (cpu number, memory, virtio devices, OS type, OS version)
- are there similar VMs on the host you don't get errors on?
- errors and context (what kind of errors you get in the VMs and during what, backups or restores?)
- do you use any of the mitigations I posted above?
HW:
4x CPU Intel(R) Xeon(R) CPU E7-8860 v3 @ 2.20GHz
1.5 TB RAM
6 NVME (ZFS)
8 HDD 1,6 TB (hardware RAID 5)
SW:
Proxmox 5.1 (working)
100 GB system
1 GB LVM /mnt/backup
rest thin-provision for guests machines
5 VM with heavy load ( 4x solr cluster on NVME/HDD,1 database on NVME/HDD)
example_machine: 8 cpu, 32 GB RAM, 32 HDD system, 750 GB data (HDD) [previously 500 GB NVME]
Proxmox 5.2/5.3 ( not working)
100 GB system
1 GB LVM /mnt/backup
rest thin-provision for guests machines
1 VM example_machine moved from Proxmox_5.1 (32 GB RAM, 8 cpu) without data, than create new data volume 750 GB (HDD) and fed with scp , during this operation I got:
[23370.114508] scsi_io_completion: 13 callbacks suppressed
[23370.114561] sd 2:0:0:1: [sdb] tag#1 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[23370.114566] sd 2:0:0:1: [sdb] tag#1 Sense Key : Aborted Command [current]
[23370.114569] sd 2:0:0:1: [sdb] tag#1 Add. Sense: I/O process terminated
[23370.114574] sd 2:0:0:1: [sdb] tag#1 CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
[23370.114578] blk_update_request: 13 callbacks suppressed
[23370.114580] blk_update_request: I/O error, dev sdb, sector 784633688
[23370.115664] Aborting journal on device sdb-8.
[23370.154477] sd 2:0:0:1: [sdb] tag#1 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[23370.154505] sd 2:0:0:1: [sdb] tag#1 Sense Key : Aborted Command [current]
[23370.154513] sd 2:0:0:1: [sdb] tag#1 Add. Sense: I/O process terminated
[23370.154521] sd 2:0:0:1: [sdb] tag#1 CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
[23370.154530] blk_update_request: I/O error, dev sdb, sector 784596992
[23370.155075] Buffer I/O error on dev sdb, logical block 98074624, lost sync page write
[23370.155597] JBD2: Error -5 detected when updating journal superblock for sdb-8.
[23372.010446] sd 2:0:0:1: [sdb] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[23372.010461] sd 2:0:0:1: [sdb] tag#0 Sense Key : Aborted Command [current]
[23372.010464] sd 2:0:0:1: [sdb] tag#0 Add. Sense: I/O process terminated
[23372.010468] sd 2:0:0:1: [sdb] tag#0 CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
[23372.010472] blk_update_request: I/O error, dev sdb, sector 0
[23372.010995] Buffer I/O error on dev sdb, logical block 0, lost sync page write
[23372.011532] EXT4-fs error (device sdb): ext4_journal_check_start:56: Detected aborted journal
[23372.012121] EXT4-fs (sdb): Remounting filesystem read-only
[23372.012646] EXT4-fs (sdb): previous I/O error to superblock detected
[23372.050557] sd 2:0:0:1: [sdb] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[23372.050586] sd 2:0:0:1: [sdb] tag#0 Sense Key : Aborted Command [current]
[23372.050589] sd 2:0:0:1: [sdb] tag#0 Add. Sense: I/O process terminated
[23372.050592] sd 2:0:0:1: [sdb] tag#0 CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
[23372.050595] blk_update_request: I/O error, dev sdb, sector 0
[23372.051131] Buffer I/O error on dev sdb, logical block 0, lost sync page write
And on Dom0:
[Wed Jan 2 22:36:52 2019] INFO: task kvm:11230 blocked for more than 120 seconds.
[Wed Jan 2 22:36:52 2019] Tainted: P O 4.15.18-9-pve #1
[Wed Jan 2 22:36:52 2019] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Wed Jan 2 22:36:52 2019] kvm D 0 11230 1 0x00000000
[Wed Jan 2 22:36:52 2019] Call Trace:
[Wed Jan 2 22:36:52 2019] __schedule+0x3e0/0x870
[Wed Jan 2 22:36:52 2019] schedule+0x36/0x80
[Wed Jan 2 22:36:52 2019] schedule_timeout+0x1d4/0x360
[Wed Jan 2 22:36:52 2019] io_schedule_timeout+0x1e/0x50
[Wed Jan 2 22:36:52 2019] wait_for_completion_io+0xb4/0x140
[Wed Jan 2 22:36:52 2019] ? wake_up_q+0x80/0x80
[Wed Jan 2 22:36:52 2019] submit_bio_wait+0x61/0x90
[Wed Jan 2 22:36:52 2019] blkdev_issue_flush+0x85/0xb0
[Wed Jan 2 22:36:52 2019] blkdev_fsync+0x35/0x50
[Wed Jan 2 22:36:52 2019] vfs_fsync_range+0x51/0xb0
[Wed Jan 2 22:36:52 2019] do_fsync+0x3d/0x70
[Wed Jan 2 22:36:52 2019] SyS_fdatasync+0x13/0x20
[Wed Jan 2 22:36:52 2019] do_syscall_64+0x73/0x130
[Wed Jan 2 22:36:52 2019] entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[Wed Jan 2 22:36:52 2019] RIP: 0033:0x7fbf31a1e60d
[Wed Jan 2 22:36:52 2019] RSP: 002b:00007fb6e4ffc5f0 EFLAGS: 00000293 ORIG_RAX: 000000000000004b
[Wed Jan 2 22:36:52 2019] RAX: ffffffffffffffda RBX: 00000000fffffffb RCX: 00007fbf31a1e60d
[Wed Jan 2 22:36:52 2019] RDX: 00007fbf23cc1e90 RSI: 000055bdd2f006e0 RDI: 0000000000000010
[Wed Jan 2 22:36:52 2019] RBP: 00007fb717308ac0 R08: 0000000000000000 R09: 00000000ffffffff
[Wed Jan 2 22:36:52 2019] R10: 00007fb6e4ffc620 R11: 0000000000000293 R12: 00007fbf23dbfd40
[Wed Jan 2 22:36:52 2019] R13: 00007fbf23cc1ef8 R14: 00007fb718f3d890 R15: 00007fbf4a531040
[Wed Jan 2 22:38:53 2019] INFO: task kvm:10826 blocked for more than 120 seconds.
[Wed Jan 2 22:38:53 2019] Tainted: P O 4.15.18-9-pve #1
[Wed Jan 2 22:38:53 2019] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Wed Jan 2 22:38:53 2019] kvm D 0 10826 1 0x00000002
[Wed Jan 2 22:38:53 2019] Call Trace:
[Wed Jan 2 22:38:53 2019] __schedule+0x3e0/0x870
[Wed Jan 2 22:38:53 2019] schedule+0x36/0x80
[Wed Jan 2 22:38:53 2019] io_schedule+0x16/0x40
[Wed Jan 2 22:38:53 2019] wait_on_page_bit_common+0xf3/0x190
[Wed Jan 2 22:38:53 2019] ? page_cache_tree_insert+0xe0/0xe0
[Wed Jan 2 22:38:53 2019] __filemap_fdatawait_range+0xfa/0x160
[Wed Jan 2 22:38:53 2019] file_write_and_wait_range+0x70/0xb0
[Wed Jan 2 22:38:53 2019] blkdev_fsync+0x1b/0x50
[Wed Jan 2 22:38:53 2019] vfs_fsync_range+0x51/0xb0
[Wed Jan 2 22:38:53 2019] do_fsync+0x3d/0x70
[Wed Jan 2 22:38:53 2019] SyS_fdatasync+0x13/0x20
[Wed Jan 2 22:38:53 2019] do_syscall_64+0x73/0x130
[Wed Jan 2 22:38:53 2019] entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[Wed Jan 2 22:38:53 2019] RIP: 0033:0x7fbf31a1e60d
[Wed Jan 2 22:38:53 2019] RSP: 002b:00007fb6e7ffc5f0 EFLAGS: 00000293 ORIG_RAX: 000000000000004b
[Wed Jan 2 22:38:53 2019] RAX: ffffffffffffffda RBX: 00000000fffffffb RCX: 00007fbf31a1e60d
[Wed Jan 2 22:38:53 2019] RDX: 00007fbf23cc1e90 RSI: 000055bdd2f006e0 RDI: 0000000000000011
[Wed Jan 2 22:38:53 2019] RBP: 00007fb718b6a200 R08: 0000000000000000 R09: 00000000ffffffff
[Wed Jan 2 22:38:53 2019] R10: 00007fb6e7ffc620 R11: 0000000000000293 R12: 00007fbf23dbfe00
[Wed Jan 2 22:38:53 2019] R13: 00007fbf23cc1ef8 R14: 00007fb717372160 R15: 00007fbf4a531040
[Wed Jan 2 22:38:53 2019] INFO: task kvm:11230 blocked for more than 120 seconds.
[Wed Jan 2 22:38:53 2019] Tainted: P O 4.15.18-9-pve #1
[Wed Jan 2 22:38:53 2019] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Wed Jan 2 22:38:53 2019] kvm D 0 11230 1 0x00000000
[Wed Jan 2 22:38:53 2019] Call Trace:
[Wed Jan 2 22:38:53 2019] __schedule+0x3e0/0x870
[Wed Jan 2 22:38:53 2019] schedule+0x36/0x80
[Wed Jan 2 22:38:53 2019] schedule_timeout+0x1d4/0x360
[Wed Jan 2 22:38:53 2019] io_schedule_timeout+0x1e/0x50
[Wed Jan 2 22:38:53 2019] wait_for_completion_io+0xb4/0x140
[Wed Jan 2 22:38:53 2019] ? wake_up_q+0x80/0x80
[Wed Jan 2 22:38:53 2019] submit_bio_wait+0x61/0x90
[Wed Jan 2 22:38:53 2019] blkdev_issue_flush+0x85/0xb0
[Wed Jan 2 22:38:53 2019] blkdev_fsync+0x35/0x50
[Wed Jan 2 22:38:53 2019] vfs_fsync_range+0x51/0xb0
[Wed Jan 2 22:38:53 2019] do_fsync+0x3d/0x70
[Wed Jan 2 22:38:53 2019] SyS_fdatasync+0x13/0x20
[Wed Jan 2 22:38:53 2019] do_syscall_64+0x73/0x130
[Wed Jan 2 22:38:53 2019] entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[Wed Jan 2 22:38:53 2019] RIP: 0033:0x7fbf31a1e60d
[Wed Jan 2 22:38:53 2019] RSP: 002b:00007fb6e4ffc5f0 EFLAGS: 00000293 ORIG_RAX: 000000000000004b
[Wed Jan 2 22:38:53 2019] RAX: ffffffffffffffda RBX: 00000000fffffffb RCX: 00007fbf31a1e60d
[Wed Jan 2 22:38:53 2019] RDX: 00007fbf23cc1e90 RSI: 000055bdd2f006e0 RDI: 0000000000000010
[Wed Jan 2 22:38:53 2019] RBP: 00007fb717308ac0 R08: 0000000000000000 R09: 00000000ffffffff
[Wed Jan 2 22:38:53 2019] R10: 00007fb6e4ffc620 R11: 0000000000000293 R12: 00007fbf23dbfd40
[Wed Jan 2 22:38:53 2019] R13: 00007fbf23cc1ef8 R14: 00007fb718f3d890 R15: 00007fbf4a531040
[Wed Jan 2 22:42:55 2019] INFO: task kvm:11108 blocked for more than 120 seconds.
[Wed Jan 2 22:42:55 2019] Tainted: P O 4.15.18-9-pve #1
[Wed Jan 2 22:42:55 2019] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Wed Jan 2 22:42:55 2019] kvm D 0 11108 1 0x00000002
[Wed Jan 2 22:42:55 2019] Call Trace:
[Wed Jan 2 22:42:55 2019] __schedule+0x3e0/0x870
[Wed Jan 2 22:42:55 2019] schedule+0x36/0x80
[Wed Jan 2 22:42:55 2019] io_schedule+0x16/0x40
[Wed Jan 2 22:42:55 2019] wait_on_page_bit_common+0xf3/0x190
[Wed Jan 2 22:42:55 2019] ? page_cache_tree_insert+0xe0/0xe0
[Wed Jan 2 22:42:55 2019] __filemap_fdatawait_range+0xfa/0x160
[Wed Jan 2 22:42:55 2019] file_write_and_wait_range+0x70/0xb0
[Wed Jan 2 22:42:55 2019] blkdev_fsync+0x1b/0x50
[Wed Jan 2 22:42:55 2019] vfs_fsync_range+0x51/0xb0
[Wed Jan 2 22:42:55 2019] do_fsync+0x3d/0x70
[Wed Jan 2 22:42:55 2019] SyS_fdatasync+0x13/0x20
[Wed Jan 2 22:42:55 2019] do_syscall_64+0x73/0x130
[Wed Jan 2 22:42:55 2019] entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[Wed Jan 2 22:42:55 2019] RIP: 0033:0x7fbf31a1e60d
[Wed Jan 2 22:42:55 2019] RSP: 002b:00007fb702ffb5f0 EFLAGS: 00000293 ORIG_RAX: 000000000000004b
[Wed Jan 2 22:42:55 2019] RAX: ffffffffffffffda RBX: 00000000fffffffb RCX: 00007fbf31a1e60d
[Wed Jan 2 22:42:55 2019] RDX: 00007fbf23cc1e90 RSI: 000055bdd2f006e0 RDI: 0000000000000011
[Wed Jan 2 22:42:55 2019] RBP: 00007fb7172bbac0 R08: 0000000000000000 R09: 00000000ffffffff
[Wed Jan 2 22:42:55 2019] R10: 00007fb702ffb620 R11: 0000000000000293 R12: 00007fbf23dbfe00
[Wed Jan 2 22:42:55 2019] R13: 00007fbf23cc1ef8 R14: 00007fb717372080 R15: 00007fbf4a531040
[Wed Jan 2 22:42:55 2019] INFO: task kvm:12220 blocked for more than 120 seconds.
[Wed Jan 2 22:42:55 2019] Tainted: P O 4.15.18-9-pve #1
[Wed Jan 2 22:42:55 2019] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Wed Jan 2 22:42:55 2019] kvm D 0 12220 1 0x00000000
[Wed Jan 2 22:42:55 2019] Call Trace:
[Wed Jan 2 22:42:55 2019] __schedule+0x3e0/0x870
[Wed Jan 2 22:42:55 2019] schedule+0x36/0x80
[Wed Jan 2 22:42:55 2019] io_schedule+0x16/0x40
[Wed Jan 2 22:42:55 2019] wait_on_page_bit_common+0xf3/0x190
[Wed Jan 2 22:42:55 2019] ? page_cache_tree_insert+0xe0/0xe0
[Wed Jan 2 22:42:55 2019] __filemap_fdatawait_range+0xfa/0x160
[Wed Jan 2 22:42:55 2019] file_write_and_wait_range+0x70/0xb0
[Wed Jan 2 22:42:55 2019] blkdev_fsync+0x1b/0x50
[Wed Jan 2 22:42:55 2019] vfs_fsync_range+0x51/0xb0
[Wed Jan 2 22:42:55 2019] do_fsync+0x3d/0x70
[Wed Jan 2 22:42:55 2019] SyS_fdatasync+0x13/0x20
[Wed Jan 2 22:42:55 2019] do_syscall_64+0x73/0x130
[Wed Jan 2 22:42:55 2019] entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[Wed Jan 2 22:42:55 2019] RIP: 0033:0x7fbf31a1e60d
[Wed Jan 2 22:42:55 2019] RSP: 002b:00007fb6deffc5f0 EFLAGS: 00000293 ORIG_RAX: 000000000000004b
[Wed Jan 2 22:42:55 2019] RAX: ffffffffffffffda RBX: 00000000fffffffb RCX: 00007fbf31a1e60d
[Wed Jan 2 22:42:55 2019] RDX: 00007fbf23cc1e90 RSI: 000055bdd2f006e0 RDI: 0000000000000010
[Wed Jan 2 22:42:55 2019] RBP: 00007fb716086980 R08: 0000000000000000 R09: 00000000ffffffff
[Wed Jan 2 22:42:55 2019] R10: 00007fb6deffc620 R11: 0000000000000293 R12: 00007fbf23dbfd40
[Wed Jan 2 22:42:55 2019] R13: 00007fbf23cc1ef8 R14: 00007fb718f3c550 R15: 00007fbf4a531040
[Wed Jan 2 22:44:56 2019] INFO: task kvm:11108 blocked for more than 120 seconds.
[Wed Jan 2 22:44:56 2019] Tainted: P O 4.15.18-9-pve #1
[Wed Jan 2 22:44:56 2019] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Wed Jan 2 22:44:56 2019] kvm D 0 11108 1 0x00000002
[Wed Jan 2 22:44:56 2019] Call Trace:
[Wed Jan 2 22:44:56 2019] __schedule+0x3e0/0x870
[Wed Jan 2 22:44:56 2019] schedule+0x36/0x80
[Wed Jan 2 22:44:56 2019] io_schedule+0x16/0x40
[Wed Jan 2 22:44:56 2019] wait_on_page_bit_common+0xf3/0x190
[Wed Jan 2 22:44:56 2019] ? page_cache_tree_insert+0xe0/0xe0
[Wed Jan 2 22:44:56 2019] __filemap_fdatawait_range+0xfa/0x160
[Wed Jan 2 22:44:56 2019] file_write_and_wait_range+0x70/0xb0
[Wed Jan 2 22:44:56 2019] blkdev_fsync+0x1b/0x50
[Wed Jan 2 22:44:56 2019] vfs_fsync_range+0x51/0xb0
[Wed Jan 2 22:44:56 2019] do_fsync+0x3d/0x70
[Wed Jan 2 22:44:56 2019] SyS_fdatasync+0x13/0x20
[Wed Jan 2 22:44:56 2019] do_syscall_64+0x73/0x130
[Wed Jan 2 22:44:56 2019] entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[Wed Jan 2 22:44:56 2019] RIP: 0033:0x7fbf31a1e60d
[Wed Jan 2 22:44:56 2019] RSP: 002b:00007fb702ffb5f0 EFLAGS: 00000293 ORIG_RAX: 000000000000004b
[Wed Jan 2 22:44:56 2019] RAX: ffffffffffffffda RBX: 00000000fffffffb RCX: 00007fbf31a1e60d
[Wed Jan 2 22:44:56 2019] RDX: 00007fbf23cc1e90 RSI: 000055bdd2f006e0 RDI: 0000000000000011
[Wed Jan 2 22:44:56 2019] RBP: 00007fb7172bbac0 R08: 0000000000000000 R09: 00000000ffffffff
[Wed Jan 2 22:44:56 2019] R10: 00007fb702ffb620 R11: 0000000000000293 R12: 00007fbf23dbfe00
[Wed Jan 2 22:44:56 2019] R13: 00007fbf23cc1ef8 R14: 00007fb717372080 R15: 00007fbf4a531040
[Wed Jan 2 22:44:56 2019] INFO: task kvm:12220 blocked for more than 120 seconds.
[Wed Jan 2 22:44:56 2019] Tainted: P O 4.15.18-9-pve #1
[Wed Jan 2 22:44:56 2019] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Wed Jan 2 22:44:56 2019] kvm D 0 12220 1 0x00000000
[Wed Jan 2 22:44:56 2019] Call Trace:
[Wed Jan 2 22:44:56 2019] __schedule+0x3e0/0x870
[Wed Jan 2 22:44:56 2019] schedule+0x36/0x80
[Wed Jan 2 22:44:56 2019] io_schedule+0x16/0x40
[Wed Jan 2 22:44:56 2019] wait_on_page_bit_common+0xf3/0x190
[Wed Jan 2 22:44:56 2019] ? page_cache_tree_insert+0xe0/0xe0
[Wed Jan 2 22:44:56 2019] __filemap_fdatawait_range+0xfa/0x160
[Wed Jan 2 22:44:56 2019] file_write_and_wait_range+0x70/0xb0
[Wed Jan 2 22:44:56 2019] blkdev_fsync+0x1b/0x50
[Wed Jan 2 22:44:56 2019] vfs_fsync_range+0x51/0xb0
[Wed Jan 2 22:44:56 2019] do_fsync+0x3d/0x70
[Wed Jan 2 22:44:56 2019] SyS_fdatasync+0x13/0x20
[Wed Jan 2 22:44:56 2019] do_syscall_64+0x73/0x130
[Wed Jan 2 22:44:56 2019] entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[Wed Jan 2 22:44:56 2019] RIP: 0033:0x7fbf31a1e60d
[Wed Jan 2 22:44:56 2019] RSP: 002b:00007fb6deffc5f0 EFLAGS: 00000293 ORIG_RAX: 000000000000004b
[Wed Jan 2 22:44:56 2019] RAX: ffffffffffffffda RBX: 00000000fffffffb RCX: 00007fbf31a1e60d
[Wed Jan 2 22:44:56 2019] RDX: 00007fbf23cc1e90 RSI: 000055bdd2f006e0 RDI: 0000000000000010
[Wed Jan 2 22:44:56 2019] RBP: 00007fb716086980 R08: 0000000000000000 R09: 00000000ffffffff
[Wed Jan 2 22:44:56 2019] R10: 00007fb6deffc620 R11: 0000000000000293 R12: 00007fbf23dbfd40
[Wed Jan 2 22:44:56 2019] R13: 00007fbf23cc1ef8 R14: 00007fb718f3c550 R15: 00007fbf4a531040
[Wed Jan 2 23:17:09 2019] INFO: task kvm:16152 blocked for more than 120 seconds.
[Wed Jan 2 23:17:09 2019] Tainted: P O 4.15.18-9-pve #1
[Wed Jan 2 23:17:09 2019] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Wed Jan 2 23:17:09 2019] kvm D 0 16152 1 0x00000000
[Wed Jan 2 23:17:09 2019] Call Trace:
[Wed Jan 2 23:17:09 2019] __schedule+0x3e0/0x870
[Wed Jan 2 23:17:09 2019] schedule+0x36/0x80
[Wed Jan 2 23:17:09 2019] schedule_timeout+0x1d4/0x360
[Wed Jan 2 23:17:09 2019] io_schedule_timeout+0x1e/0x50
[Wed Jan 2 23:17:09 2019] wait_for_completion_io+0xb4/0x140
[Wed Jan 2 23:17:09 2019] ? wake_up_q+0x80/0x80
[Wed Jan 2 23:17:09 2019] submit_bio_wait+0x61/0x90
[Wed Jan 2 23:17:09 2019] blkdev_issue_flush+0x85/0xb0
[Wed Jan 2 23:17:09 2019] blkdev_fsync+0x35/0x50
[Wed Jan 2 23:17:09 2019] vfs_fsync_range+0x51/0xb0
[Wed Jan 2 23:17:09 2019] do_fsync+0x3d/0x70
[Wed Jan 2 23:17:09 2019] SyS_fdatasync+0x13/0x20
[Wed Jan 2 23:17:09 2019] do_syscall_64+0x73/0x130
[Wed Jan 2 23:17:09 2019] entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[Wed Jan 2 23:17:09 2019] RIP: 0033:0x7fbf31a1e60d
[Wed Jan 2 23:17:09 2019] RSP: 002b:00007fb6e2ffc5f0 EFLAGS: 00000293 ORIG_RAX: 000000000000004b
[Wed Jan 2 23:17:09 2019] RAX: ffffffffffffffda RBX: 00000000fffffffb RCX: 00007fbf31a1e60d
[Wed Jan 2 23:17:09 2019] RDX: 00007fbf23cc1e90 RSI: 000055bdd2f006e0 RDI: 0000000000000010
[Wed Jan 2 23:17:09 2019] RBP: 00007fb716087b00 R08: 0000000000000000 R09: 00000000ffffffff
[Wed Jan 2 23:17:09 2019] R10: 00007fb6e2ffc620 R11: 0000000000000293 R12: 00007fbf23dbfd40
[Wed Jan 2 23:17:09 2019] R13: 00007fbf23cc1ef8 R14: 00007fb7173733c0 R15: 00007fbf4a531040
[Wed Jan 2 23:23:11 2019] INFO: task kvm:16934 blocked for more than 120 seconds.
[Wed Jan 2 23:23:11 2019] Tainted: P O 4.15.18-9-pve #1
[Wed Jan 2 23:23:11 2019] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Wed Jan 2 23:23:11 2019] kvm D 0 16934 1 0x00000000
[Wed Jan 2 23:23:11 2019] Call Trace:
[Wed Jan 2 23:23:11 2019] __schedule+0x3e0/0x870
[Wed Jan 2 23:23:11 2019] schedule+0x36/0x80
[Wed Jan 2 23:23:11 2019] schedule_timeout+0x1d4/0x360
[Wed Jan 2 23:23:11 2019] io_schedule_timeout+0x1e/0x50
[Wed Jan 2 23:23:11 2019] wait_for_completion_io+0xb4/0x140
[Wed Jan 2 23:23:11 2019] ? wake_up_q+0x80/0x80
[Wed Jan 2 23:23:11 2019] submit_bio_wait+0x61/0x90
[Wed Jan 2 23:23:11 2019] blkdev_issue_flush+0x85/0xb0
[Wed Jan 2 23:23:11 2019] blkdev_fsync+0x35/0x50
[Wed Jan 2 23:23:11 2019] vfs_fsync_range+0x51/0xb0
[Wed Jan 2 23:23:11 2019] do_fsync+0x3d/0x70
[Wed Jan 2 23:23:11 2019] SyS_fdatasync+0x13/0x20
[Wed Jan 2 23:23:11 2019] do_syscall_64+0x73/0x130
[Wed Jan 2 23:23:11 2019] entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[Wed Jan 2 23:23:11 2019] RIP: 0033:0x7fbf31a1e60d
[Wed Jan 2 23:23:11 2019] RSP: 002b:00007fb6efffc5f0 EFLAGS: 00000293 ORIG_RAX: 000000000000004b
[Wed Jan 2 23:23:11 2019] RAX: ffffffffffffffda RBX: 00000000fffffffb RCX: 00007fbf31a1e60d
[Wed Jan 2 23:23:11 2019] RDX: 00007fbf23cc1e90 RSI: 000055bdd2f006e0 RDI: 0000000000000010
[Wed Jan 2 23:23:11 2019] RBP: 00007fb719ba1600 R08: 0000000000000000 R09: 00000000ffffffff
[Wed Jan 2 23:23:11 2019] R10: 00007fb6efffc620 R11: 0000000000000293 R12: 00007fbf23dbfd40
[Wed Jan 2 23:23:11 2019] R13: 00007fbf23cc1ef8 R14: 00007fb7173732e0 R15: 00007fbf4a531040
[Wed Jan 2 23:29:14 2019] INFO: task kvm:17669 blocked for more than 120 seconds.
[Wed Jan 2 23:29:14 2019] Tainted: P O 4.15.18-9-pve #1
[Wed Jan 2 23:29:14 2019] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Wed Jan 2 23:29:14 2019] kvm D 0 17669 1 0x00000000
[Wed Jan 2 23:29:14 2019] Call Trace:
[Wed Jan 2 23:29:14 2019] __schedule+0x3e0/0x870
[Wed Jan 2 23:29:14 2019] schedule+0x36/0x80
[Wed Jan 2 23:29:14 2019] schedule_timeout+0x1d4/0x360
[Wed Jan 2 23:29:14 2019] io_schedule_timeout+0x1e/0x50
[Wed Jan 2 23:29:14 2019] wait_for_completion_io+0xb4/0x140
[Wed Jan 2 23:29:14 2019] ? wake_up_q+0x80/0x80
[Wed Jan 2 23:29:14 2019] submit_bio_wait+0x61/0x90
[Wed Jan 2 23:29:14 2019] blkdev_issue_flush+0x85/0xb0
[Wed Jan 2 23:29:14 2019] blkdev_fsync+0x35/0x50
[Wed Jan 2 23:29:14 2019] vfs_fsync_range+0x51/0xb0
[Wed Jan 2 23:29:14 2019] do_fsync+0x3d/0x70
[Wed Jan 2 23:29:14 2019] SyS_fdatasync+0x13/0x20
[Wed Jan 2 23:29:14 2019] do_syscall_64+0x73/0x130
[Wed Jan 2 23:29:14 2019] entry_SYSCALL_64_after_hwframe+0x3d/0xa2
[Wed Jan 2 23:29:14 2019] RIP: 0033:0x7fbf31a1e60d
[Wed Jan 2 23:29:14 2019] RSP: 002b:00007fb70bef75f0 EFLAGS: 00000293 ORIG_RAX: 000000000000004b
[Wed Jan 2 23:29:14 2019] RAX: ffffffffffffffda RBX: 00000000fffffffb RCX: 00007fbf31a1e60d
[Wed Jan 2 23:29:14 2019] RDX: 00007fbf23cc1e90 RSI: 000055bdd2f006e0 RDI: 0000000000000010
[Wed Jan 2 23:29:14 2019] RBP: 00007fb717383640 R08: 0000000000000000 R09: 00000000ffffffff
[Wed Jan 2 23:29:14 2019] R10: 00007fb70bef7620 R11: 0000000000000293 R12: 00007fbf23dbfd40
[Wed Jan 2 23:29:14 2019] R13: 00007fbf23cc1ef8 R14: 00007fb7173734a0 R15: 00007fbf4a531040
[Wed Jan 2 23:44:32 2019] perf: interrupt took too long (2504 > 2500), lowering kernel.perf_event_max_sample_rate to 79750
[Thu Jan 3 00:54:41 2019] perf: interrupt took too long (3150 > 3130), lowering kernel.perf_event_max_sample_rate to 63500
[Thu Jan 3 02:48:07 2019] perf: interrupt took too long (3939 > 3937), lowering kernel.perf_event_max_sample_rate to 50750
[Thu Jan 3 03:47:49 2019] device-mapper: thin: 253:5: reached low water mark for metadata device: sending event.
[cut]
[Thu Jan 3 04:24:45 2019] device-mapper: space map metadata: unable to allocate new metadata block
[Thu Jan 3 04:24:45 2019] device-mapper: thin: 253:5: metadata operation 'dm_thin_insert_block' failed: error = -28
[Thu Jan 3 04:24:45 2019] device-mapper: thin: 253:5: aborting current metadata transaction
[Thu Jan 3 04:24:45 2019] device-mapper: thin: 253:5: switching pool to read-only mode
[Thu Jan 3 04:24:45 2019] Buffer I/O error on dev dm-21, logical block 95061824, lost async page write
[Thu Jan 3 04:24:45 2019] Buffer I/O error on dev dm-21, logical block 95061825, lost async page write
[Thu Jan 3 04:24:45 2019] Buffer I/O error on dev dm-21, logical block 95061826, lost async page write
[Thu Jan 3 04:24:45 2019] Buffer I/O error on dev dm-21, logical block 95061827, lost async page write
[Thu Jan 3 04:24:45 2019] Buffer I/O error on dev dm-21, logical block 95061828, lost async page write
[Thu Jan 3 04:24:45 2019] Buffer I/O error on dev dm-21, logical block 95061829, lost async page write
[Thu Jan 3 04:24:45 2019] Buffer I/O error on dev dm-21, logical block 95061830, lost async page write
[Thu Jan 3 04:24:45 2019] Buffer I/O error on dev dm-21, logical block 95061831, lost async page write
[Thu Jan 3 04:24:45 2019] Buffer I/O error on dev dm-21, logical block 95061832, lost async page write
[Thu Jan 3 04:24:46 2019] Buffer I/O error on dev dm-21, logical block 95061833, lost async page write
Even if I tried to copy content from one partition to aother on Proxmox 5.2/5.3 I got errors and freezing.
Same job on the Proxmox 5.1 did to the end without any errors.