I have a serious problem on one of my nodes.
The RAID (3ware RAID-10) gives errors during the backup of one of the containers.
Yesterday I needed to reboot the node to be able to get into the container again.
While rebooting, Linux stopped at the disk check, saying fsck couldn't continue because of problems that needed to be fixed by hand.
Since the node had already been down for a while, I just let it boot, and everything runs fine.
The backups of that container give a lot of errors about 'couldn't stat' and missing files.
The resulting backup log is gigabytes in size.
How do I get this container safely off this node?
I guess that if I migrate it, it will give the same errors and I will be left with a container with a lot of missing files (or lose it completely?).
If I stop the node again and do something manually (what?) with fsck, will it corrupt the container or the files in it?
How do I save this container?
This is part of the 3w kernel errors:
Jan 3 08:14:22 node3 kernel: sd 0:0:0:0: [sda] Add. Sense: No additional sense information
Jan 3 08:14:25 node3 kernel: 3w-9xxx: scsi0: ERROR: (0x03:0x101A): Retry queued command:.
Jan 3 08:14:25 node3 kernel: sd 0:0:0:0: [sda] Sense Key : No Sense [deferred] [descriptor]
Jan 3 08:14:25 node3 kernel: Descriptor sense data with sense descriptors (in hex):
Jan 3 08:14:25 node3 kernel: 7f 00 00 00 00 00 00 28 00 00 00 00 00 00 00 00
Jan 3 08:14:25 node3 kernel: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Jan 3 08:14:25 node3 kernel: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Jan 3 08:14:25 node3 kernel: sd 0:0:0:0: [sda] Add. Sense: No additional sense information
Jan 3 08:14:29 node3 kernel: 3w-9xxx: scsi0: ERROR: (0x03:0x101A): Retry queued command:.
Jan 3 08:14:29 node3 kernel: sd 0:0:0:0: [sda] Sense Key : No Sense [deferred] [descriptor]
Jan 3 08:14:29 node3 kernel: Descriptor sense data with sense descriptors (in hex):
Jan 3 08:14:29 node3 kernel: 7f 00 00 00 00 00 00 28 00 00 00 00 00 00 00 00
Jan 3 08:14:29 node3 kernel: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Jan 3 08:14:29 node3 kernel: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Jan 3 08:14:29 node3 kernel: sd 0:0:0:0: [sda] Add. Sense: No additional sense information
Jan 3 08:22:52 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:22:52 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:22:52 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:22:52 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:22:52 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:22:52 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:22:52 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:22:52 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:22:52 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:22:52 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:22:53 node3 kernel: WARNING: at fs/buffer.c:1173 mark_buffer_dirty()
Jan 3 08:22:53 node3 kernel: Pid: 2960, comm: updatedb.mlocat Not tainted 2.6.24-7-pve #1
Jan 3 08:22:53 node3 kernel:
Jan 3 08:22:53 node3 kernel: Call Trace:
Jan 3 08:22:53 node3 kernel: [<ffffffff802fd117>] mark_buffer_dirty+0x87/0xa0
Jan 3 08:22:53 node3 kernel: [<ffffffff8033d7f7>] ext3_commit_super+0x57/0xa0
Jan 3 08:22:53 node3 kernel: [<ffffffff8033f512>] ext3_handle_error+0x52/0xd0
Jan 3 08:22:53 node3 kernel: [<ffffffff8033f696>] ext3_error+0x96/0xc0
Jan 3 08:22:53 node3 kernel: [<ffffffff802fca91>] __find_get_block+0xb1/0x1e0
Jan 3 08:22:53 node3 kernel: [<ffffffff802fc26c>] submit_bh+0xfc/0x130
Jan 3 08:22:53 node3 kernel: [<ffffffff80334a3a>] __ext3_get_inode_loc+0x31a/0x380
Jan 3 08:22:53 node3 kernel: [<ffffffff80334ad2>] ext3_read_inode+0x32/0x3c0
Jan 3 08:22:53 node3 kernel: [<ffffffff8033bc23>] ext3_lookup+0x143/0x170
Jan 3 08:22:53 node3 kernel: [<ffffffff802db675>] do_lookup+0x255/0x280
Jan 3 08:22:53 node3 kernel: [<ffffffff802dda50>] __link_path_walk+0x810/0x1380
Jan 3 08:22:53 node3 kernel: [<ffffffff802fc26c>] submit_bh+0xfc/0x130
Jan 3 08:22:53 node3 kernel: [<ffffffff802de665>] link_path_walk+0xa5/0x170
Jan 3 08:22:53 node3 kernel: [<ffffffff802decf5>] do_path_lookup+0xe5/0x380
Jan 3 08:22:53 node3 kernel: [<ffffffff802dd185>] getname+0xc5/0x180
Jan 3 08:22:53 node3 kernel: [<ffffffff802dfc6b>] __user_walk_fd+0x4b/0x80
Jan 3 08:22:53 node3 kernel: [<ffffffff802d64bc>] vfs_lstat_fd+0x2c/0x70
Jan 3 08:22:53 node3 kernel: [<ffffffff802d6527>] sys_newlstat+0x27/0x50
Jan 3 08:22:53 node3 kernel: [<ffffffff8020c69e>] system_call+0x7e/0x83
Jan 3 08:22:53 node3 kernel:
Jan 3 08:23:09 node3 kernel: printk: 93 messages suppressed.
Jan 3 08:23:09 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:23:09 node3 kernel: lost page write due to I/O error on dm-3
Jan 3 08:23:09 node3 kernel: lost page write due to I/O error on dm-3