So I have a machine (Quad core w/8GB RAM, VM's on a RAID-10 w/4 7200 RPM drives formatted as EXT4) running ProxMox v1.5 and about 30 VM's.
Top shows:
top - 10:19:35 up 2 days, 1:31, 4 users, load average: 18.07, 18.91, 18.25
Tasks: 792 total, 4 running, 788 sleeping, 0 stopped, 0 zombie
Cpu(s): 5.8%us, 2.6%sy, 0.1%ni, 42.3%id, 48.5%wa, 0.2%hi, 0.5%si, 0.0%st
Mem: 7634544k total, 6231280k used, 1403264k free, 341432k buffers
Swap: 7340024k total, 2804k used, 7337220k free, 3037444k cached
iostat -d -x -m shows:
Linux 2.6.18-2-pve (server01) 03/11/10 _x86_64_
Device: rrqm/s wrqm/s r/s w/s rMB/s wMB/s avgrq-sz avgqu-sz await svctm %util
sda 42.70 549.77 181.54 133.10 6.55 2.68 60.04 12.94 41.13 2.76 86.82
sda1 42.70 549.77 181.54 133.10 6.55 2.68 60.04 12.94 41.13 2.76 86.82
sdb 0.12 3.03 0.33 0.73 0.01 0.01 40.65 0.00 4.10 1.52 0.16
sdb1 0.00 0.00 0.00 0.00 0.00 0.00 15.73 0.00 3.26 3.19 0.00
sdb2 0.11 3.03 0.33 0.73 0.01 0.01 40.67 0.00 4.10 1.52 0.16
sdc 15.70 1334.08 11.80 149.48 0.11 5.88 76.06 10.98 68.07 0.73 11.74
sdc1 15.70 1334.08 11.80 149.48 0.11 5.88 76.06 10.98 68.07 0.73 11.74
dm-0 0.00 0.00 0.00 0.00 0.00 0.00 8.00 0.00 5.83 1.69 0.00
dm-1 0.00 0.00 0.43 3.76 0.01 0.01 10.28 0.10 22.71 0.38 0.16
dm-2 0.00 0.00 0.00 0.00 0.00 0.00 7.99 0.00 5.35 1.85 0.00
dm-3 0.00 0.00 27.94 1505.52 0.11 5.88 8.00 21.73 14.15 0.08 11.74
dm-4 0.00 0.00 224.70 685.65 6.55 2.68 20.75 0.95 27.49 0.95 86.82
ps ax | awk '$3 ~ /^D/ { print $0 }' shows:
4134 ? D 0:04 imap
6527 ? D< 10:02 [kjournald2]
9000 ? D 0:00 local -t unix
13131 ? D 0:00 smtp -t unix -u -c
14843 ? D 11:08 qmgr -l -t fifo -u
25269 ? D 0:00 cleanup -z -t unix -u -c
26776 ? D 0:00 cleanup -z -t unix -u -c
27364 ? D 0:00 cleanup -z -t unix -u -c
27458 ? D 0:19 /usr/bin/updatedb.mlocate
28977 ? D 0:00 smtp -t unix -u -c
29467 ? D 0:00 /usr/sbin/apache2 -k start
So what this shows is that ~50% of my time is spent in IOWAIT, and my /dev/sda "drive" (which is the RAID-10 VM drive) is 86% - 100% utilized. The 2 processes that standout as using the most time are kjournald2 and qmgr.
The question is, is there some way to track down WHICH VM processes belong to?? I am running several mail server VM's and it'd be nice to know which qmgr is freaking out...
The other question is, is there a more fine-grain way to track down which PROCESSES are using the most disk I/O per second?
Top shows:
top - 10:19:35 up 2 days, 1:31, 4 users, load average: 18.07, 18.91, 18.25
Tasks: 792 total, 4 running, 788 sleeping, 0 stopped, 0 zombie
Cpu(s): 5.8%us, 2.6%sy, 0.1%ni, 42.3%id, 48.5%wa, 0.2%hi, 0.5%si, 0.0%st
Mem: 7634544k total, 6231280k used, 1403264k free, 341432k buffers
Swap: 7340024k total, 2804k used, 7337220k free, 3037444k cached
iostat -d -x -m shows:
Linux 2.6.18-2-pve (server01) 03/11/10 _x86_64_
Device: rrqm/s wrqm/s r/s w/s rMB/s wMB/s avgrq-sz avgqu-sz await svctm %util
sda 42.70 549.77 181.54 133.10 6.55 2.68 60.04 12.94 41.13 2.76 86.82
sda1 42.70 549.77 181.54 133.10 6.55 2.68 60.04 12.94 41.13 2.76 86.82
sdb 0.12 3.03 0.33 0.73 0.01 0.01 40.65 0.00 4.10 1.52 0.16
sdb1 0.00 0.00 0.00 0.00 0.00 0.00 15.73 0.00 3.26 3.19 0.00
sdb2 0.11 3.03 0.33 0.73 0.01 0.01 40.67 0.00 4.10 1.52 0.16
sdc 15.70 1334.08 11.80 149.48 0.11 5.88 76.06 10.98 68.07 0.73 11.74
sdc1 15.70 1334.08 11.80 149.48 0.11 5.88 76.06 10.98 68.07 0.73 11.74
dm-0 0.00 0.00 0.00 0.00 0.00 0.00 8.00 0.00 5.83 1.69 0.00
dm-1 0.00 0.00 0.43 3.76 0.01 0.01 10.28 0.10 22.71 0.38 0.16
dm-2 0.00 0.00 0.00 0.00 0.00 0.00 7.99 0.00 5.35 1.85 0.00
dm-3 0.00 0.00 27.94 1505.52 0.11 5.88 8.00 21.73 14.15 0.08 11.74
dm-4 0.00 0.00 224.70 685.65 6.55 2.68 20.75 0.95 27.49 0.95 86.82
ps ax | awk '$3 ~ /^D/ { print $0 }' shows:
4134 ? D 0:04 imap
6527 ? D< 10:02 [kjournald2]
9000 ? D 0:00 local -t unix
13131 ? D 0:00 smtp -t unix -u -c
14843 ? D 11:08 qmgr -l -t fifo -u
25269 ? D 0:00 cleanup -z -t unix -u -c
26776 ? D 0:00 cleanup -z -t unix -u -c
27364 ? D 0:00 cleanup -z -t unix -u -c
27458 ? D 0:19 /usr/bin/updatedb.mlocate
28977 ? D 0:00 smtp -t unix -u -c
29467 ? D 0:00 /usr/sbin/apache2 -k start
So what this shows is that ~50% of my time is spent in IOWAIT, and my /dev/sda "drive" (which is the RAID-10 VM drive) is 86% - 100% utilized. The 2 processes that standout as using the most time are kjournald2 and qmgr.
The question is, is there some way to track down WHICH VM processes belong to?? I am running several mail server VM's and it'd be nice to know which qmgr is freaking out...
The other question is, is there a more fine-grain way to track down which PROCESSES are using the most disk I/O per second?