Hi All
Sorry to bother you again, gents, but I have an issue that I have been researching on forums for the last 3 days trying to resolve. I had a power failure and all my servers went down at the same time. I ended up with a drbd setup like this:
root@jt1:~# /etc/init.d/drbd status
drbd driver loaded OK; device status:
version: 8.3.13 (api:88/proto:86-96)
GIT-hash: 83ca112086600faacab2f157bc5a9324f7bd7f77 build by root@sighted, 2012-10-09 12:47:51
m:res cs ro ds p mounted fstype
0:r0 WFConnection Secondary/Unknown Diskless/DUnknown C
root@jt2:~# /etc/init.d/drbd status
drbd driver loaded OK; device status:
version: 8.3.13 (api:88/proto:86-96)
GIT-hash: 83ca112086600faacab2f157bc5a9324f7bd7f77 build by root@sighted, 2012-10-09 12:47:51
m:res cs ro ds p mounted fstype
0:r0 Unconfigured
If I bring drbd up on the secondary node (jt2), then its output is exactly the same as jt1's. I have tried many things, and my next step would be to just recreate everything, since this is a test setup — but I would like to know how to fix this issue in case it happens again in production. Any help is appreciated. Here are my settings:
root@jt1:~# cat /etc/drbd.d/r0.res
resource r0 {
device /dev/drbd0;
disk /dev/sdb1;
meta-disk internal;
on jt1 {
address 10.10.10.1:7789;
}
on jt2 {
address 10.10.10.2:7789;
}
}
root@jt1:~# cat /etc/drbd.d/global_common.conf
global {
usage-count ask;
# minor-count dialog-refresh disable-ip-verification
}
common {
protocol C;
handlers {
split-brain "/usr/lib/drbd/notify-split-brain.sh root";
# The following 3 handlers were disabled due to #576511.
# Please check the DRBD manual and enable them, if they make sense in your setup.
# pri-on-incon-degr "/usr/lib/drbd/notify-pri-on-incon-degr.sh; /usr/lib/drbd/notify-emergency-reboot.sh; echo b > /proc/sysrq-trigger ; reboot -f";
# pri-lost-after-sb "/usr/lib/drbd/notify-pri-lost-after-sb.sh; /usr/lib/drbd/notify-emergency-reboot.sh; echo b > /proc/sysrq-trigger ; reboot -f";
# local-io-error "/usr/lib/drbd/notify-io-error.sh; /usr/lib/drbd/notify-emergency-shutdown.sh; echo o > /proc/sysrq-trigger ; halt -f";
# fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
# split-brain "/usr/lib/drbd/notify-split-brain.sh root";
# out-of-sync "/usr/lib/drbd/notify-out-of-sync.sh root";
# before-resync-target "/usr/lib/drbd/snapshot-resync-target-lvm.sh -p 15 -- -c 16k";
# after-resync-target /usr/lib/drbd/unsnapshot-resync-target-lvm.sh;
}
startup {
wfc-timeout 60;
degr-wfc-timeout 60;
become-primary-on both;
# wfc-timeout degr-wfc-timeout outdated-wfc-timeout wait-after-sb
}
disk {
on-io-error detach;
# on-io-error fencing use-bmbv no-disk-barrier no-disk-flushes
# no-disk-drain no-md-flushes max-bio-bvecs
}
net {
cram-hmac-alg sha1;
shared-secret "3KkanieH00RN13w@ts3JY!?";
allow-two-primaries;
after-sb-0pri discard-zero-changes;
after-sb-1pri discard-secondary;
after-sb-2pri disconnect;
# sndbuf-size rcvbuf-size timeout connect-int ping-int ping-timeout max-buffers
# max-epoch-size ko-count allow-two-primaries cram-hmac-alg shared-secret
# after-sb-0pri after-sb-1pri after-sb-2pri data-integrity-alg no-tcp-cork
}
syncer {
rate 200M;
verify-alg md5;
# rate after al-extents use-rle cpu-mask verify-alg csums-alg
}
}
root@jt1:~# vgscan
Reading all physical volumes. This may take a while...
Found volume group "drbdr0" using metadata type lvm2
Found volume group "pve" using metadata type lvm2
I am happy to send other details through upon request. Thanks again.
Sorry to bother you again, gents, but I have an issue that I have been researching on forums for the last 3 days trying to resolve. I had a power failure and all my servers went down at the same time. I ended up with a drbd setup like this:
root@jt1:~# /etc/init.d/drbd status
drbd driver loaded OK; device status:
version: 8.3.13 (api:88/proto:86-96)
GIT-hash: 83ca112086600faacab2f157bc5a9324f7bd7f77 build by root@sighted, 2012-10-09 12:47:51
m:res cs ro ds p mounted fstype
0:r0 WFConnection Secondary/Unknown Diskless/DUnknown C
root@jt2:~# /etc/init.d/drbd status
drbd driver loaded OK; device status:
version: 8.3.13 (api:88/proto:86-96)
GIT-hash: 83ca112086600faacab2f157bc5a9324f7bd7f77 build by root@sighted, 2012-10-09 12:47:51
m:res cs ro ds p mounted fstype
0:r0 Unconfigured
If I bring drbd up on the secondary node (jt2), then its output is exactly the same as jt1's. I have tried many things, and my next step would be to just recreate everything, since this is a test setup — but I would like to know how to fix this issue in case it happens again in production. Any help is appreciated. Here are my settings:
root@jt1:~# cat /etc/drbd.d/r0.res
resource r0 {
device /dev/drbd0;
disk /dev/sdb1;
meta-disk internal;
on jt1 {
address 10.10.10.1:7789;
}
on jt2 {
address 10.10.10.2:7789;
}
}
root@jt1:~# cat /etc/drbd.d/global_common.conf
global {
usage-count ask;
# minor-count dialog-refresh disable-ip-verification
}
common {
protocol C;
handlers {
split-brain "/usr/lib/drbd/notify-split-brain.sh root";
# The following 3 handlers were disabled due to #576511.
# Please check the DRBD manual and enable them, if they make sense in your setup.
# pri-on-incon-degr "/usr/lib/drbd/notify-pri-on-incon-degr.sh; /usr/lib/drbd/notify-emergency-reboot.sh; echo b > /proc/sysrq-trigger ; reboot -f";
# pri-lost-after-sb "/usr/lib/drbd/notify-pri-lost-after-sb.sh; /usr/lib/drbd/notify-emergency-reboot.sh; echo b > /proc/sysrq-trigger ; reboot -f";
# local-io-error "/usr/lib/drbd/notify-io-error.sh; /usr/lib/drbd/notify-emergency-shutdown.sh; echo o > /proc/sysrq-trigger ; halt -f";
# fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
# split-brain "/usr/lib/drbd/notify-split-brain.sh root";
# out-of-sync "/usr/lib/drbd/notify-out-of-sync.sh root";
# before-resync-target "/usr/lib/drbd/snapshot-resync-target-lvm.sh -p 15 -- -c 16k";
# after-resync-target /usr/lib/drbd/unsnapshot-resync-target-lvm.sh;
}
startup {
wfc-timeout 60;
degr-wfc-timeout 60;
become-primary-on both;
# wfc-timeout degr-wfc-timeout outdated-wfc-timeout wait-after-sb
}
disk {
on-io-error detach;
# on-io-error fencing use-bmbv no-disk-barrier no-disk-flushes
# no-disk-drain no-md-flushes max-bio-bvecs
}
net {
cram-hmac-alg sha1;
shared-secret "3KkanieH00RN13w@ts3JY!?";
allow-two-primaries;
after-sb-0pri discard-zero-changes;
after-sb-1pri discard-secondary;
after-sb-2pri disconnect;
# sndbuf-size rcvbuf-size timeout connect-int ping-int ping-timeout max-buffers
# max-epoch-size ko-count allow-two-primaries cram-hmac-alg shared-secret
# after-sb-0pri after-sb-1pri after-sb-2pri data-integrity-alg no-tcp-cork
}
syncer {
rate 200M;
verify-alg md5;
# rate after al-extents use-rle cpu-mask verify-alg csums-alg
}
}
root@jt1:~# vgscan
Reading all physical volumes. This may take a while...
Found volume group "drbdr0" using metadata type lvm2
Found volume group "pve" using metadata type lvm2
I am happy to send other details through upon request. Thanks again.