Cluster iSCSI issues

troycarpenter

Renowned Member
Feb 28, 2012
Central Texas
I have a cluster that was originally set up with 5 nodes. Each node has a management interface, as well as a private network for corosync to communicate over. Over time I added three more nodes to the cluster and all seems to be working.

I have now configured iSCSI at the datacenter level as outlined in various sources, and set up LVM on top of it. What I have noticed is that the original 5 nodes in the cluster have no issues using the shared storage. What is weird is that the three new nodes are having all kinds of issues with it.

I've seen the following errors on one of the new nodes:
[431436.828980] blk_update_request: I/O error, dev dm-0, sector 4294967168
[431436.835805] blk_update_request: I/O error, dev dm-0, sector 4294967280
[431436.842690] blk_update_request: I/O error, dev dm-0, sector 0
[431436.848771] blk_update_request: I/O error, dev dm-0, sector 8

I'm also seeing this error on another new node (this one is by far the most common error I see):
[3284852.201673] connection1:0: detected conn error (1020)

Like I said, the original nodes in the cluster do not have the issue and appear to be working perfectly with the iSCSI configuration; only the three nodes I added at a later date are affected.

Any ideas?
 
I tried a couple of times to get LXC on iSCSI LVM working, but had issues. See the iscsi/napp-it wiki page, "lxc on iscsi" section.

On the nodes with issues, can you check /etc/lvm/archive/ to see if it is filling up?
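For reference, a quick way to check that (a sketch; the path is the stock Debian/Proxmox default, adjust if your lvm.conf archives elsewhere):

```shell
# Count archived LVM metadata backups; thousands of files here means the
# directory is filling up, which can slow down every LVM command.
ARCHIVE_DIR=${ARCHIVE_DIR:-/etc/lvm/archive}
echo "entries: $(ls -1 "$ARCHIVE_DIR" 2>/dev/null | wc -l)"
du -sh "$ARCHIVE_DIR" 2>/dev/null
```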
 
Sorry, been on PTO for the past 3 weeks. I'm not using LXC on iSCSI LVM; these are all KVM.

So far what I found is that the three new nodes all had the same iSCSI initiator ID. I've changed them so they are all unique (the initiator IDs were already unique for the other nodes) and that seems to have solved the connection issues. However, I'm still having trouble with the one node giving sector errors.

UPDATE: I made all the iSCSI initiators unique and rebooted the trouble nodes. Now all nodes seem to be working fine with iSCSI.
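A quick way to verify that across the whole cluster (a sketch; the node names below are placeholders for your own hosts):

```shell
# Collect each node's InitiatorName over SSH and flag duplicates.
# Any line printed by `uniq -d` is an IQN shared by two or more nodes.
for node in pve1 pve2 pve3; do
    ssh "$node" cat /etc/iscsi/initiatorname.iscsi
done | sort | uniq -d
```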
 
how do you check for 'iSCSI initiator ID' ?
 
cat /etc/iscsi/initiatorname.iscsi

I guess more properly called the InitiatorName.

I should have paid closer attention to the FreeNAS logs...they were telling me the problem all along.
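For anyone who cloned their nodes: a sketch of regenerating a unique name on the copy (the Debian-style IQN prefix below is an assumption; adjust it for your distro, or use `iscsi-iname` from the open-iscsi package if it's installed):

```shell
# Build a fresh random IQN so the cloned node no longer shares an
# InitiatorName with the node it was imaged from.
NEW_IQN="iqn.1993-08.org.debian:01:$(tr -dc 'a-f0-9' </dev/urandom | head -c 12)"
echo "InitiatorName=$NEW_IQN"
# Then write that line to /etc/iscsi/initiatorname.iscsi and restart
# iscsid (or reboot, as above) so the target sees the new name.
```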
 
So the InitiatorName= should be different on every node. Any idea how there were duplicates?
 
I'm guessing it's because the newer nodes were made from a disk image of one of the other nodes. That would explain how they had duplicate names...I'll have to add it to my list of things to modify when a new node is added to the cluster.
 
