git.samba.org - sahlberg/ctdb.git/commit

author	Andrew Tridgell <tridge@samba.org>
	Thu, 20 Nov 2008 21:05:59 +0000 (08:05 +1100)
committer	Ronnie Sahlberg <ronniesahlberg@gmail.com>
	Thu, 20 Nov 2008 23:29:22 +0000 (10:29 +1100)
commit	ee4e962eba3452d83645baeebacff98318915af3
tree	880b143fca58ea513b6bd26fd5d4cad3c66c9854
parent	1a7ff4577d33f0dd470f7465c7d0e875c962f54e

fixed problem with looping ctdb recoveries

After a node failure, GPFS can get into a state where non-blocking
fcntl() locks take a long time to return. This leads to the ctdb
set_recmode test timing out, which is treated as a recovery failure
and triggers a new recovery. The resulting recovery loop can last a
long time.

The fix is to treat an fcntl() timeout as a success of this test. The
test checks that we can't lock the shared reclock file, so a timeout
is an acceptable form of success.
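
The decision described above can be sketched roughly as follows. This
is an illustration only, not the code from server/ctdb_recover.c: the
names probe_result, probe_reclock and recmode_test_passed are
hypothetical, the timeout is assumed to be detected by a caller-side
timer that is not shown, and failing the test on unexpected fcntl()
errors is an assumption.

```c
#include <assert.h>
#include <errno.h>
#include <fcntl.h>
#include <stdbool.h>
#include <unistd.h>

/* Hypothetical outcome of one lock probe (names are illustrative). */
enum probe_result {
    PROBE_LOCK_TAKEN, /* we got the lock, so nobody else holds it */
    PROBE_LOCK_HELD,  /* fcntl() refused: another process holds it */
    PROBE_TIMED_OUT,  /* no answer before the caller's deadline */
    PROBE_ERROR       /* any other fcntl() failure */
};

/* Try a non-blocking write lock on the whole file behind fd.
 * PROBE_TIMED_OUT is never returned here; it is assumed to be set by
 * whoever supervises this call with a timer. */
enum probe_result probe_reclock(int fd)
{
    struct flock lock = {0};

    assert(fd >= 0);
    lock.l_type = F_WRLCK;
    lock.l_whence = SEEK_SET;
    lock.l_start = 0;
    lock.l_len = 0; /* 0 means "to end of file", i.e. the whole file */

    if (fcntl(fd, F_SETLK, &lock) == 0) {
        /* We should not have been able to lock it; release again. */
        lock.l_type = F_UNLCK;
        (void)fcntl(fd, F_SETLK, &lock);
        return PROBE_LOCK_TAKEN;
    }
    if (errno == EACCES || errno == EAGAIN) {
        return PROBE_LOCK_HELD;
    }
    return PROBE_ERROR;
}

/* The test passes when we could NOT take the lock. A timeout also
 * means we did not get the lock, so it counts as a pass. Treating
 * PROBE_ERROR as a failure is an assumption of this sketch. */
bool recmode_test_passed(enum probe_result r)
{
    return r == PROBE_LOCK_HELD || r == PROBE_TIMED_OUT;
}
```

Keeping the pass/fail decision separate from the fcntl() call makes
the point of the fix visible: only actually obtaining the lock shows
that the reclock file is unprotected.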

server/ctdb_recover.c
