additional monitoring between the two daemons.
authorRonnie Sahlberg <ronniesahlberg@gmail.com>
Tue, 9 Sep 2008 03:44:46 +0000 (13:44 +1000)
committerRonnie Sahlberg <ronniesahlberg@gmail.com>
Tue, 9 Sep 2008 03:44:46 +0000 (13:44 +1000)
commitcd69d292292eaab3aac0e9d9fc57cb621597c63c
tree47d19f322babe3cb644fd292d0ea97ae3adc15a8
parente26ce5140ed005725f8b7ac8ba23a180fd7d5337
additional monitoring between the two daemons.

we currently only monitor that the dameons are running by kill(0, pid)
and verifying the the domain socket between them is ok.

this is not sufficient since we can have a situation where the recovery
daemon is hung.

this new code monitors that the recovery daemon is operating.
if the recovery hangs, we log this and shut down the main daemon
client/ctdb_client.c
include/ctdb.h
include/ctdb_private.h
server/ctdb_control.c
server/ctdb_daemon.c
server/ctdb_recover.c
server/ctdb_recoverd.c
server/ctdb_tunables.c