samba.git
15 years agosome platforms are very picky about the third argument passed to bind().
root [Thu, 4 Dec 2008 23:33:38 +0000 (10:33 +1100)]
some platforms are very picky about the third argument passed to bind().
and would complain if sa.family is AF_INET and the third argument is not exactly the size of a sockaddr_in.

We used to pass a union containing both a sockaddr_in and a sockaddr_in6  which would mean that on those platforms bind() would fail since the passed structure for AF_INET would be too big.

Thus we need to set and pass the appropriate size to bind. At the same time for thos eplatforms we can also set sin[6]_size to the expected size.
(bind() on those platforms were isurprisingly perfectly ok with sin_len was "too big")

(This used to be ctdb commit 5d3018c37179966f75183d9a98790eaaaf1d2cfc)

15 years agonew version 1.0.67
Ronnie Sahlberg [Thu, 4 Dec 2008 04:25:03 +0000 (15:25 +1100)]
new version 1.0.67

(This used to be ctdb commit 01633d30f764e8da42c4e5c4710b69cd379c9b39)

15 years agofix an incorrect path
root [Thu, 4 Dec 2008 04:03:40 +0000 (15:03 +1100)]
fix an incorrect path

(This used to be ctdb commit aeb09b08e060be6f833d84ba88abfb168cc5f50e)

15 years agoadd a description of the recovery-process
Ronnie Sahlberg [Thu, 4 Dec 2008 03:35:00 +0000 (14:35 +1100)]
add a description of the recovery-process

(This used to be ctdb commit 194abb41e1a0036956a9401efcae8b14ed66c532)

15 years agoprint the list of valid debug level literals when an invalid debug level
Ronnie Sahlberg [Tue, 2 Dec 2008 03:08:10 +0000 (14:08 +1100)]
print the list of valid debug level literals when an invalid debug level
is specified in 'ctdb setdebug'

(This used to be ctdb commit 979e78cfd96d74686af6f55f726c395a75275803)

15 years agoredesign how reloadnodes is implemented.
Ronnie Sahlberg [Tue, 2 Dec 2008 02:26:30 +0000 (13:26 +1100)]
redesign how reloadnodes is implemented.

modify the transport methods to allow to restart individual connections
and set up destructors properly.

only tear down/set-up tcp connections to nodes removed from the cluster
or nodes added to the cluster.
Leave tcp connections to unchanged nodes connected.

make "ctdb reloadnodes" explicitely cause a recovery of the cluster once
the files have been realoaded

(This used to be ctdb commit d1057ed6de7de9f2a64d8fa012c52647e89b515b)

15 years agodebuglevel is a signed int, not usnigned.
root [Fri, 28 Nov 2008 00:29:43 +0000 (11:29 +1100)]
debuglevel is a signed int, not usnigned.

(This used to be ctdb commit e577a276900854622f4e9da9d1ccd7b484d0d1ec)

15 years agomake it possible to delete an ip from all nodes at once using
Ronnie Sahlberg [Thu, 27 Nov 2008 22:52:26 +0000 (09:52 +1100)]
make it possible to delete an ip from all nodes at once using
"ctdb delip x.x.x.x -n all"

This is not as straightforward as one might think since during the
delete process we don not want the ip to be bouncing from one node to
another as node by node deletes it.

Thus we first delete the ip from all connected nodes which are not
currently hosting it.

After this we delete the ip from the node which is hosting it.

(This used to be ctdb commit bbd46f341e9aa32d8dbd49f7a9a07cb3f1f92ea3)

15 years agoinew version 1.0.66
Ronnie Sahlberg [Mon, 24 Nov 2008 08:06:02 +0000 (19:06 +1100)]
inew version 1.0.66
ddwq

(This used to be ctdb commit 499a01fece2a5f24f1b2943cf3dc6e9a3a8ca3b5)

15 years agoallow to change the recmaster even the database is not frozen
Ronnie Sahlberg [Fri, 21 Nov 2008 05:24:12 +0000 (16:24 +1100)]
allow to change the recmaster even the database is not frozen

(This used to be ctdb commit 03e2e436db5cfd29a56d13f5d2101e42389bfc94)

15 years agoremove two variables no longer used from the example sysconfig file
Ronnie Sahlberg [Fri, 21 Nov 2008 00:30:32 +0000 (11:30 +1100)]
remove two variables no longer used from the example sysconfig file

(This used to be ctdb commit dab594caf0bfc23c75c8cd2aa75479c7d2e79f1c)

15 years agofixed problem with looping ctdb recoveries
Andrew Tridgell [Thu, 20 Nov 2008 21:05:59 +0000 (08:05 +1100)]
fixed problem with looping ctdb recoveries

After a node failure, GPFS can get into a state where non-blocking
fcntl() locks can take a long time. This means to the ctdb set_recmode
test timing out, which leads to a recovery failure, and a new
recovery. The recovery loop can last a long time.

The fix is to consider a fcntl timeout as a success of this test. The
test is to see that we can't lock the shared reclock file, so a
timeout is fine for a success.

(This used to be ctdb commit 6579a6a2a7161214adedf0f67dce62f4a4ad1afe)

15 years agoMerge commit 'ronnie/master'
Andrew Tridgell [Thu, 20 Nov 2008 10:23:26 +0000 (21:23 +1100)]
Merge commit 'ronnie/master'

(This used to be ctdb commit fe6ddf7992ca3e72a26dbac6666e0f6270da611f)

15 years agodont override/change CTDB_BASE if it is already set by the shell
Ronnie Sahlberg [Thu, 20 Nov 2008 05:39:56 +0000 (16:39 +1100)]
dont override/change CTDB_BASE if it is already set by the shell

(This used to be ctdb commit 0a6f9326cb99f14b5c9edd0d8854d8229df49910)

15 years agoKeepalive packets were only sent every KeepaliveInterval if the socket
Ronnie Sahlberg [Thu, 20 Nov 2008 02:35:08 +0000 (13:35 +1100)]
Keepalive packets were only sent every KeepaliveInterval if the socket
had been completely idle during that interval.
If we had been sending other packets such as Messages, Calls or Controls
there wouldnt be any need for an explicit keepalive and thus we didnt
send one.

This does make it somewhat awkward when analyzing traces since it is
non-intuitive when keepalives are sent and when they are not sent.

Change the keepalive logic to always send a keepalive regardless of
whether the link is idle or not.

(This used to be ctdb commit 7a18f33ec7512100dd067c65f0470889ff8fd591)

15 years agoreqrite the handling of flag updates across the cluster to eliminate a
Ronnie Sahlberg [Wed, 19 Nov 2008 03:43:46 +0000 (14:43 +1100)]
reqrite the handling of flag updates across the cluster to eliminate a
race between the ctdb tool and the recovery daemon both at once
trying to push flag changes across the cluster.

(This used to be ctdb commit a9a1156ea4e10483a4bf4265b8e9203f0af033aa)

15 years agonew version 1.0.65
Ronnie Sahlberg [Wed, 12 Nov 2008 23:55:20 +0000 (10:55 +1100)]
new version 1.0.65

update the example sysconfig file. the default log level is 2, not 0

(This used to be ctdb commit 1f25958dc739677a487fa496fbeffcda7a0f2204)

15 years agoadd a CTDB_SOCKET variable that can be used to override the default
Ronnie Sahlberg [Tue, 11 Nov 2008 03:49:30 +0000 (14:49 +1100)]
add a CTDB_SOCKET variable that can be used to override the default
/tmp/ctdb.socket

(This used to be ctdb commit b75e2263c565c21ecbbd98fbd2c10787e467bf5c)

15 years agowe actually need a ctdb_db variable
Ronnie Sahlberg [Mon, 3 Nov 2008 10:54:52 +0000 (21:54 +1100)]
we actually need a ctdb_db variable

(This used to be ctdb commit aba984f1b85f5a2d370b093061cf15843ee53758)

15 years agolatency is measured in us, not ms
Ronnie Sahlberg [Thu, 30 Oct 2008 02:34:10 +0000 (13:34 +1100)]
latency is measured in us, not ms

use an explicit ctdb_db variable instead of dereferencing state

(This used to be ctdb commit 8c6a02fb423a8cbcbfc706767e3d353cd48073c3)

15 years agoadd control and logging of very high latencies.
Ronnie Sahlberg [Thu, 30 Oct 2008 01:49:53 +0000 (12:49 +1100)]
add control and logging of very high latencies.

log the type of operation and the database name for all latencies higher
than a treshold

(This used to be ctdb commit 1d581dcd507e8e13d7ae085ff4d6a9f3e2aaeba5)

15 years agonew version 1.0.64
Ronnie Sahlberg [Wed, 22 Oct 2008 00:06:18 +0000 (11:06 +1100)]
new version 1.0.64

(This used to be ctdb commit 1a7ff4577d33f0dd470f7465c7d0e875c962f54e)

15 years agoadd a context and a timed event so that once we have been in recovery
Ronnie Sahlberg [Wed, 22 Oct 2008 00:04:41 +0000 (11:04 +1100)]
add a context and a timed event so that once we have been in recovery
mode for too long we drop all public ip addresses

(This used to be ctdb commit 403c68f96e1380dd07217c688de2730464f77ea0)

15 years agonew version 1.0.63
Ronnie Sahlberg [Sun, 19 Oct 2008 22:47:54 +0000 (09:47 +1100)]
new version 1.0.63

(This used to be ctdb commit 59a879626a6a55fb6a43cadf5338c1aa6afe96d1)

15 years agodont log "running periodic cleanup" ...
Ronnie Sahlberg [Sun, 19 Oct 2008 22:45:15 +0000 (09:45 +1100)]
dont log "running periodic cleanup" ...

(This used to be ctdb commit e25ea88ea4f270ba65ed5fdacd693f1248f343c0)

15 years agonull out the pointer before we reload the nodes file
Ronnie Sahlberg [Fri, 17 Oct 2008 10:38:42 +0000 (21:38 +1100)]
null out the pointer before we reload the nodes file

(This used to be ctdb commit 4b0f32047e8bece0a052bdbe2209afe91b7e8ce3)

15 years agowhen we reload the nodes file, we may need to reload the nodes file
Ronnie Sahlberg [Fri, 17 Oct 2008 10:18:06 +0000 (21:18 +1100)]
when we reload the nodes file,   we may need to reload the nodes file
inside the recovery daemon as well.

(This used to be ctdb commit 82fd2b6b5cd8e988c38fa6b74121a048757bdeef)

15 years agomake it possible to set the script log level in CTDB sysconfig
Ronnie Sahlberg [Thu, 16 Oct 2008 22:02:03 +0000 (09:02 +1100)]
make it possible to set the script log level in CTDB sysconfig

(This used to be ctdb commit 06097b88709ced09d1f9f869eed9a54e6d2fedbf)

15 years agospecify a "script log level" on the commandline to set under which log
Ronnie Sahlberg [Thu, 16 Oct 2008 20:56:12 +0000 (07:56 +1100)]
specify a "script log level" on the commandline to set under which log
level any/all output from eventscripts will be logged as

(This used to be ctdb commit cdc79d4f22f1a6aec5c34115969421f93663932a)

15 years agonew version 1.0.62
Ronnie Sahlberg [Thu, 16 Oct 2008 06:59:55 +0000 (17:59 +1100)]
new version 1.0.62

(This used to be ctdb commit 49431e799ba7f7c78f596fdf896316a2e22c745e)

15 years agoallow multiple eventscripts using the same prefix.
Ronnie Sahlberg [Thu, 16 Oct 2008 06:57:50 +0000 (17:57 +1100)]
allow multiple eventscripts using the same prefix.
this eases the pain for users that use out of tree eventscripts

(This used to be ctdb commit 8313dfb6fc5404cd2d065af6620412f8664ada11)

15 years agoMerge commit 'ronnie/master'
Andrew Tridgell [Thu, 16 Oct 2008 01:58:25 +0000 (12:58 +1100)]
Merge commit 'ronnie/master'

(This used to be ctdb commit 5403ed6dcfdfc101b05b43f83002e720d81b4e38)

15 years agonew version 1.0.61
Ronnie Sahlberg [Wed, 15 Oct 2008 05:40:44 +0000 (16:40 +1100)]
new version 1.0.61

(This used to be ctdb commit 0098efd4443038f2d902e3a7c3640e63f06be7d1)

15 years agoinstall the new multipath monitoring event script
Ronnie Sahlberg [Wed, 15 Oct 2008 05:29:09 +0000 (16:29 +1100)]
install the new multipath monitoring event script

(This used to be ctdb commit 3b8d49bf58f4145cdca08565f06cd43fd36991e1)

15 years agoadd an eventscript to monitor that the multipath devices are healthy
Ronnie Sahlberg [Wed, 15 Oct 2008 05:27:33 +0000 (16:27 +1100)]
add an eventscript to monitor that the multipath devices are healthy

(This used to be ctdb commit f9779d3a237db59d7fdad92185ac7e42715466e6)

15 years agowe must also check the status returned from the get tickles control to
Ronnie Sahlberg [Tue, 14 Oct 2008 21:33:37 +0000 (08:33 +1100)]
we must also check the status returned from the get tickles control to
determine whether it was successful or not

(This used to be ctdb commit 6fb2f8a36239e5902e27cf10213f85faf216d6f1)

15 years agolower the loglevel for the informational message that a TCP_ADD opeation
Ronnie Sahlberg [Tue, 14 Oct 2008 16:02:09 +0000 (03:02 +1100)]
lower the loglevel for the informational message that a TCP_ADD opeation
described an ip address not known to be a public address.

This could happen if someone for genuine reasons accesses a share
through a static ip address.
It can also happen if non homogenous public address configurations are
used and when a tcp description is pushed out to a different node that
does not server/know the specific ip address.

(This used to be ctdb commit 9b1d089c99413f3681440f3cf33c293d118c9108)

15 years agochange ip route add to route add -net since this works more reliably
Ronnie Sahlberg [Tue, 14 Oct 2008 14:49:19 +0000 (01:49 +1100)]
change ip route add to route add -net  since this works more reliably

update the makefile and rpm to install 99.routing

(This used to be ctdb commit c0b3bd8a3fa580dca5afa97c8012fccb25231373)

15 years agonew version 1.0.60
Ronnie Sahlberg [Tue, 14 Oct 2008 14:32:46 +0000 (01:32 +1100)]
new version 1.0.60

(This used to be ctdb commit 77ed0d71b1fb8d06d70d01a8e8f9eb04ffe7f02f)

15 years agoverify that the nodes we try to ban/unban are operational and print an
Ronnie Sahlberg [Tue, 14 Oct 2008 14:23:57 +0000 (01:23 +1100)]
verify that the nodes we try to ban/unban are operational and print an
error to the user othervise.

(This used to be ctdb commit 5747dd2d80af29d6252afb6aeb3e66328ee20de5)

15 years agoRevert "from Mathieu Parent <math.parent@gmail.com>"
Ronnie Sahlberg [Tue, 14 Oct 2008 14:08:29 +0000 (01:08 +1100)]
Revert "from Mathieu Parent <math.parent@gmail.com>"

This reverts commit dc9cd4779db4a89697731e4cf415be51067a07c1.

Conflicts:

(This used to be ctdb commit d13da2e8fe2fab619540525d98a5502a23ab7d20)

15 years agoupdate the client side of getnodemap and getpublicips controls to
Ronnie Sahlberg [Tue, 14 Oct 2008 13:24:44 +0000 (00:24 +1100)]
update the client side of getnodemap and getpublicips controls to
fallback to the old-style ipv4-only controls if the new-style ipv4/ipv6
control fails.

this allows a 1.0.59+ (ipv4/ipv6) ctdb daemon being recmaster  to be
compatible with
pre-1.0.59  versions of ctdb that are ipv4 only.

(This used to be ctdb commit 8e912abc2c68f5fe7b06c600ba6fec1a6900127c)

15 years agoupdate TAKEIP/RELEASEIP/GETPUBLICIP/GETNODEMAP controls so we retain an
Ronnie Sahlberg [Mon, 13 Oct 2008 23:40:29 +0000 (10:40 +1100)]
update TAKEIP/RELEASEIP/GETPUBLICIP/GETNODEMAP controls so we retain an
older ipv4-only version of these controls.

We need this so that we are backwardcompatible with old versions of ctdb
and so that we can interoperate with a ipv4-only recmaster during a
rolling upgrade.

(This used to be ctdb commit 6b76c520f97127099bd9fbaa0fa7af1c61947fb7)

15 years agofrom Mathieu Parent <math.parent@gmail.com>
Ronnie Sahlberg [Sun, 12 Oct 2008 21:27:33 +0000 (08:27 +1100)]
from Mathieu Parent <math.parent@gmail.com>
Hi,

I have attached a patch necessary as debian log dir (/var/log) is not
a subdir of VARDIR (/var/lib on rpm systems, /var/lib/ctdb on debian).
As I don't know much about autotools and friends, this patch may be
hacky.

This is part of the process to minimize diff between distributions.

(This used to be ctdb commit dc9cd4779db4a89697731e4cf415be51067a07c1)

15 years agoFrom Mathieu Parent
Ronnie Sahlberg [Sun, 12 Oct 2008 21:21:20 +0000 (08:21 +1100)]
From Mathieu Parent
patch to make debian systems log the package versions in
ctdb_diagnostics

(This used to be ctdb commit 07dd4c7d2e8ba10f53d4cf2644fc4b7b8647e286)

15 years agoadded some more gpfs commands per-filesystem
Andrew Tridgell [Thu, 9 Oct 2008 07:45:12 +0000 (18:45 +1100)]
added some more gpfs commands per-filesystem

(This used to be ctdb commit a5d5aa455c7f7eb93d3fa6f403d5b8e0b795109d)

15 years agoskip empty lines in the public addresses file, not skip all non-empty
Ronnie Sahlberg [Tue, 7 Oct 2008 08:34:34 +0000 (19:34 +1100)]
skip empty lines in the public addresses file,   not skip all non-empty
lines

(This used to be ctdb commit dc108adada33bb713f71a2859eda3b439ed0cd1a)

15 years agofrom Michael Adams : allow #-style comments in the nodes and public
Ronnie Sahlberg [Tue, 7 Oct 2008 08:25:10 +0000 (19:25 +1100)]
from Michael Adams : allow #-style comments in the nodes and public
addresses file

(This used to be ctdb commit 5f96b33a379c80ed8a39de1ee41f254cf48733f9)

15 years agonew version 1.0.59
Ronnie Sahlberg [Tue, 7 Oct 2008 07:23:12 +0000 (18:23 +1100)]
new version   1.0.59

(This used to be ctdb commit 8e504fc8c01f750d1cb5d2289ef12c423daf1ad4)

15 years agoremove an unused variable
Ronnie Sahlberg [Tue, 7 Oct 2008 07:14:44 +0000 (18:14 +1100)]
remove an unused variable

(This used to be ctdb commit 4237bd3753dcb024c17461e974414bef1b609416)

15 years agoWhen we reload the nodes file
Ronnie Sahlberg [Tue, 7 Oct 2008 07:12:54 +0000 (18:12 +1100)]
When we reload the nodes file
instead of shutting down/restarting the entire tcp layer
just bounce all outgoing connections and reconnect

(This used to be ctdb commit e701a531868149f16561011e65794a4a46ee6596)

15 years agoadd a new eventscript : 99.routing that is used to add static routes to
Ronnie Sahlberg [Tue, 7 Oct 2008 00:03:30 +0000 (11:03 +1100)]
add a new eventscript : 99.routing that is used to add static routes to
interfaces when they are activated (an ip address is added during
takeip)

(This used to be ctdb commit d9779c310e98c9d4eab71a8d1705849ac90deb10)

15 years agoThe author of the upstream code asked for this code to be GPLv2+ not GPLv3
Andrew Tridgell [Tue, 30 Sep 2008 14:16:17 +0000 (07:16 -0700)]
The author of the upstream code asked for this code to be GPLv2+ not GPLv3

(This used to be ctdb commit 000018f2f4fb9f2452f56731b027dd6a7beda111)

15 years agomerged a bugfix for the idtree code from the Linux kernel. This
Andrew Tridgell [Tue, 30 Sep 2008 14:09:06 +0000 (07:09 -0700)]
merged a bugfix for the idtree code from the Linux kernel. This
matches commit 7aae6dd80e265aa9402ed507caaff4a5dba55069 in the kernel.

Many thanks to Jim Houston for pointing out this fix to us

(This used to be ctdb commit 85f6032cbdb197f3a003d86f086afa2cee898a4d)

15 years agoCheck that a database exists first before we dump its content (and
Ronnie Sahlberg [Mon, 22 Sep 2008 15:38:28 +0000 (01:38 +1000)]
Check that a database exists first before we dump its content (and
implicitely also create it) using 'ctdb catdb'

(This used to be ctdb commit 647003da975d4823abe8ed2bfb46153d68ea0fb0)

15 years agoexpanded ctdb_diagnostics based on recent experience
Andrew Tridgell [Wed, 17 Sep 2008 11:00:04 +0000 (21:00 +1000)]
expanded ctdb_diagnostics based on recent experience

(This used to be ctdb commit a06abf6bff6c4d379453e5063d8de1a6542c982a)

15 years agouse the correct tunable failcount not timeout
Ronnie Sahlberg [Wed, 17 Sep 2008 04:24:12 +0000 (14:24 +1000)]
use the correct tunable   failcount not timeout

(This used to be ctdb commit 475cfada33b4c13aaaca773d5485bbe26bffbf46)

15 years agoThe ctdb daemon keeps track of whether the recovery process is running
Ronnie Sahlberg [Wed, 17 Sep 2008 04:17:41 +0000 (14:17 +1000)]
The ctdb daemon keeps track of whether the recovery process is running
correctly by measuring how long it was since the last successful
communication with the recovery daemon was recorded.

After a certain timeout the ctdb daemon would deem the recovery daemon
as inoperable and shut down.

If the system clock is suddenly changed forward by many (60 or more)
seconds this could cause the timeout to trigger prematurely/immediately
where ctdb would incorrectly think that more than 60 seconds had passed
since last successful communications and thus abort.

Instead of cehcking for one timeout occuring, only deem the recovery
daemon to be "down" and trigger a shutdown if communications have
timedout for three intervals in a row.

(This used to be ctdb commit 196968c552e6ebcb57389d769a4b25f42fa8bc5d)

15 years agofix a slow memory leak in the recovery daemon in the error paths for the
Ronnie Sahlberg [Mon, 15 Sep 2008 23:00:48 +0000 (09:00 +1000)]
fix a slow memory leak in the recovery daemon in the error paths for the
memdump function

(This used to be ctdb commit 5e641ef9d6cca286061138a9680dcf2495736e8b)

15 years agofix some slow memory leaks in the vacuuming handler in the recovery
Ronnie Sahlberg [Mon, 15 Sep 2008 21:55:57 +0000 (07:55 +1000)]
fix some slow memory leaks in the vacuuming handler in the recovery
daemon

(This used to be ctdb commit 95bf36559d62f29e6f538f3a173b504ef3258341)

15 years agoFrom Volker L
Ronnie Sahlberg [Mon, 15 Sep 2008 20:50:28 +0000 (06:50 +1000)]
From Volker L
Fix a slow memory leak in the recovery daemon if there is a recoery
triggered during the public ip reassignment process

(This used to be ctdb commit 0aca4daf908b76d6013ff3dfad41beb9114fc1a3)

15 years agoupdates to the precompiled documentation
Ronnie Sahlberg [Sun, 14 Sep 2008 21:04:26 +0000 (07:04 +1000)]
updates to the precompiled documentation

(This used to be ctdb commit a120c734c5425ebb2e09c2009374aa0bc5a432e8)

15 years agoDocument the new descriptive node specifications.
Martin Schwenke [Fri, 12 Sep 2008 08:20:52 +0000 (18:20 +1000)]
Document the new descriptive node specifications.

Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit 2583d0c8f9932e023b7377582f62c840e264be80)

15 years agoonnode changes. "ok" is an alias for "healthy", "con" is an alias for
Martin Schwenke [Fri, 12 Sep 2008 06:55:18 +0000 (16:55 +1000)]
onnode changes.  "ok" is an alias for "healthy", "con" is an alias for
"connected".  Allow "rm" or "recmaster" to be a nodespec for the
recovery master. Better error handling for interaction with ctdb
client.

Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit 1d5d13345086201fe55517aea5f8674ff3db1090)

15 years agoMerge commit 'origin/master' into for-ronnie
Martin Schwenke [Fri, 12 Sep 2008 08:21:51 +0000 (18:21 +1000)]
Merge commit 'origin/master' into for-ronnie

(This used to be ctdb commit 2b70953cd51e879dc1fa113d68c6614051da5260)

15 years agoi add a new ctdb command "ctdb recmaster"
Ronnie Sahlberg [Fri, 12 Sep 2008 02:06:53 +0000 (12:06 +1000)]
i add a new ctdb command "ctdb recmaster"
this shows the node id of hte current recmaster

(This used to be ctdb commit 3ff0711fd3b288c153218ad33e8462a94b8d3275)

15 years agoChanges to onnode. Add "healthy" and "connected" as possible
Martin Schwenke [Fri, 12 Sep 2008 01:22:50 +0000 (11:22 +1000)]
Changes to onnode.  Add "healthy" and "connected" as possible
nodespecs.  Since we're now explicitly using bash, use local variables
when sensible.

Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit d1aa7953450de6ae215589404fb19e05edd3906f)

15 years agoMerge commit 'origin/master' into for-ronnie
Martin Schwenke [Fri, 12 Sep 2008 01:26:25 +0000 (11:26 +1000)]
Merge commit 'origin/master' into for-ronnie

(This used to be ctdb commit 666c3835376cd6b66aeaa110c76ecf052cd71a0a)

15 years agoMinor documentation fixes.
Martin Schwenke [Fri, 12 Sep 2008 00:36:15 +0000 (10:36 +1000)]
Minor documentation fixes.

Signed-off-by: Martin Schwenke <martin@meltin.net>
(This used to be ctdb commit 616b71fef54a3d1ea2638602099722a19ef6f41d)

15 years agolower the debuglevel when logging unknown idr in responses
Ronnie Sahlberg [Tue, 9 Sep 2008 03:59:48 +0000 (13:59 +1000)]
lower the debuglevel when logging unknown idr in responses

(This used to be ctdb commit a72f5b7d1560e427e18b1c55a2932a7fb037f4c7)

15 years agolower the debug level for when printing that the nodeflags have changed
Ronnie Sahlberg [Tue, 9 Sep 2008 03:55:31 +0000 (13:55 +1000)]
lower the debug level for when printing that the nodeflags have changed

(This used to be ctdb commit a89977f8cb2463a87147dcc0ad936cb5d4131670)

15 years agoadditional monitoring between the two daemons.
Ronnie Sahlberg [Tue, 9 Sep 2008 03:44:46 +0000 (13:44 +1000)]
additional monitoring between the two daemons.

we currently only monitor that the dameons are running by kill(0, pid)
and verifying the the domain socket between them is ok.

this is not sufficient since we can have a situation where the recovery
daemon is hung.

this new code monitors that the recovery daemon is operating.
if the recovery hangs, we log this and shut down the main daemon

(This used to be ctdb commit cd69d292292eaab3aac0e9d9fc57cb621597c63c)

15 years agoFrom C Cowan.
Ronnie Sahlberg [Sun, 7 Sep 2008 22:57:42 +0000 (08:57 +1000)]
From C Cowan.
Patch to make AIX compile with the new ipv6 additions.

(This used to be ctdb commit e26ce5140ed005725f8b7ac8ba23a180fd7d5337)

15 years agozero out the address structure to keep valgrind happy
Ronnie Sahlberg [Fri, 29 Aug 2008 02:26:02 +0000 (12:26 +1000)]
zero out the address structure to keep valgrind happy

(This used to be ctdb commit 8060e591b0eb2d184b5a7444487477225d2e1dbf)

15 years agonew version 1.0.58
Ronnie Sahlberg [Wed, 27 Aug 2008 00:26:34 +0000 (10:26 +1000)]
new version 1.0.58

(This used to be ctdb commit c4bc4317df4106e81b238dbfaf861ca6104304c1)

15 years agorename ctdb_tcp_client back to the original name ctdb_control_tcp
Ronnie Sahlberg [Wed, 27 Aug 2008 00:24:35 +0000 (10:24 +1000)]
rename ctdb_tcp_client back to the original name ctdb_control_tcp

(This used to be ctdb commit 4d1c0418cfe6170bc081684dbe45908a5d285f0b)

15 years agoFrom Abhijith Das <adas@redhat.com>:
Ronnie Sahlberg [Mon, 25 Aug 2008 00:13:18 +0000 (10:13 +1000)]
From Abhijith Das <adas@redhat.com>:

Fixup the initscript sdo it passes rpm-lint

(This used to be ctdb commit f84d0a9a8c7e9589e8833f21e1f977a0adab356b)

15 years agoAdd a "reload" option to the initscript.
Ronnie Sahlberg [Mon, 25 Aug 2008 00:03:16 +0000 (10:03 +1000)]
Add a "reload" option to the initscript.

(This used to be ctdb commit 2a8bf5e7dc7364a8280d96db0f9579d2582a8524)

15 years agoadd a link to my webpage
Ronnie Sahlberg [Sun, 24 Aug 2008 23:41:08 +0000 (09:41 +1000)]
add a link to my webpage

(This used to be ctdb commit 0709093af11e0735ed1e13b06ac073990631e692)

15 years agoversion 1.0.57 : initial ipv6 support
Ronnie Sahlberg [Sun, 24 Aug 2008 22:52:29 +0000 (08:52 +1000)]
version 1.0.57   : initial ipv6 support

(This used to be ctdb commit 7da0c65c8526d66d4f2a788bd646d39237befa54)

15 years agoDo not fail the takeip event if the "ip addr add ..." command failed.
Ronnie Sahlberg [Thu, 21 Aug 2008 23:25:47 +0000 (09:25 +1000)]
Do not fail the takeip event if the "ip addr add ..." command failed.
Let the event complete successfully.   the local recovery daemon will check that we have the address and reissue takip othervise.

There are several reasons why "ip addr add "  can fail, one is a misconfiguration
anothe ris that for ipv6 the stack is a lot more picky than for ipv4.     for examplke this WILL fail in ipv6 if there is a duplicate ip address on the network.

thus  this check could cause rolling-recoveries  which is why it has to go

(This used to be ctdb commit 12bc85c90a640a72ff538c003eb81da9dd1f2e3f)

15 years agowhen we collect all ip addresses and sort them for the "ctdb ip -n all" output we...
Ronnie Sahlberg [Thu, 21 Aug 2008 23:09:08 +0000 (09:09 +1000)]
when we collect all ip addresses and sort them for the "ctdb ip -n all" output we must look at more than just the first 4 bytes of the sockaddr address or ipv6 wont work

(This used to be ctdb commit 4dfbfb4618433d9ed79ca1bdb1e2e51d96d4ee62)

15 years agoWhen we harvest all tcp connections to kill off after a takeip/releaseip event we...
Ronnie Sahlberg [Wed, 20 Aug 2008 02:50:50 +0000 (12:50 +1000)]
When we harvest all tcp connections to kill off after a takeip/releaseip event we must also harvest the ipv4 connections which may be presented in ::ff:xxxx:xxxx form by netstat

(This used to be ctdb commit 293d12a40501320a21efaf592b8f20e8590a5197)

15 years agowe must canonicalize the sockaddr structures in killtcp so that we do the necessary...
Ronnie Sahlberg [Wed, 20 Aug 2008 02:02:54 +0000 (12:02 +1000)]
we must canonicalize the sockaddr structures in killtcp so that we do the necessary downgrade if required

(This used to be ctdb commit 2f8b33948e395228cbac3450c0c684e49069abf0)

15 years agomake the function to canonicalize a sockaddr structure public
Ronnie Sahlberg [Wed, 20 Aug 2008 01:58:27 +0000 (11:58 +1000)]
make the function to canonicalize a sockaddr structure public

(This used to be ctdb commit 1157d61a0bc557d8ffc453c518dfc48473492bfd)

15 years agowhen we compare ip addresses in ctdb_same_ip we must first canonicalize the addresses...
Ronnie Sahlberg [Wed, 20 Aug 2008 01:52:36 +0000 (11:52 +1000)]
when we compare ip addresses in ctdb_same_ip we must first canonicalize the addresses  so that we realize that 127.0.0.1:22 is really the same thing as ::ffff:127.0.0.1:22

Downgrade all AF_INET6 ::ffff:xxxx:xxxx sockaddresses into AF_INET ones

(This used to be ctdb commit b0fe4c45fc5ba1ecf62ebb921092c8a34e28a2bd)

15 years agoupdate the socketkiller in the eventscripts to be able to handle ipv6
Ronnie Sahlberg [Tue, 19 Aug 2008 23:47:00 +0000 (09:47 +1000)]
update the socketkiller in the eventscripts to be able to handle ipv6

(This used to be ctdb commit 6da7b36b7ccc4ee9b809867ea32036f09a801bb3)

15 years agofix a bug in the tcp socketkiller for ipv6
Ronnie Sahlberg [Tue, 19 Aug 2008 23:23:31 +0000 (09:23 +1000)]
fix a bug in the tcp socketkiller for ipv6

(This used to be ctdb commit 83735951352a243da185031e4853e7e40c43a0fb)

15 years agofix the ipv6 checksum calculation for pseudoheader so that it actually works
Ronnie Sahlberg [Tue, 19 Aug 2008 08:24:08 +0000 (18:24 +1000)]
fix the ipv6 checksum calculation for pseudoheader so that it actually works

add support to send ipv6 "gratious arp" aka neighbor solicitation packets from ctdb

Signed-off-by: Ronnie Sahlberg <ronniesahlberg@gmail.com>
(This used to be ctdb commit 0a38ea11af9237501f2951fee698a59b46f8750d)

15 years agoremove a file we dont need
Ronnie Sahlberg [Tue, 19 Aug 2008 04:58:57 +0000 (14:58 +1000)]
remove a file we dont need

Signed-off-by: Ronnie Sahlberg <ronniesahlberg@gmail.com>
(This used to be ctdb commit bf6effef0cc9e5f2eeeb38fce855a90624a76239)

15 years agoinitial ipv6 patch
Ronnie Sahlberg [Tue, 19 Aug 2008 04:58:29 +0000 (14:58 +1000)]
initial ipv6 patch

Signed-off-by: Ronnie Sahlberg <ronniesahlberg@gmail.com>
(This used to be ctdb commit 1f131f21386f428bbbbb29098d56c2f64596583b)

15 years agouse a local tdb_traverse instead of a ctdb_pulldb to lessen the impact of the system...
Ronnie Sahlberg [Thu, 14 Aug 2008 00:57:08 +0000 (10:57 +1000)]
use a local tdb_traverse instead of a ctdb_pulldb to lessen the impact of the system while performing a database backup

(This used to be ctdb commit 48fad9c06185a1f2580473cac02b3722e35c2023)

15 years agoonly freeze the local node when doing a backup and not the entire cluster
Ronnie Sahlberg [Wed, 13 Aug 2008 23:52:23 +0000 (09:52 +1000)]
only freeze the local node when doing a backup and not the entire cluster

(This used to be ctdb commit ff413beb4bb31e277e843235a1ce5e5ad7b92c71)

15 years agostore the database name, not the backup filename in the database header
Ronnie Sahlberg [Wed, 13 Aug 2008 22:36:39 +0000 (08:36 +1000)]
store the database name, not the backup filename in the database header

(This used to be ctdb commit 0674b33a7492cc1a194833f5ca87d8b30457faee)

15 years agoEncode a file version number in the database backup header
Ronnie Sahlberg [Wed, 13 Aug 2008 22:35:19 +0000 (08:35 +1000)]
Encode a file version number in the database backup header
Encode the database name in the header so we dont need to provide the database
name when doing a restore
Encode a timestamp in the header telling us when the backup was created

(This used to be ctdb commit 77762170ad1dbc4620565bb898af5d493fac117d)

15 years agoAdd two new ctdb commands :
Ronnie Sahlberg [Wed, 13 Aug 2008 12:03:29 +0000 (22:03 +1000)]
Add two new ctdb commands :

ctdb backupdb : which will copy a database out from ctdb and write it to a file
ctdb restoredb : which will read a database backup from a file and write it into ctdb

(This used to be ctdb commit b567e215f5c58d646a392408b9cc1df8ef029b33)

15 years agofixed merge
Andrew Tridgell [Mon, 11 Aug 2008 14:10:48 +0000 (00:10 +1000)]
fixed merge

(This used to be ctdb commit c4ad24e7c2b27b168d43ebfa95c459da27697d1e)

15 years agoup release version
Andrew Tridgell [Mon, 11 Aug 2008 13:52:46 +0000 (23:52 +1000)]
up release version

(This used to be ctdb commit 7769edbd5507f791e77cc5e5642d854bebec2df0)

15 years ago new version 1.0.56
Ronnie Sahlberg [Mon, 11 Aug 2008 13:50:42 +0000 (23:50 +1000)]
 new version 1.0.56

(This used to be ctdb commit 196973fef38a9b258e0976d5454161d11d573ddc)

15 years agoMerge commit 'ronnie/master'
Andrew Tridgell [Mon, 11 Aug 2008 13:33:46 +0000 (23:33 +1000)]
Merge commit 'ronnie/master'

(This used to be ctdb commit fb7f69b1d6dd8b0715258dde736dc03e1c5e43aa)