git.opendaylight Code Review - controller.git/commit

author	Tom Pantelis <tpanteli@brocade.com>
	Tue, 2 Aug 2016 02:23:33 +0000 (22:23 -0400)
committer	Tom Pantelis <tpanteli@brocade.com>
	Wed, 3 Aug 2016 04:02:06 +0000 (04:02 +0000)
commit	5e590ea1548dd4974ca59f1318a11f2a8651c3b7
tree	f2ad13928dc528e3fc0a284a4f1e6874c349e77d	tree \| snapshot
parent	364229dd715facec8ef8c73d6c60546c5f38b103	commit \| diff

Improve leader election convergence

When 2 nodes startup with the first node's log behind the second node's,
it usually takes several election rounds to converge - I've seen
anywhere from 40 s to 3 min, depending on timing. What happens is that
the first node goes to Candidate first but it's RequestVote is rejected
by the seconds node. Shortly after the seconds node goes to Candidate -
the term is higher than the first which causes the first node to go back
to Follower. However it doesn't respond to the RequestVote. Then the
first node goes to Candidate and the cycle repeats. Eventually, due to
the election variance, the seconds node times out first and the second
node process the RequestVote and grants it. But it can take more than 10
cycles.

We can improve the convergence by allowing a Candidate to process and
respond to RequestVote when the sender's term is greater. It still
transitions to Follower as per the raft rules. The raft paper does not
say whether or not a Candidate can/should process a RequestVote in this
case but it seems to make sense. With this change, the first RequestVote
sent by the second node is granted and it converges quickly.

Change-Id: If9416ddf7bf0dfc1220a169be4174f440626a0dd
Signed-off-by: Tom Pantelis <tpanteli@brocade.com>

opendaylight/md-sal/sal-akka-raft/src/main/java/org/opendaylight/controller/cluster/raft/behaviors/Candidate.java		diff \| blob \| history
opendaylight/md-sal/sal-akka-raft/src/test/java/org/opendaylight/controller/cluster/raft/behaviors/CandidateTest.java		diff \| blob \| history