Fix intermitent testFollowerResyncWith*LeaderRestart failure 81/62381/1
authorTom Pantelis <tompantelis@gmail.com>
Mon, 28 Aug 2017 19:44:31 +0000 (15:44 -0400)
committerTom Pantelis <tompantelis@gmail.com>
Mon, 28 Aug 2017 19:44:31 +0000 (15:44 -0400)
commit88e2974b8d391d6e91a6338b0a1b8dbf966a8a71
treec89f26c71fbe18ebfb777dde9bc2d400f2b85f14
parentdd0058fe845c7fae620289fc87f7a1149ee2da85
Fix intermitent testFollowerResyncWith*LeaderRestart failure

NonVotingFollowerIntegrationTest#testFollowerResyncWithOneMoreLeaderLogEntryAfterNonPersistentLeaderRestart fails intermittently:

NonVotingFollowerIntegrationTest.testFollowerResyncWithOneMoreLeaderLogEntryAfterNonPersistentLeaderRestart:233 Did not receive message of type class org.opendaylight.controller.cluster.raft.base.messages.SnapshotComplete

This seems to be a side-effect of https://git.opendaylight.org/gerrit/#/c/62255/
which changes the timing a bit such that an install snapshot doesn't occur on the
follower which should happen in order to completely re-sycnc it with the leader -
instead it ends up removing the stale out-of-sync entries and appending the new ones
from the leader which gets the journal up-to-date but the stale entries had already
been applied to the state which leaves the state out-of-sync with journal. I added
an additional check in the follower to force the leader to install a snapshot
if the first out-of-sync log entry index <= the lastAppliedIndex which means the
entries to be removed have already been applied to the state.

Change-Id: Ic3815a694a8531d9f7f42f19ad8978d52fc902b3
Signed-off-by: Tom Pantelis <tompantelis@gmail.com>
opendaylight/md-sal/sal-akka-raft/src/main/java/org/opendaylight/controller/cluster/raft/behaviors/Follower.java