arangodb/arangod
Max Neunhöffer 46e479376d
Further supervision fixes. (#8259)
* Do not schedule Coordinators in Plan.

* Finish failed server when server is no longer in health.

* Fix removeServer checks.

Check that server is no longer in use before removing it. Give 60s
waiting time for condition to be met. Also observe agency lock.

* Finish FailedFollower job if server no longer follower.

This can happen because RemoveFollower was faster.

* Only use GOOD servers as replacement followers.

* Fix AddFollower for satellite collections.

* Fix RemoveServer for satellite collections.

* MoveShard handles moves from leader to followers

* Prepare CleanoutServer and FailedServer for satellite collections.

* More sorting out of AddFollower and RemoveFollower.

* Fix RemoveFollower job w.r.t. choice of follower to remove.

* Fix message.

* kill your own sub jobs, please

* Added preconditions to payloads for supervision's job finishers

* Improve logging.

* Add agency diagnostics to failed move shard test, start.

* Add coordinator agency diagnostics.

* Remove warning.

* Add changelog entry.

* Add agency diagnostics if things go sour with move shard.

* Add agency diags when things go wrong 2.

* API /_api/agency/state: back to old format.

* Fix Windows compilation.

* handle aborts in supervision and wait for the last Raft log to be committed

* tests compiling, 2 failing for valid reasons

* Correctly report TRI_ERROR_CLUSTER_CONNECTION_LOST as 503.

* FailedLeader/FailedFollower cannot continue when aborting blocks
2019-03-04 11:43:35 +01:00
Actions
Agency Further supervision fixes. (#8259) 2019-03-04 11:43:35 +01:00
Aql
Auth
Cache
Cluster Further supervision fixes. (#8259) 2019-03-04 11:43:35 +01:00
ClusterEngine
GeneralServer
GeoIndex
Graph
IResearch issue 525.1: backport 3.4: ensure RocksDB CreateIndex/DropIndex WAL markers are properly written during recovery (#8279) 2019-03-01 15:57:42 +03:00
Indexes
InternalRestHandler
MMFiles
Pregel Fix Pregel nullptr checks (#8276) 2019-03-01 13:34:05 +01:00
Replication abort ongoing transactions in all cases (#8290) 2019-02-28 14:41:22 +01:00
RestHandler
RestServer
RocksDBEngine issue 525.1: backport 3.4: ensure RocksDB CreateIndex/DropIndex WAL markers are properly written during recovery (#8279) 2019-03-01 15:57:42 +03:00
Scheduler use JobGuard when querying users from DB in cluster (#8057) 2019-01-30 12:00:50 +01:00
Sharding
Statistics
StorageEngine Feature 3.4/improve replication speed (#8268) 2019-02-28 14:37:40 +01:00
Transaction
Utils
V8Server Fix Pregel nullptr checks (#8276) 2019-03-01 13:34:05 +01:00
VocBase Feature 3.4/improve replication speed (#8268) 2019-02-28 14:37:40 +01:00
CMakeLists.txt
dbg.cmake