1
0
Fork 0
Commit Graph

86 Commits

Author SHA1 Message Date
Dan Larkin-York 069b456217 [3.3] Backport #6539. (#6569) 2018-09-24 13:00:08 +02:00
Tobias Gödderz 209e53aaf7 [3.3] Bugfix: Add missing plan version increments in repair-distributeShardsLike (#5952)
* Added missing plan version increments. Taken from #5649.

* Fix test: increase plan version after changing the plan
2018-07-24 09:14:08 +02:00
Max Neunhöffer a264d90227
Add an API to query for status of moveShard and cleanOutServer jobs. (#5594)
This is so far intentionally undocumented, since we want to collect
experience with it first.
2018-06-15 16:28:23 +02:00
Tobias Gödderz a391088dbf [3.3] Fix inconsistent distributeShardsLike job (#5300) 2018-05-17 12:07:20 +02:00
Kaveh Vahedipour 507418d9a4 stop supervision on demand (#5109)
* stop supervision on demand
* adding tests
* Correct an error message.
2018-04-20 11:58:47 +02:00
Max Neunhöffer 3e7425a392 More logging and increase timeout. (#4858) 2018-03-15 13:29:59 +01:00
Michael Hackstein 0a57be6863 Added missing semicolon (#4570) 2018-02-12 14:59:11 +01:00
Michael Hackstein 79a80558e1 Bug fix/distribute shards like (#4415) 2018-02-12 13:43:06 +01:00
Jan 607a9d9ebd
Wait for 5 dbservers to be there and GOOD before test starts. (#4373) 2018-01-22 09:26:03 +01:00
Max Neunhöffer 589bc68957 Fix shard distribution test. (#4262) 2018-01-15 22:45:12 +01:00
Kaveh Vahedipour 56a9ad69b1 Bug fix 3.3/supervision no longer fails to remove server from failed when back to good (#4210)
* let's not miss failedserver removal
* remove resetting of FailedServers in test code
* Only call abortRequestsToFailedServers at most every 3 seconds.
2018-01-03 21:55:01 +01:00
Jan ddd8d5c8e1 Added 503/0 inquire tests to resilience tests (#3839) (#3934) 2017-12-07 10:18:39 +01:00
Max Neunhöffer 766ab7c8cf
Fix agency shutdown bug. (#3683)
* Fix agency shutdown bug.
* Remove precondition that was not needed in AgencyComm::removeValues.
* Fail fatally if threads do not shut down.
2017-11-14 16:33:46 +01:00
Michael Hackstein 5c633f9fae Bug fix/speedup shard distribution (#3645)
* Added a more sophisticated test for shardDistribution format

* Updated shard distribution test to use request instead of download

* Added a cxx reporter for the shard distribuation. WIP

* Added some virtual functions/classes for Mocking

* Added a unittest for the new CXX ShardDistribution Reporter.

* The ShardDsitributionReporter now reports Plan and Current correctly. However it does not dare to find a good total/current value and just returns a default. Hence these tests are still red

* Shard distribution now uses the cxx variant

* The ShardDistribution reporter now tries to execute count on the shards

* Updated changelog

* Added error case tests. If the servers time out the mechanism will stop bothering after two seconds and just report default values.
2017-11-10 15:17:08 +01:00
Kaveh Vahedipour 00650e6a3f Bug fix/agency mt fixes (#3158)
* added debugging methods

* try to fix invalid access in case of error

* remove unused members

* bugfixes and comments

* all agency fixes in

* merge bug

* partially unguarded Agent::lead fixed

* all agency fixes in

* added nrBlocked to thread startup eval

* added nrBlocked to thread startup eval

* recombination of cases in State::get

* some maps replaced with unordered_maps

* optimized maps some
2017-08-30 10:43:51 +02:00
Max Neunhoeffer cc985becd3 One more logging line in resilience test output. 2017-06-09 10:46:26 +02:00
Kaveh Vahedipour d149baa317 ttls are considered, when key-value store read is invoced 2017-05-08 12:22:14 +02:00
Kaveh Vahedipour 1f81ce28b0 merge in cpp & js from 3.1.18 yet to do tests 2017-04-21 15:41:05 +02:00
Kaveh Vahedipour e65e7c2d60 resilience tests fail at times 2017-02-21 11:05:10 +01:00
Max Neunhoeffer cf2a47e52c Increase timeout for testServerEmpty.
Under heavy load 5 min might not be enough.
2017-02-09 22:14:53 +01:00
Kaveh Vahedipour b7b8a6cf88 lowering log output in agencycomm 2017-01-31 09:37:47 +01:00
Kaveh Vahedipour 5faef0bc9e resilienceMove will do more reporting in cleanouts 2017-01-26 15:30:27 +01:00
Kaveh Vahedipour 1eef4c7b43 resilienceMove will do more reporting in cleanouts 2017-01-26 14:44:39 +01:00
Kaveh Vahedipour f06fdf9e0f getDBServers response changed for aardvark frontend. needed reflection in resilienceMove tests 2017-01-26 12:37:59 +01:00
Kaveh Vahedipour 95a165d5f4 inquiring about 503 and 0 2017-01-23 16:35:50 +01:00
Kaveh Vahedipour 981e71953d handling agency comm failures 2017-01-23 11:26:03 +01:00
Kaveh Vahedipour d2760f4ef1 pushing avoidServers property 2017-01-20 15:15:03 +01:00
Kaveh Vahedipour 40c6e2b48c unobserve should not create entries in agency, if empty before 2017-01-20 08:06:27 +01:00
Max Neunhoeffer 4991056d61 Reactivate all shard move tests. 2017-01-19 14:21:35 +01:00
Max Neunhoeffer 370cb09d89 Merge branch 'devel' into schmutz-ng 2017-01-19 10:32:43 +01:00
Andreas Streichardt 9bcaf08753 Merge branch 'schmutz-ng' of github.com:arangodb/arangodb into schmutz-ng 2017-01-18 18:07:24 +01:00
Kaveh Vahedipour f47b3b3c9d transient heartbeats 2017-01-18 17:26:45 +01:00
Kaveh Vahedipour 879102117d more replicationTest 2017-01-16 15:43:32 +01:00
Kaveh Vahedipour aad9d6d576 agency-test should not break during reelection 2017-01-16 13:05:47 +01:00
Kaveh Vahedipour a75b3624de resilience move ok again? 2017-01-16 12:09:21 +01:00
Kaveh Vahedipour 5bf3a1ee33 fixing move shards resilience test 2017-01-16 10:29:21 +01:00
Kaveh Vahedipour 7b023bd66f move shard jobs fail early if the contracting goes south 2017-01-11 11:50:18 +01:00
Kaveh Vahedipour f34796b432 move resilience should now be correct as a test 2017-01-10 17:30:09 +01:00
Kaveh Vahedipour edac0586ab move shard has always been a matter of luck as a test in the cleanoutserver part 2017-01-10 15:37:08 +01:00
Kaveh Vahedipour ea4467e0d4 shortening resilienceFail 2017-01-05 12:21:27 +01:00
Max Neunhoeffer a7f295bfe8 Fix error handling in resilienceMove test. 2016-12-16 11:02:13 +01:00
Kaveh Vahedipour f66d3b1dac fixed moveshard tests 2016-12-15 17:22:01 +01:00
Kaveh Vahedipour 842d1030f0 Fixed dangling UUID problem in missing database directory 2016-12-13 15:36:19 +01:00
Kaveh Vahedipour 2b9c018817 fixed resilience 2016-12-09 16:35:32 +01:00
Andreas Streichardt 9fedba9fff Fix eslint 2016-12-08 10:14:50 +01:00
Andreas Streichardt 10f7d75651 Fix uniform shard distribution when creating collections 2016-12-07 18:38:15 +01:00
Max Neunhoeffer 039ca9aceb Increase some timeouts from 60 to 90s to be more load tolerant. 2016-10-13 12:52:59 +02:00
Andreas Streichardt e75b8f5503 Implement cURL based inter server communication 2016-10-10 14:56:04 +02:00
Kaveh Vahedipour cf09546d93 fixed erroneous break of supervision agency updates 2016-10-07 11:01:45 +02:00
Kaveh Vahedipour ce8c1a0cac revisiting all supervision jobs 2016-10-05 17:16:02 +02:00