Dan Larkin-York
069b456217
[3.3] Backport #6539 . ( #6569 )
2018-09-24 13:00:08 +02:00
Tobias Gödderz
209e53aaf7
[3.3] Bugfix: Add missing plan version increments in repair-distributeShardsLike ( #5952 )
...
* Added missing plan version increments. Taken from #5649 .
* Fix test: increase plan version after changing the plan
2018-07-24 09:14:08 +02:00
Max Neunhöffer
a264d90227
Add an API to query for status of moveShard and cleanOutServer jobs. ( #5594 )
...
This is so far intentionally undocumented, since we want to collect
experience with it first.
2018-06-15 16:28:23 +02:00
Tobias Gödderz
a391088dbf
[3.3] Fix inconsistent distributeShardsLike job ( #5300 )
2018-05-17 12:07:20 +02:00
Kaveh Vahedipour
507418d9a4
stop supervision on demand ( #5109 )
...
* stop supervision on demand
* adding tests
* Correct an error message.
2018-04-20 11:58:47 +02:00
Max Neunhöffer
3e7425a392
More logging and increase timeout. ( #4858 )
2018-03-15 13:29:59 +01:00
Michael Hackstein
0a57be6863
Added missing semicolon ( #4570 )
2018-02-12 14:59:11 +01:00
Michael Hackstein
79a80558e1
Bug fix/distribute shards like ( #4415 )
2018-02-12 13:43:06 +01:00
Jan
607a9d9ebd
Wait for 5 dbservers to be there and GOOD before test starts. ( #4373 )
2018-01-22 09:26:03 +01:00
Max Neunhöffer
589bc68957
Fix shard distribution test. ( #4262 )
2018-01-15 22:45:12 +01:00
Kaveh Vahedipour
56a9ad69b1
Bug fix 3.3/supervision no longer fails to remove server from failed when back to good ( #4210 )
...
* let's not miss failedserver removal
* remove resetting of FailedServers in test code
* Only call abortRequestsToFailedServers at most every 3 seconds.
2018-01-03 21:55:01 +01:00
Jan
ddd8d5c8e1
Added 503/0 inquire tests to resilience tests ( #3839 ) ( #3934 )
2017-12-07 10:18:39 +01:00
Max Neunhöffer
766ab7c8cf
Fix agency shutdown bug. ( #3683 )
...
* Fix agency shutdown bug.
* Remove precondition that was not needed in AgencyComm::removeValues.
* Fail fatally if threads do not shut down.
2017-11-14 16:33:46 +01:00
Michael Hackstein
5c633f9fae
Bug fix/speedup shard distribution ( #3645 )
...
* Added a more sophisticated test for shardDistribution format
* Updated shard distribution test to use request instead of download
* Added a cxx reporter for the shard distribuation. WIP
* Added some virtual functions/classes for Mocking
* Added a unittest for the new CXX ShardDistribution Reporter.
* The ShardDsitributionReporter now reports Plan and Current correctly. However it does not dare to find a good total/current value and just returns a default. Hence these tests are still red
* Shard distribution now uses the cxx variant
* The ShardDistribution reporter now tries to execute count on the shards
* Updated changelog
* Added error case tests. If the servers time out the mechanism will stop bothering after two seconds and just report default values.
2017-11-10 15:17:08 +01:00
Kaveh Vahedipour
00650e6a3f
Bug fix/agency mt fixes ( #3158 )
...
* added debugging methods
* try to fix invalid access in case of error
* remove unused members
* bugfixes and comments
* all agency fixes in
* merge bug
* partially unguarded Agent::lead fixed
* all agency fixes in
* added nrBlocked to thread startup eval
* added nrBlocked to thread startup eval
* recombination of cases in State::get
* some maps replaced with unordered_maps
* optimized maps some
2017-08-30 10:43:51 +02:00
Max Neunhoeffer
cc985becd3
One more logging line in resilience test output.
2017-06-09 10:46:26 +02:00
Kaveh Vahedipour
d149baa317
ttls are considered, when key-value store read is invoced
2017-05-08 12:22:14 +02:00
Kaveh Vahedipour
1f81ce28b0
merge in cpp & js from 3.1.18 yet to do tests
2017-04-21 15:41:05 +02:00
Kaveh Vahedipour
e65e7c2d60
resilience tests fail at times
2017-02-21 11:05:10 +01:00
Max Neunhoeffer
cf2a47e52c
Increase timeout for testServerEmpty.
...
Under heavy load 5 min might not be enough.
2017-02-09 22:14:53 +01:00
Kaveh Vahedipour
b7b8a6cf88
lowering log output in agencycomm
2017-01-31 09:37:47 +01:00
Kaveh Vahedipour
5faef0bc9e
resilienceMove will do more reporting in cleanouts
2017-01-26 15:30:27 +01:00
Kaveh Vahedipour
1eef4c7b43
resilienceMove will do more reporting in cleanouts
2017-01-26 14:44:39 +01:00
Kaveh Vahedipour
f06fdf9e0f
getDBServers response changed for aardvark frontend. needed reflection in resilienceMove tests
2017-01-26 12:37:59 +01:00
Kaveh Vahedipour
95a165d5f4
inquiring about 503 and 0
2017-01-23 16:35:50 +01:00
Kaveh Vahedipour
981e71953d
handling agency comm failures
2017-01-23 11:26:03 +01:00
Kaveh Vahedipour
d2760f4ef1
pushing avoidServers property
2017-01-20 15:15:03 +01:00
Kaveh Vahedipour
40c6e2b48c
unobserve should not create entries in agency, if empty before
2017-01-20 08:06:27 +01:00
Max Neunhoeffer
4991056d61
Reactivate all shard move tests.
2017-01-19 14:21:35 +01:00
Max Neunhoeffer
370cb09d89
Merge branch 'devel' into schmutz-ng
2017-01-19 10:32:43 +01:00
Andreas Streichardt
9bcaf08753
Merge branch 'schmutz-ng' of github.com:arangodb/arangodb into schmutz-ng
2017-01-18 18:07:24 +01:00
Kaveh Vahedipour
f47b3b3c9d
transient heartbeats
2017-01-18 17:26:45 +01:00
Kaveh Vahedipour
879102117d
more replicationTest
2017-01-16 15:43:32 +01:00
Kaveh Vahedipour
aad9d6d576
agency-test should not break during reelection
2017-01-16 13:05:47 +01:00
Kaveh Vahedipour
a75b3624de
resilience move ok again?
2017-01-16 12:09:21 +01:00
Kaveh Vahedipour
5bf3a1ee33
fixing move shards resilience test
2017-01-16 10:29:21 +01:00
Kaveh Vahedipour
7b023bd66f
move shard jobs fail early if the contracting goes south
2017-01-11 11:50:18 +01:00
Kaveh Vahedipour
f34796b432
move resilience should now be correct as a test
2017-01-10 17:30:09 +01:00
Kaveh Vahedipour
edac0586ab
move shard has always been a matter of luck as a test in the cleanoutserver part
2017-01-10 15:37:08 +01:00
Kaveh Vahedipour
ea4467e0d4
shortening resilienceFail
2017-01-05 12:21:27 +01:00
Max Neunhoeffer
a7f295bfe8
Fix error handling in resilienceMove test.
2016-12-16 11:02:13 +01:00
Kaveh Vahedipour
f66d3b1dac
fixed moveshard tests
2016-12-15 17:22:01 +01:00
Kaveh Vahedipour
842d1030f0
Fixed dangling UUID problem in missing database directory
2016-12-13 15:36:19 +01:00
Kaveh Vahedipour
2b9c018817
fixed resilience
2016-12-09 16:35:32 +01:00
Andreas Streichardt
9fedba9fff
Fix eslint
2016-12-08 10:14:50 +01:00
Andreas Streichardt
10f7d75651
Fix uniform shard distribution when creating collections
2016-12-07 18:38:15 +01:00
Max Neunhoeffer
039ca9aceb
Increase some timeouts from 60 to 90s to be more load tolerant.
2016-10-13 12:52:59 +02:00
Andreas Streichardt
e75b8f5503
Implement cURL based inter server communication
2016-10-10 14:56:04 +02:00
Kaveh Vahedipour
cf09546d93
fixed erroneous break of supervision agency updates
2016-10-07 11:01:45 +02:00
Kaveh Vahedipour
ce8c1a0cac
revisiting all supervision jobs
2016-10-05 17:16:02 +02:00