1
0
Fork 0
Commit Graph

2071 Commits

Author SHA1 Message Date
Max Neunhöffer 42fd0825ab
Fix timeouts for write operations from coordinator to leader. (#7081)
* Improve logging on coordinator when doing `arangorestore`.

* Return more error information in `mergeResults`.

* Longer timeout for communication coordinator -> leader for writes.

This is taking into account possible write stops from followers needed
to get in sync.

* Fix compilation.

* Get rid of numbers in exception log messages.

* Fix compilation.

* Fix indentation.
2018-10-31 14:39:48 +01:00
Michael Hackstein b280142efa
Revert "fixes some misbehaviour within the coordinator agency callbacks (#7104)" (#7150)
This reverts commit 9ee7a0e955.
2018-10-30 16:48:56 +01:00
Heiko 9ee7a0e955 fixes some misbehaviour within the coordinator agency callbacks (#7104)
* fixes some misbehaviour within the coordinator agency callbacks

* changelog
2018-10-30 16:47:37 +01:00
Simon c073b9dbbe Make ensureIndexOnCoordinator more robust (#7110) (#7130) 2018-10-30 11:25:06 +01:00
Simon 9271a11441 RocksDB replication thread safety (#7088) (#7131) 2018-10-30 11:24:17 +01:00
Vasiliy e6a6025818 backport: switch scope of responsibility between a TRI_vocbase_t and a LogicalView in respect to view creation/deletion (#7106)
* backport: switch scope of responsibility between a TRI_vocbase_t and a LogicalView in respect to view creation/deletion

* backport: ensure arangosearch links get exported in the dump

* backport: ensure view is created during restore on the coordinator

* Updates for ArangoSearch DDL tests, IResearchView unregistration and known issues

* Add fix for internal issue 483
2018-10-30 12:50:29 +03:00
Tobias Gödderz e9388ab710 [3.4] Stop curl from trying to POST stdin (#7097)
* Stop libcurl from trying to POST stdin

* Stop relocking every iteration in wait

* Remove unimplemented function

* Restrict setting of empty POSTFIELDS to POST requests

* Revert locking change
2018-10-29 14:41:23 +01:00
Michael Hackstein e05880895a
Bug fix 3.4/shorter foot in door (#7084)
* Implement `syncCollectionCatchup` in DatabaseTailingSyncer.

First stab, might not even compile.

* Fixed a typo.

* Fix a typo and a compilation problem.

* Further compilation fix.

* Implement two stage catchup.

* Two small corrections.

* Unified error messages in Synchronize shard job.

* Improved a code comment.

* Fixed autocasting bool->double and double->bool issue. That is truely one of the best features ever invented... </irony>

* Renamed doHardLock => toSoftLockOnly and inverted default value

* Merged soft/hard foot logic with Transaction splits

* Use scopeguards to cancel readlocks
2018-10-26 16:16:52 +02:00
Max Neunhoeffer 015275a724
Emergency fix to compile on gcc 8. 2018-10-26 11:13:56 +02:00
Max Neunhöffer 8564a08bbb
Try to fix timeout in drop collection. (#7058)
* Try to fix timeout in drop collection.
* Fix compilation.
2018-10-25 16:51:16 +02:00
Jan b903f1f8ff
Bug fix 3.4/fix catch test issues (#7045) 2018-10-25 12:49:00 +02:00
Simon e87b42a0c3 Silence tsan warnings (#7051) 2018-10-24 23:58:47 +02:00
Simon 6eb9e38b08 Better agency pool update (#7036) 2018-10-24 16:23:10 +02:00
Vasiliy 52e2c97693 backport missed changes (#7016) 2018-10-24 15:43:45 +03:00
Simon 8b19d40136 Properly compare velocypack objects in Agency operations (#6922) 2018-10-23 11:52:22 +02:00
Matthew Von-Maszewski 43016cf04f Bugfix 3.4: address concerns from prior scheduler PR (#7005) 2018-10-23 11:30:45 +02:00
Simon c0455e9c60 Add engine specific collection APIs (#6962) 2018-10-19 15:23:55 +02:00
Lars Maier d7863b4583 Bug fix 3.4/cluster comm threads start stop (#6939)
* Start ClusterComm threads in `ClusterFeature::start`. Stop ClusterComm threads in `ClusterFeature::stop`.

* Do not free objects in `Scheduler::shutdown`. Let the `unique_ptr` do their job. Stop ClusterComm threads in `ClusterFeature::stop`, but free instance in `ClusterFeature::unprepare`.

* `io_context` may contains lambdas that hold `shared_ptr`s to `Tasks` the required a functional `VocBase` in their destructor.

* Clean up.
2018-10-19 13:12:51 +02:00
Jan 19e2dd87bd
Replace engine equality feature (#6931) (#6950) 2018-10-17 20:34:19 +02:00
Matthew Von-Maszewski 887822afa6 Bug fix 3.4: libcurl threading changes (#6829)
* enable the ability to push results processing to threads
* have ClusterComm push libcurl response processing to Scheduler threads
* tuning changes from Matthew and Michael
* give new defaults to minimum thread count
* create multiple ClusterCommThreads, each with own Communicator object
* put PR notes in change log
* correct speling
* Also drain V8 queue.
* Add prio V8 to switch in canPostDirectly.
* Accept --server.minimal-threads even if maximal threads is not set.
* Reactivate stopping of threads.
2018-10-12 17:00:55 +02:00
Jan 815adaa56f
Bug fix 3.4/fixes 110918 2 (#6845) 2018-10-12 12:48:41 +02:00
Jan 3adaf001c5
velocypack library update (#6850) 2018-10-12 12:46:52 +02:00
Jan 4dacd7c3b3
suppress some of these dreaded error messages (#6787) 2018-10-11 10:46:04 +02:00
Max Neunhöffer 052d8f2cdb
Fix a bug when getting in sync and old requests are still lingering. (#6788) (#6795) 2018-10-10 18:04:31 +02:00
Dan Larkin-York ff2ce5c846 Fix issue with colleciton/view name conflict checking in cluster. (#6779) 2018-10-10 12:40:28 +02:00
Jan f51ab1912a
fix arangojs issue 573 (#6766) 2018-10-10 09:19:37 +02:00
Matthew Von-Maszewski 276da923cf Bugfix 3.4: dynamic management of libcurl's open connections (#6711)
* untested connection count management code.
* add digit to identify location of identical error messages.
* unit tests created, code working and integrated
* subtle changes to smoothing algorithm, close ... but still not perfect
* use scheduler queue() instead of post() to avoid occassional extreme latencies. throughput improved!
* temporary hack to hardcode queue size much lower
* force direct calls to post() to instead route through queue()
* correct issue with previous check-in, but performance has dropped with this on
* revert hardcoded value in SchedulerFeature.cpp
* queue to high priority to be consistent with other Aql actions
* backout all edits to Scheduler.cpp and Scheduler.h
* oops, forgot to remove some testing code.
* 3.4 merge/update
* add changelog entry
2018-10-10 09:11:11 +02:00
Max Neunhöffer fa683d3925
Remove a relic from early days in /Target/FailedServers. (#6689)
* Remove a relic from early days in /Target/FailedServers.
* Fix a test.
2018-10-09 13:49:38 +02:00
Tobias Gödderz ed2c7ad774 Fixed testSetup(). Reduced redundant code. (#6760) 2018-10-08 18:57:07 +02:00
Dan Larkin-York 3d15b96f66 [3.4] Additional and cleaned-up logging (#6648) 2018-10-01 19:03:05 +02:00
Wilfried Goesgens fbd2ca2e79 Feature 3.4/windows utf16 fileaccess (#6600) 2018-10-01 17:19:01 +02:00
Max Neunhöffer 0b72d02d19
Add more timeout because in Jenkins dbservers can be slow. (#6665) 2018-10-01 16:54:36 +02:00
Kaveh Vahedipour 8b9a0f6a25 Feature 3.4/detailed get maintenance feature (#6666)
* backport of test data generation for maintenance from devel
2018-10-01 14:50:25 +02:00
Max Neunhöffer b67718d6aa
Fix cleanup of Current entry in case a follower is removed from Plan. (#6623)
* Properly remove unplanned followers in leader and Current.
* Add a catch test.
* Fix tests.
* Fix a bug with a temporary object.
* Protect against exception from getCollection not found.
2018-09-28 17:37:14 +02:00
Lars Maier 09395e73de Added try-catch-block. (#6649)
* Added try-catch-block.
* Removed debug output.
* Deleted unneeded constructors.
* Assignment operator deleted.
2018-09-28 17:09:50 +02:00
Lars Maier c5b67d217d Feature 3.4/static const strings cleanup (#6504)
* AgentConfiguration cleanup
* static strings in maintenance / agency
* fix windows build
* test bogus
* got rid of old inefficient create method
* completed with NonAction
* this works with osx / windows
* map creation can be outside function
* string init order fiasco
* startup init fiasco
* fix init-order fiasco with static strings (#6475)
* try to work around compile errors
* Removed broken and unused strings.
2018-09-21 13:18:37 +02:00
Simon 3c965ee48a Resilience test failure points (#6545) 2018-09-20 01:04:38 +02:00
Simon 292b6312ae Properly check syncer erros, catch more exceptions (#6522) 2018-09-17 16:38:44 +02:00
Simon 1b1fbde2c8 Copy installation files on startup (#6494) 2018-09-14 11:14:50 +02:00
Kaveh Vahedipour 2041e56f44 advertised endpoints (#6493) 2018-09-14 10:05:46 +02:00
Jan b4e6894830
Bug fix 3.4/fix cluster index estimates (#6487) 2018-09-13 23:14:07 +02:00
Simon 49a824068f Fix ReadOnly Mode without auth-enabled (#6479)
(cherry picked from commit e2f4d08d437bba3228bf6dd088755de787461c63)
2018-09-12 20:07:31 +02:00
Jan a07467e7e0
fix cluster index selectivity estimates (#6470) 2018-09-12 15:55:50 +02:00
Lars Maier 798375fbcd Dedicated thread for Phase 1&2 (#6342)
* First draft of dedicated thread for phase 1 and phase 2.

* Added comments and removed old code.
2018-09-06 12:14:53 +02:00
Max Neunhöffer 5a865b687c
Wait 2s after switching server mode before answering. (#6393)
This is needed because the change is propagated via the agency and the
heartbeat, which only happens once per second.
2018-09-05 21:14:07 +02:00
Kaveh Vahedipour 5ec01abb48 typo (#6383) 2018-09-05 15:22:42 +02:00
Kaveh Vahedipour 813a580246 maintenance fast lane backport (#6350) 2018-09-05 14:54:04 +02:00
Jan e87e305793
implement caching for stats values too (#6360) 2018-09-04 16:05:10 +02:00
Simon 3eed525481 Hide links (#6348) 2018-09-03 15:36:37 +02:00
Jan cb19878fad more explicit order for SystemDatabaseFeature (#6335) 2018-09-01 22:54:03 +02:00