1
0
Fork 0
Commit Graph

349 Commits

Author SHA1 Message Date
Jan c655a89db6
make sure all errors are prefixed with ERROR_ @fceller @kvs85 (#9383) 2019-07-02 18:06:52 +02:00
Matthew Von-Maszewski 474f0cde31 Bug fix 3.4/scheduler empty reformat (#7872)
* added check for empty scheduler

* removed log, old is 1 not 0

* require running in this thread

* test

* added isDirect to callback

* signature fixed

* added drain

* added allowDirectHandling

* disabled for testing

* Add ExecContextScope object to direct call.

* try alternate initialization of ExecContextScope

* remove ExecContextScope, no help.  try _fifoSize as part of direct decision.

* strand management to minimize reuse of same strand per listen socket

* blind attempt to address Jenkins shutdown lock up.  may remove quickly.

* add filename and line to existing error log message

* Adjust queueOperation() to stop accepting items once isStopping() becomes true.

* revert previous check-in to MMFilesCollectorThread.cpp

* big reformat

* fixed merge conflicts

* Add CHANGELOG entry.
2019-01-08 20:39:42 +01:00
Frank Celler 9477af198b big reformat 2018-12-26 00:57:05 +01:00
Jan 8f1ce5b396
fix issues found by codescan (#7198) 2018-11-02 18:53:34 +01:00
Matthew Von-Maszewski 87c4d99eea Bugfix 3.4: Allow libcurl time to retry DNS request (#7126)
* open up connect limit to allow one DNS retry

* add CHANGELOG entry for dns retry fix

* update warning message wording

* Add lock line that was missed in recent scheduler merge
2018-10-30 18:39:48 +01:00
Matthew Von-Maszewski cfaedce493 defensive coding to ensure lambda function is not dependent upon Communicator object (#7144) 2018-10-30 18:25:11 +01:00
Tobias Gödderz e9388ab710 [3.4] Stop curl from trying to POST stdin (#7097)
* Stop libcurl from trying to POST stdin

* Stop relocking every iteration in wait

* Remove unimplemented function

* Restrict setting of empty POSTFIELDS to POST requests

* Revert locking change
2018-10-29 14:41:23 +01:00
Matthew Von-Maszewski 43016cf04f Bugfix 3.4: address concerns from prior scheduler PR (#7005) 2018-10-23 11:30:45 +02:00
Simon f5392d3c53 Fix SimpleHttpClient assertions (#6913) 2018-10-16 20:31:29 +02:00
Simon 7eda6768ab Refactor stuff, add async batch extension task (#6875)
* Refactor stuff, add async batch extension task

* fix compilation
2018-10-15 11:43:45 +02:00
Matthew Von-Maszewski 887822afa6 Bug fix 3.4: libcurl threading changes (#6829)
* enable the ability to push results processing to threads
* have ClusterComm push libcurl response processing to Scheduler threads
* tuning changes from Matthew and Michael
* give new defaults to minimum thread count
* create multiple ClusterCommThreads, each with own Communicator object
* put PR notes in change log
* correct speling
* Also drain V8 queue.
* Add prio V8 to switch in canPostDirectly.
* Accept --server.minimal-threads even if maximal threads is not set.
* Reactivate stopping of threads.
2018-10-12 17:00:55 +02:00
Simon 010b54c81e Allow WAL logger to split up transactions (#6800) 2018-10-12 15:15:55 +02:00
Matthew Von-Maszewski 276da923cf Bugfix 3.4: dynamic management of libcurl's open connections (#6711)
* untested connection count management code.
* add digit to identify location of identical error messages.
* unit tests created, code working and integrated
* subtle changes to smoothing algorithm, close ... but still not perfect
* use scheduler queue() instead of post() to avoid occassional extreme latencies. throughput improved!
* temporary hack to hardcode queue size much lower
* force direct calls to post() to instead route through queue()
* correct issue with previous check-in, but performance has dropped with this on
* revert hardcoded value in SchedulerFeature.cpp
* queue to high priority to be consistent with other Aql actions
* backout all edits to Scheduler.cpp and Scheduler.h
* oops, forgot to remove some testing code.
* 3.4 merge/update
* add changelog entry
2018-10-10 09:11:11 +02:00
Lars Maier 34ec28c6b9 Reset retry counter only when previously connected in order to not disable the retry check. (#6753)
Abort the doRequest loop when in shutdown.
2018-10-08 12:54:09 +02:00
Kaveh Vahedipour 2041e56f44 advertised endpoints (#6493) 2018-09-14 10:05:46 +02:00
jsteemann ac94de2e96 fix some replication issues 2018-07-27 19:03:43 +02:00
Jan e4d7f1c5f0
Bug fix/wenn der shard mann 2mal klingelt (#5890) 2018-07-26 15:37:40 +02:00
jsteemann 44c7b1b476 remove tabstops 2018-07-16 15:00:12 +02:00
Simran 34ec56d421 Feature/misc spelling corrections (#5164) 2018-07-13 13:06:20 +02:00
jsteemann 2d6dc9b8bc make performance great again 2018-07-10 14:54:41 +02:00
Jan 208f1297e1
Bug fix/fixes 1007 (#5815) 2018-07-10 13:47:04 +02:00
Jan 0b7c74ca96
make curl not send "Expect: 100-continue" headers (#5730) 2018-07-10 09:17:01 +02:00
Jan bf5caf210c
Bug fix/cleanup 2305 (#5437) 2018-05-23 15:45:55 +02:00
Jan 8e6d5df129
fixed minor several compiler complaints (#5406) 2018-05-23 11:50:00 +02:00
Jan 76e2c1b087
abort startup when using SSLv2 for a server endpoint (#5339) 2018-05-15 18:38:34 +02:00
Simon fdee0544b7 Using asio::io_context::strands instead of locks (#5266)
* initial try adding strands

* working, stable amount of threads

* improve shell_client cluster

* Fixing some accounting for the scheduler

* Fix accounting

* Fixing wrong strand usage

* add missing return

* Fixing thread accounting

* More scheduler accounting issues

* Fixing various things

* Fixing some stuff

* Fixing some stuff

* Some more subtle bugfixes

* Some cleanup code

* fixing some stuff

* adding some more fixes

* Fixing possible issues

* Fixing missing _storeResult

* Fixing some stuff

* Reducing lambda stack, perhaps fixing hangups

* Fix writeunlocker

* Fixing possible issues

* adding some debugging stuff

* refactor sockets

* possible fixes

* Adding more job guards

* Fixin possible bug

* cleaning up some stuff

* working impl

* Remove debugging output

* Fixing build

* fixing import

* Fixing another bug

* removing debug log

* Removing examples

* Reverting scheduler working code

* Cleanup

* Addressing review comments
2018-05-07 15:58:19 +02:00
Jan 8b9eb13925
forward port branch feature-3.3/allow-jwt-in-arangosh (#5009) 2018-04-03 17:37:01 +02:00
Wilfried Goesgens ef6ccafec4 Use std::this_thread::sleep_for to fix windows compile, mac compile (#4794) 2018-03-09 12:45:26 +01:00
Michael Hackstein 6f42d062be Fixed false http error codes in timeout case. Also reactivated retry in error case. (#4760) (#4770) 2018-03-08 09:19:16 +01:00
Simon 35136a89c0 Fix some problems with active failover (#4540) 2018-02-09 15:11:53 +01:00
Jan b2ceb68205
Feature/small misc optimizations (#4504) 2018-02-08 09:25:07 +01:00
Jan 4e99113bbb
periodically recheck socket validity/connectivity for sockets that loop around poll() every 20s (#4535) 2018-02-08 08:40:52 +01:00
Jan b2b6c06cbf
Feature/efficiency (#3736) 2018-01-05 16:51:31 +01:00
Jan 73bce34174
potentially fix send request timeout (#4026) 2017-12-13 17:54:12 +01:00
Jan 9bcf3c7978
make SimpleHttpClient's SSL connections abortable too (#4023) 2017-12-13 14:03:33 +01:00
Jan 3143805c6f
make replication abortable (#4016) 2017-12-13 12:32:04 +01:00
Matthew Von-Maszewski 7e3dc8afb8 connection timeout treated as "send complete" (#3866) 2017-12-08 11:36:43 +01:00
Jan 282be208cc
remove TRI_usleep and TRI_sleep, and use std::this_thread::sleep_for … (#3817) 2017-12-06 18:43:49 +01:00
Max Neunhöffer 74458d9d34 Add security check in AgencyComm::sendWithFailover. (#3838) 2017-12-06 10:50:40 +01:00
Jan bef52d7dc3
Bug fix/cleanup after cppcheck (#3639) 2017-11-10 13:53:28 +01:00
Simon Grätzer 7c31960cf2 Feature/async failover (#3451) 2017-10-18 23:59:29 +02:00
m0ppers 325065a0c0 show warnings if curl work thread is blocked by error/success cbs (#3316) 2017-09-26 15:12:09 +02:00
m0ppers f0ae3d3de7 Bug fix/uaf communicator request abort (#3216)
* Fix abortion of requests

* Fix progress callback
2017-09-15 22:42:30 +02:00
Jan 5165155ed1 Bug fix/fixes 0609 (#3227)
* do not use V8 variant of AQL functions in early optimization stage when a C++ variant is available

* additionally, simplify AQL function definitions and aliases

* warn when more than 90% of max mappings are in use

* added C++ variant of replication catchup

* added `--log.role` option

* updated CHANGELOG

* removed non-existing scheduler.threads option from config

* removed useless __FILE__, __LINE__ invocations

* updated CHANGELOG

* allow a priority V8 context

* remove TRI_CORE_MEM_ZONE

* try to fix Windows errors & warnings

* cleanup

* removed memory zones altogether

* exclude system collections from collection tests
2017-09-13 16:28:21 +02:00
Kaveh Vahedipour a8304904ed fixed openssl issue with 1.1 (#3095) 2017-08-22 15:50:25 +02:00
Max Neunhöffer f458a7d10a Wow...wrong type for httpstatuscode...it would zero out parts of the mutex on the stack (#2800) 2017-07-14 11:36:30 +02:00
Max Neunhöffer 2f874249bb Bug fix/adjust agency comm timeouts (#2765)
* Take out 503 timeouts altogether.
* Overhaul of AgencyComm::sendWithFailover loop.
* Let performRequests optionally ignore 404 coll not found.
* Fix error message "database not found" when AgencyComm failed.
* Add log entries in Agency if locks are acquired too slowly.
* Reexecute the javascript cluster sync stuff even if there was no plan/current change...So failed sync jobs can retry later...
* Cover callbacks in Communicator by lock. This fixes https://github.com/arangodb/planning/issues/370
* Put in delay in waiting for leader in agency test.
* Schmutz logging to heartbeat topic.
* Add more lock time diagnostic in agent.
* Switch on agencycomm tracing in coordinator.
2017-07-13 00:44:28 +02:00
Jan 6a35df3992 should fix https://github.com/arangodb/planning/issues/413 (#2742) 2017-07-06 18:03:59 +02:00
Andreas Streichardt 7250ba29ee Disable communicator and abort all requests when shutting down 2017-06-08 11:58:15 +02:00
Andreas Streichardt dbf9e61b20 Add more debug output 2017-06-07 14:50:46 +02:00