mirror of https://gitee.com/bigwinds/arangodb

added readme

This commit is contained in:
parent 49cc12e6a9
commit 13905fcd84

@@ -16,6 +16,8 @@
 * [Coming from SQL](GettingStarted/ComingFromSql.md)
 # * [Coming from MongoDB](GettingStarted/ComingFromMongoDb.md) #TODO
 #
+* [StorageEngines](StorageEngines/README.md)
+#
 * [Scalability](Scalability/README.md)
 * [Architecture](Scalability/Architecture.md)
 * [Data models](Scalability/DataModels.md)

@@ -0,0 +1,147 @@

# Storage Engines

At the very bottom of the ArangoDB database lies the storage
engine. The storage engine is responsible for persisting the documents
on disk, holding copies in memory, and providing indexes and caches to
speed up queries.

Up to version 3.1, ArangoDB only supported memory-mapped files (MMFILES)
as its sole storage engine. Beginning with 3.2, ArangoDB supports
pluggable storage engines. The second supported engine is RocksDB from
Facebook.

RocksDB is an embeddable persistent key-value store. It is a
log-structured database and is optimized for fast storage.

The MMFILES engine is optimized for the use case where the data fits
into main memory. It allows for very fast concurrent
reads. However, writes block reads, and locking is on the collection
level. Indexes are always held in memory and are rebuilt on startup. This
gives better performance but imposes a longer startup time.

The ROCKSDB engine is optimized for large data sets and allows for
steady insert performance even if the data set is much larger than the
main memory. Indexes are always stored on disk, but caches are used to
speed up performance. RocksDB uses document-level locks, allowing for
concurrent writes. Writes do not block reads.

The engine must be selected for the whole server / cluster. It is not
possible to mix engines. The transaction handling and write-ahead log
formats of the two engines are very different and cannot be combined.
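
As a sketch, assuming the startup option introduced with 3.2, the engine is
selected when the server is started:

    arangod --server.storage-engine rocksdb

From arangosh, the active engine can then be inspected:

    // arangosh: report the storage engine the server is running
    db._engine().name;   // "mmfiles" or "rocksdb"
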
## RocksDB

### Advantages

The main advantages of RocksDB are:

- document-level locks
- support for large data sets
- persistent indexes

RocksDB is a very flexible engine that can be configured for various use cases.

### Caveats

RocksDB allows concurrent writes. However, when touching the same document at
the same time, a write conflict is raised. This cannot happen with the MMFILES
engine, so applications that switch to ROCKSDB need to be prepared for such
exceptions. It is possible to lock collections exclusively when executing AQL;
this avoids write conflicts but also inhibits concurrent writes.
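
As a minimal arangosh sketch of how an application can cope with such
conflicts, assuming a hypothetical collection `accounts` (a write conflict
surfaces as error 1200, `ERROR_ARANGO_CONFLICT`):

    // Retry an AQL update when a write conflict is raised.
    var db = require("@arangodb").db;
    var errors = require("@arangodb").errors;

    function updateWithRetry(maxRetries) {
      for (var i = 0; i < maxRetries; i++) {
        try {
          db._query(
            "FOR a IN accounts FILTER a.balance < 0 " +
            "UPDATE a WITH { flagged: true } IN accounts");
          return true;   // committed without a conflict
        } catch (e) {
          if (e.errorNum !== errors.ERROR_ARANGO_CONFLICT.code) {
            throw e;     // a different error, do not swallow it
          }
          // another writer touched the same document, try again
        }
      }
      return false;      // still conflicting after maxRetries attempts
    }
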
Currently, another restriction is due to the transaction handling in
RocksDB. Transactions are limited in total size. If you have a statement
modifying a lot of documents, it is necessary to commit data in between.
For AQL, this is done automatically by default.
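
The thresholds at which these intermediate commits happen can be tuned;
assuming the option names shipped with 3.2, the relevant knobs are:

    --rocksdb.intermediate-commit-size   (commit after this many bytes)
    --rocksdb.intermediate-commit-count  (commit after this many operations)
    --rocksdb.max-transaction-size       (upper bound for a single transaction)
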
### Performance

RocksDB is based on a log-structured merge tree (LSM tree). A good introduction
can be found in:

- http://www.benstopford.com/2015/02/14/log-structured-merge-trees/
- https://blog.acolyer.org/2014/11/26/the-log-structured-merge-tree-lsm-tree/

The basic idea is that data is organized in levels, where each level is a factor
larger than the previous one. New data resides in the smaller levels, while old
data is moved down to the larger levels. This makes it possible to sustain a
high rate of inserts over an extended period. In principle, the different levels
can reside on different storage media: the smaller ones on fast SSDs, the larger
ones on bigger spinning disks.

RocksDB itself provides a lot of different knobs to fine-tune the storage
engine according to your use case. ArangoDB supports the most common ones
using the options below.

Performance reports for the storage engine can be found here:

- https://github.com/facebook/rocksdb/wiki/performance-benchmarks
- https://github.com/facebook/rocksdb/wiki/RocksDB-Tuning-Guide

### ArangoDB options

ArangoDB has a cache for the persistent indexes. The size of this cache
is controlled by the option

    --cache.size

RocksDB also has a cache for the blocks stored on disk. The size of
this cache is controlled by the option

    --rocksdb.block-cache-size M

ArangoDB distributes the available memory equally between the two
caches.
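
As an illustration with hypothetical values, giving each cache 1GB explicitly
would look like:

    arangod --cache.size 1073741824 --rocksdb.block-cache-size 1073741824
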
ArangoDB chooses a size for the various levels in RocksDB that is
suitable for general-purpose applications.

RocksDB's log-structured data levels have increasing size:

    MEM: --
    L0:  --
    L1:  --  --
    L2:  --  --  --  --
    ...

New or updated documents are first stored in memory. If this memtable
reaches the limit given by

    --rocksdb.write-buffer-size N

it will be converted to an SST file and inserted at level 0.
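
For illustration, with the hypothetical setting

    --rocksdb.write-buffer-size 67108864

the memtable is flushed to an SST file once it reaches 64MB.
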
The following options control the size and depth of the levels.

    --rocksdb.num-levels N

Limits the number of levels to N. By default it is 7, and there is
seldom a reason to change this. A new level is only opened if there is
too much data in the previous one.

    --rocksdb.max-bytes-for-level-base B

L1 will hold at most B bytes.

    --rocksdb.max-bytes-for-level-multiplier M

Each level is at most M times as many bytes as the previous
one. Therefore the maximum number of bytes for level L can be
calculated as

    max-bytes-for-level-base * (max-bytes-for-level-multiplier ^ (L-1))
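
As a worked example with illustrative values, a base of 256MB and a multiplier
of 10 yield:

    L1: 256MB
    L2: 256MB * 10   = 2,560MB
    L3: 256MB * 10^2 = 25,600MB
    L4: 256MB * 10^3 = 256,000MB
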
## Future

RocksDB imposes a limit on the transaction size. It is optimized to
handle small transactions very efficiently, but limits the total
size of a transaction.

Currently, we are solely using RocksDB transactions to implement the
ArangoDB transaction handling when using the ROCKSDB engine. Therefore
the same restrictions apply there.

We will improve this by introducing distributed transactions in
ArangoDB. This will allow handling a large transaction as a series of
small RocksDB transactions and hence remove the size restriction.