1
0
Fork 0
arangodb/Documentation/DocuBlocks/Rest/Views/post_api_view_arangosearch.md

8.0 KiB

@startDocuBlock post_api_view_arangosearch @brief creates an ArangoSearch View

@RESTHEADER{POST /_api/view#arangosearch, Create an ArangoSearch View, createView}

@RESTBODYPARAM{name,string,required,string} The name of the View.

@RESTBODYPARAM{type,string,required,string} The type of the View. Must be equal to "arangosearch". This option is immutable.

@RESTBODYPARAM{links,object,optional,} Expects an object with the attribute keys being names of to be linked collections, and the link properties as attribute values. See ArangoSearch View Link Properties for details.

@RESTBODYPARAM{primarySort,array,optional,object} A primary sort order can be defined to enable an AQL optimization. If a query iterates over all documents of a View, wants to sort them by attribute values and the (left-most) fields to sort by as well as their sorting direction match with the primarySort definition, then the SORT operation is optimized away. This option is immutable.
Expects an array of objects, each specifying a field (attribute path) and a sort direction ("asc for ascending, "desc" for descending): [ { "field": "attr", "direction": "asc"}, … ]

@RESTBODYPARAM{cleanupIntervalStep,integer,optional,int64} Wait at least this many commits between removing unused files in the ArangoSearch data directory (default: 2, to disable use: 0). For the case where the consolidation policies merge segments often (i.e. a lot of commit+consolidate), a lower value will cause a lot of disk space to be wasted. For the case where the consolidation policies rarely merge segments (i.e. few inserts/deletes), a higher value will impact performance without any added benefits.
Background: With every "commit" or "consolidate" operation a new state of the View internal data-structures is created on disk. Old states/snapshots are released once there are no longer any users remaining. However, the files for the released states/snapshots are left on disk, and only removed by "cleanup" operation.

@RESTBODYPARAM{commitIntervalMsec,integer,optional,int64} Wait at least this many milliseconds between committing View data store changes and making documents visible to queries (default: 1000, to disable use: 0). For the case where there are a lot of inserts/updates, a lower value, until commit, will cause the index not to account for them and memory usage would continue to grow. For the case where there are a few inserts/updates, a higher value will impact performance and waste disk space for each commit call without any added benefits.
Background: For data retrieval ArangoSearch Views follow the concept of "eventually-consistent", i.e. eventually all the data in ArangoDB will be matched by corresponding query expressions. The concept of ArangoSearch View "commit" operation is introduced to control the upper-bound on the time until document addition/removals are actually reflected by corresponding query expressions. Once a "commit" operation is complete all documents added/removed prior to the start of the "commit" operation will be reflected by queries invoked in subsequent ArangoDB transactions, in-progress ArangoDB transactions will still continue to return a repeatable-read state.

@RESTBODYPARAM{consolidationIntervalMsec,integer,optional,int64} Wait at least this many milliseconds between applying 'consolidationPolicy' to consolidate View data store and possibly release space on the filesystem (default: 10000, to disable use: 0). For the case where there are a lot of data modification operations, a higher value could potentially have the data store consume more space and file handles. For the case where there are a few data modification operations, a lower value will impact performance due to no segment candidates available for consolidation.
Background: For data modification ArangoSearch Views follow the concept of a "versioned data store". Thus old versions of data may be removed once there are no longer any users of the old data. The frequency of the cleanup and compaction operations are governed by 'consolidationIntervalMsec' and the candidates for compaction are selected via 'consolidationPolicy'.

@RESTBODYPARAM{consolidationPolicy,object,optional,} The consolidation policy to apply for selecting which segments should be merged (default: {})
Background: With each ArangoDB transaction that inserts documents one or more ArangoSearch internal segments gets created. Similarly for removed documents the segments that contain such documents will have these documents marked as 'deleted'. Over time this approach causes a lot of small and sparse segments to be created. A "consolidation" operation selects one or more segments and copies all of their valid documents into a single new segment, thereby allowing the search algorithm to perform more optimally and for extra file handles to be released once old segments are no longer used.
Sub-properties:

  • type (string, optional): The segment candidates for the "consolidation" operation are selected based upon several possible configurable formulas as defined by their types. The currently supported types are:
    • "tier" (default): consolidate based on segment byte size and live document count as dictated by the customization attributes. If this type is used, then below segments* and minScore properties are available.
    • "bytes_accum": consolidate if and only if {threshold} > (segment_bytes + sum_of_merge_candidate_segment_bytes) / all_segment_bytes i.e. the sum of all candidate segment byte size is less than the total segment byte size multiplied by the {threshold}. If this type is used, then below threshold property is available.
  • threshold (number, optional): value in the range [0.0, 1.0]
  • segmentsBytesFloor (number, optional): Defines the value (in bytes) to treat all smaller segments as equal for consolidation selection (default: 2097152)
  • segmentsBytesMax (number, optional): Maximum allowed size of all consolidated segments in bytes (default: 5368709120)
  • segmentsMax (number, optional): The maximum number of segments that will be evaluated as candidates for consolidation (default: 10)
  • segmentsMin (number, optional): The minimum number of segments that will be evaluated as candidates for consolidation (default: 1)
  • minScore (number, optional): (default: 0)

@RESTBODYPARAM{writebufferIdle,integer,optional,int64} Maximum number of writers (segments) cached in the pool (default: 64, use 0 to disable, immutable)

@RESTBODYPARAM{writebufferActive,integer,optional,int64} Maximum number of concurrent active writers (segments) that perform a transaction. Other writers (segments) wait till current active writers (segments) finish (default: 0, use 0 to disable, immutable)

@RESTBODYPARAM{writebufferSizeMax,integer,optional,int64} Maximum memory byte size per writer (segment) before a writer (segment) flush is triggered. 0 value turns off this limit for any writer (buffer) and data will be flushed periodically based on the value defined for the flush thread (ArangoDB server startup option). 0 value should be used carefully due to high potential memory consumption (default: 33554432, use 0 to disable, immutable)

@RESTDESCRIPTION Creates a new View with a given name and properties if it does not already exist.

@RESTRETURNCODES

@RESTRETURNCODE{400} If the name or type attribute are missing or invalid, then an HTTP 400 error is returned.

@RESTRETURNCODE{409} If a View called name already exists, then an HTTP 409 error is returned.

@EXAMPLES

@EXAMPLE_ARANGOSH_RUN{RestViewPostViewArangoSearch} var url = "/_api/view"; var body = { name: "testViewBasics", type: "arangosearch" };

var response = logCurlRequest('POST', url, body);

assert(response.code === 201);

logJsonResponse(response);

db._flushCache();
db._dropView("testViewBasics");

@END_EXAMPLE_ARANGOSH_RUN @endDocuBlock