1
0
Fork 0
arangodb/Documentation/Books/AQL/Examples/DataModificationQueries.mdpp

268 lines
7.7 KiB
Plaintext

Data-modification queries
=========================
The following operations can be used to modify data of multiple documents
with one query. This is superior to fetching and updating the documents individually
with multiple queries. However, if only a single document needs to be modified,
ArangoDB's specialized data-modification operations for single documents
might execute faster.
Updating documents
------------------
To update existing documents, we can either use the *UPDATE* or the *REPLACE*
operation. *UPDATE* updates only the specified attributes in the found documents,
and *REPLACE* completely replaces the found documents with the specified values.
We'll start with an *UPDATE* query that rewrites the gender attribute in all
documents:
```js
FOR u IN users
UPDATE u WITH { gender: TRANSLATE(u.gender, { m: 'male', f: 'female' }) } IN users
```
To add new attributes to existing documents, we can also use an *UPDATE* query.
The following query adds an attribute *numberOfLogins* for all users with status
active:
```js
FOR u IN users
FILTER u.active == true
UPDATE u WITH { numberOfLogins: 0 } IN users
```
Existing attributes can also be updated based on their previous value:
```js
FOR u IN users
FILTER u.active == true
UPDATE u WITH { numberOfLogins: u.numberOfLogins + 1 } IN users
```
The above query will only work if there was already a *numberOfLogins* attribute
present in the document. If it is unsure whether there is a *numberOfLogins*
attribute in the document, the increase must be made conditional:
```js
FOR u IN users
FILTER u.active == true
UPDATE u WITH {
numberOfLogins: HAS(u, 'numberOfLogins') ? u.numberOfLogins + 1 : 1
} IN users
```
Updates of multiple attributes can be combined in a single query:
```js
FOR u IN users
FILTER u.active == true
UPDATE u WITH {
lastLogin: DATE_NOW(),
numberOfLogins: HAS(u, 'numberOfLogins') ? u.numberOfLogins + 1 : 1
} IN users
```
Note than an update query might fail during execution, for example because a
document to be updated does not exist. In this case, the query will abort at
the first error. In single-server mode, all modifications done by the query will
be rolled back as if they never happened.
Replacing documents
-------------------
To not just partially update, but completely replace existing documents, use
the *REPLACE* operation.
The following query replaces all documents in the collection backup with
the documents found in collection users. Documents common to both
collections will be replaced. All other documents will remain unchanged.
Documents are compared using their *_key* attributes:
```js
FOR u IN users
REPLACE u IN backup
```
The above query will fail if there are documents in collection users that are
not in collection backup yet. In this case, the query would attempt to replace
documents that do not exist. If such case is detected while executing the query,
the query will abort. In single-server mode, all changes made by the query will
also be rolled back.
To make the query succeed for such case, use the *ignoreErrors* query option:
```js
FOR u IN users
REPLACE u IN backup OPTIONS { ignoreErrors: true }
```
Removing documents
------------------
Deleting documents can be achieved with the *REMOVE* operation.
To remove all users within a certain age range, we can use the following query:
```js
FOR u IN users
FILTER u.active == true && u.age >= 35 && u.age <= 37
REMOVE u IN users
```
Creating documents
------------------
To create new documents, there is the *INSERT* operation.
It can also be used to generate copies of existing documents from other collections,
or to create synthetic documents (e.g. for testing purposes). The following
query creates 1000 test users in collection users with some attributes set:
```js
FOR i IN 1..1000
INSERT {
id: 100000 + i,
age: 18 + FLOOR(RAND() * 25),
name: CONCAT('test', TO_STRING(i)),
active: false,
gender: i % 2 == 0 ? 'male' : 'female'
} IN users
```
Copying data from one collection into another
---------------------------------------------
To copy data from one collection into another, an *INSERT* operation can be
used:
```js
FOR u IN users
INSERT u IN backup
```
This will copy over all documents from collection users into collection
backup. Note that both collections must already exist when the query is
executed. The query might fail if backup already contains documents, as
executing the insert might attempt to insert the same document (identified
by *_key* attribute) again. This will trigger a unique key constraint violation
and abort the query. In single-server mode, all changes made by the query
will also be rolled back.
To make such copy operation work in all cases, the target collection can
be emptied before, using a *REMOVE* query.
Handling errors
---------------
In some cases it might be desirable to continue execution of a query even in
the face of errors (e.g. "document not found"). To continue execution of a
query in case of errors, there is the *ignoreErrors* option.
To use it, place an *OPTIONS* keyword directly after the data modification
part of the query, e.g.
```js
FOR u IN users
REPLACE u IN backup OPTIONS { ignoreErrors: true }
```
This will continue execution of the query even if errors occur during the
*REPLACE* operation. It works similar for *UPDATE*, *INSERT*, and *REMOVE*.
Altering substructures
----------------------
To modify lists in documents we have to work with temporary variables.
We will collect the sublist in there and alter it. We choose a simple
boolean filter condition to make the query better comprehensible.
First lets create a collection with a sample:
```js
database = db._create('complexCollection')
database.save({
"topLevelAttribute" : "a",
"subList" : [
{
"attributeToAlter" : "oldValue",
"filterByMe" : true
},
{
"attributeToAlter" : "moreOldValues",
"filterByMe" : true
},
{
"attributeToAlter" : "unchangedValue",
"filterByMe" : false
}
]
})
```
Heres the Query which keeps the *subList* on *alteredList* to update it later:
```js
FOR document in complexCollection
LET alteredList = (
FOR element IN document.subList
LET newItem = (! element.filterByMe ?
element :
MERGE(element, { attributeToAlter: "shiny New Value" }))
RETURN newItem)
UPDATE document WITH { subList: alteredList } IN complexCollection
```
While the query as it is is now functional:
```js
db.complexCollection.toArray()
[
{
"_id" : "complexCollection/392671569467",
"_key" : "392671569467",
"_rev" : "392799430203",
"topLevelAttribute" : "a",
"subList" : [
{
"filterByMe" : true,
"attributeToAlter" : "shiny New Value"
},
{
"filterByMe" : true,
"attributeToAlter" : "shiny New Value"
},
{
"filterByMe" : false,
"attributeToAlter" : "unchangedValue"
}
]
}
]
```
It will probably be soonish a performance bottleneck, since it **modifies**
all documents in the collection **regardless whether the values change or not**.
Therefore we want to only *UPDATE* the documents if we really change their value.
Hence we employ a second *FOR* to test whether *subList* will be altered or not:
```js
FOR document in complexCollection
LET willUpdateDocument = (
FOR element IN docToAlter.subList
FILTER element.filterByMe LIMIT 1 RETURN 1)
FILTER LENGTH(willUpdateDocument) > 0
LET alteredList = (
FOR element IN document.subList
LET newItem = (! element.filterByMe ?
element :
MERGE(element, { attributeToAlter: "shiny New Value" }))
RETURN newItem)
UPDATE document WITH { subList: alteredList } IN complexCollection
```