25 KiB
Arango Actions
@NAVIGATE_UserManualActions @EMBEDTOC{UserManualActionsTOC}
Please note, that user Actions in ArangoDB are still preliminary and details are subject to change.
Introduction to User Actions
In some ways the communication layer of the ArangoDB server behaves like a Web
server. Unlike a Web server, it normally responds to HTTP requests by delivering
JSON objects. Remember, documents in the database are just JSON objects. So,
most of the time the HTTP response will contain a JSON document from the
database as body. You can extract the documents stored in the database using
HTTP GET
. You can store documents using HTTP POST
.
However, there is something more. You can write small sniplets - so called actions - to extend the database. The idea of actions is that sometimes it is better to store parts of the business logic within AnrangoDB.
The simplest example is the age of a person. Assume you store information about people in your database. It is an anti-pattern to store the age, because it changes every now and then. Therefore, you normally store the birthday and let the client decide what to do with it. However, if you have many different clients, it might be easier to enrich the person document with the age using actions once on the server side.
Or, for instance, if you want to apply some statistics to large data-sets and you cannot easily express this as query. You can define a action instead of transferring the whole data to the client and do the computation on the client.
Actions are also useful if you want to restrict and filter data according to some complex permission system.
The ArangoDB server can deliver all kinds of information, JSON being only one possible format. You can also generate HTML or images. However, a Web server is normally better suited for the task as it also implements various caching strategies, language selection, compression and so on. Having said that, there are still situations where it might be suitable to use the ArangoDB to deliver HTML pages - static or dynamic. An simple example is the built-in administration interface. You can access it using any modern browser and there is no need for a separate Apache or IIS.
The following sections will explain actions within ArangoDB and show how to define them. The examples start with delivering static HTML pages - even if this is not the primary use-case for actions. The later sections will then show you how to code some pieces of your business logic and return JSON objects.
The interface is loosely modelled after the JavaScript classes for HTTP request and responses found in node.js and the middleware/routing aspects of connect.js and express.js.
Note that unlike node.js, ArangoDB is multi-threaded and there is no easy way to share state between queries inside the JavaScript engine. If such state information is required, you need to use the database itself.
A Hello World Example
The client API or browser sends a HTTP request to the ArangoDB server and the
server returns a HTTP response to the client. A HTTP request consists of a
method, normally GET
or POST
when using a browser, and a request path like
/hello/world
. For a real Web server there are a zillion of other thing to
consider, we will ignore this for the moment. The HTTP response contains a
content type, describing how to interpret the returned data, and the data
itself.
In the following example, we want to define an action in ArangoDB, so that the server returns the HTML document
<html>
<body>
Hello World
</body>
</html>
if asked GET /hello/world
.
The server needs to know what function to call or what document to deliver if it
receives a request. This is called routing. All the routing information of
ArangoDB is stored in a collection _routing
. Each entry in this collections
describes how to deal with a particular request path.
For the above example, add the following document to the @{_routing} collection:
arangosh> db._routing.save({
........> url: { match: "/hello/world" },
........> content: {
........> contentType: "text/html",
........> body: "<html><body>Hello World</body></html>" }});
In order to activate the new routing, you must either restart the server or call the internal reload function.
arangosh> require("internal").reloadRouting()
Now use the browser and access
http://localhost:8529/hello/world
You should see the Hello World
in our browser.
Matching an URL
There are a lot of options for the url
attribute. If you define different
routing for the same path, then the following simple rule is applied in order to
determine which match wins: If there are two matches, then the more specific
wins. I. e, if there is a wildcard match and an exact match, the exact match is
prefered. If there is a short and a long match, the longer match wins.
Exact Match
If the definition is
{ url: { match: "/hello/world" } }
then the match must be exact. Only the request for /hello/world
will match,
everything else, e. g. /hello/world/my
or /hello/world2
, will not match.
The following definition is a short-cut for an exact match.
{ url: "/hello/world" }
Prefix Match
If the definition is
{ url: { match: "/hello/world/*" } }
then the match can be a prefix match. The requests for /hello/world
,
/hello/world/my
, and /hello/world/how/are/you
will all match. However
/hello/world2
does not match. Prefix matches within an URL part,
i. e. /hello/world*
, are not allowed. The wildcard must occur at the end,
i. e.
/hello/*/world
is also disallowed.
If you define two routes
{ url: { match: "/hello/world/*" } }
{ url: { match: "/hello/world/emil" } }
then the second route will be used for /hello/world/emil
because it is more
specific.
Parameterized Match
A parameterized match is similar to a prefix match, but the parameters are also allowed inside the URL path.
If the definition is
{ url: { match: "/hello/:name/world" } }
then the URL must have three parts, the first part being hello
and the third
part world
. For example, /hello/emil/world
will match, while
/hello/emil/meyer/world
will not.
Constraint Match
A constraint match is similar to a parameterized match, but the parameters can carry constraints.
If the definition is
{ url: { match: "/hello/:name/world", constraint: { name: "/[a-z]+/" } }
then the URL must have three parts, the first part being hello
and the third
part world
. The second part must be all lowercase.
It is possible to use more then one constraint for the same URL part.
{ url: { match: "/hello/:name|:id/world",
constraint: { name: "/[a-z]+/", id: "/[0-9]+/" } }
Optional Match
An optional match is similar to a parameterized match, but the last parameter is optional.
If the definition is
{ url: { match: "/hello/:name?", constraint: { name: "/[a-z]+/" } }
then the URL /hello
and /hello/emil
will match.
If the definitions are
{ url: { match: "/hello/world" } }
{ url: { match: "/hello/:name", constraint: { name: "/[a-z]+/" } }
{ url: { match: "/hello/*" } }
then the URL /hello/world
will be matched by the first route, because it is
the most specific. The URL /hello/you
will be matched by the second route,
because it is more specific than the prefix match.
Method Restriction
You can restrict the match to specific methods.
If the definition is
{ url: { match: "/hello/world", methods: [ "post", "put" ] }
then only POST
and PUT
requests will match.
More on Matching
Remember that the more specific match wins.
- A match without parameter or wildcard is more specific than a match with parameters or wildcard.
- A match with parameter is more specific than a match with a wildcard.
- If there is more than one parameter, specificity is applied from left to right.
Consider the following definitions
(1) { url: { match: "/hello/world" } }
(2) { url: { match: "/hello/:name", constraint: { name: "/[a-z]+/" } }
(3) { url: { match: "/:something/world" }
(4) { url: { match: "/hello/*" } }
Then
/hello/world
is match by (1)/hello/emil
is match by (2)/your/world
is match by (3)/hello/you
is match by (4)
You can write the following document into the _routing
collection
to test the above examples.
{
routes: [
{ url: { match: "/hello/world" }, content: "route 1" },
{ url: { match: "/hello/:name|:id", constraint: { name: "/[a-z]+/", id: "/[0-9]+/" } }, content: "route 2" },
{ url: { match: "/:something/world" }, content: "route 3" },
{ url: { match: "/hello/*" }, content: "route 4" },
]
}
A Hello World Example for JSON
If you change the example slightly, then a JSON object will be delivered.
arangosh> db._routing.save({
........> url: "/hello/json",
........> content: {
........> contentType: "application/json",
........> body: "{ \"hello\" : \"world\" }" }});
arangosh> require("internal").reloadRouting()
Again check with your browser
http://localhost:8529/hello/json
Depending on your browser and installed add-ons you will either see the JSON
object or a download dialog. If your browser wants to open an external
application to display the JSON object, you can change the contentType
to
"text/plain"
for the example. This makes it easier to check the example using
a browser. Or use curl
to access the server.
bash> curl "http://127.0.0.1:8529/hello/json" && echo
{ "hello" : "world" }
Delivering Content
There are a lot of different ways on how to deliver content. We have already seen the simplest one, where static content is delivered. The fun, however, starts when delivering dynamic content.
Static Content
You can specify a body and a content-type.
{ content: {
contentType: "text/html",
body: "<html><body>Hallo World</body></html>"
}
}
If the content type is text/plain
then you can use the short-cut
{ content: "Hallo World" }
A Simple Action
The simplest dynamic action is:
{ action: { do: "org/arangodb/actions/echoRequest" } }
It is not possible to store functions directly in the routing table, but you can call functions defined in modules. In the above example the function can be accessed from JavaScript as:
require("org/arangodb/actions").echoRequest
The function echoRequest
is pre-defined. It takes the request objects and
echos it in the response.
The signature of such a function must be
function (req, res, options, next)
For example
arangosh> db._routing.save({
........> url: "/hello/echo",
........> action: { do: "org/arangodb/actions/echoRequest" } });
Reload the routing and check
http://127.0.0.1:8529/hello/echo
You should see something like
{
"request": {
"path": "/hello/echo",
"headers": {
"accept-encoding": "gzip, deflate",
"accept-language": "de-de,de;q=0.8,en-us;q=0.5,en;q=0.3",
"connection": "keep-alive",
"content-length": "0",
"host": "localhost:8529",
"user-agent": "Mozilla/5.0 (X11; Linux x86_64; rv:15.0) Gecko/20100101 Firefox/15.0"
},
"requestType": "GET",
"parameters": { }
},
"options": { }
}
The request might contain path
, prefix
, suffix
, and urlParameters
attributes. path
is the complete path as supplied by the user and always
available. If a prefix was matched, then this prefix is stored in the attribute
prefix
and the remaining URL parts are stored as an array in suffix
. If one
or more parameters were matched, then the parameter values are stored in
urlParameters
.
For example, if the url description is
{ url: { match: "/hello/:name/:action" } }
and you request the path /hello/emil/jump
, then the request object
will contain the following attribute
urlParameters: { name: "emil", action: "jump" } }
Action Controller
As an alternative to the simple action, you can use controllers. A
controller is a module, defines the function get
, put
,
post
, delete
, head
, patch
. If a request of
the corresponding type is matched, the function will be called.
For example
arangosh> db._routing.save({
........> url: "/hello/echo",
........> action: { controller: "org/arangodb/actions/echoController" } });
Prefix Action Controller
The controller is selected when the definition is read. There is a more flexible, but slower and maybe insecure variant, the prefix controller.
Assume that the url is a prefix match
{ url: { match: /hello/*" } }
You can use
{ action: { prefixController: "org/arangodb/actions" } }
to define a prefix controller. If the URL /hello/echoController
is
given, then the module org/arangodb/actions/echoController
is used.
If you use an prefix controller, you should make certain that no unwanted actions are available under the prefix.
The definition
{ action: "org/arangodb/actions" }
is a short-cut for a prefix controller definition.
Requests and Responses
The controller must define handler functions which take a request object and fill the response object.
A very simple example is the function echoRequest
defined in
the module org/arangodb/actions
.
function (req, res, options, next) {
var result;
result = { request: req, options: options };
res.responseCode = exports.HTTP_OK;
res.contentType = "application/json";
res.body = JSON.stringify(result);
}
Install it as
arangosh> db._routing.save({
........> url: "/echo",
........> action: { controller: "org/arangodb/actions", do: "echoRequest" } });
Reload the routing and check
http://127.0.0.1:8529/hello/echo
You should see something like
{
"request": {
"prefix": "/hello/echo",
"suffix": [
"hello",
"echo"
],
"path": "/hello/echo",
"headers": {
"accept-encoding": "gzip, deflate",
"accept-language": "de-de,de;q=0.8,en-us;q=0.5,en;q=0.3",
"connection": "keep-alive",
"content-length": "0",
"host": "localhost:8529",
"user-agent": "Mozilla/5.0 (X11; Linux x86_64; rv:15.0) Gecko/20100101 Firefox/15.0"
},
"requestType": "GET",
"parameters": { }
},
"options": { }
}
You may also pass options to the called function:
arangosh> db._routing.save({
........> url: "/echo",
........> action: {
........> controller: "org/arangodb/actions",
........> do: "echoRequest",
........> options: { "Hallo": "World" } } });
You should now see the options in the result.
{
"request": {
...
},
"options": {
"Hallo": "World"
}
}
Modifying Request and Response
As we've seen in the previous examples, actions get called with the request and
response objects (named req
and res
in the examples) passed as parameters to
their handler functions.
The req
object contains the incoming HTTP request, which might or might not
have been modified by a previous action (if actions were chained).
A handler can modify the request object in place if desired. This might be useful when writing middleware (see below) that is used to intercept incoming requests, modify them and pass them to the actual handlers.
While modifying the request object might not be that relevant for non-middleware actions, modifying the response object definitely is. Modifying the response object is an action's only way to return data to the caller of the action.
We've already seen how to set the HTTP status code, the content type, and the
result body. The res
object has the following properties for these:
- contentType: MIME type of the body as defined in the HTTP standard (e.g.
text/html
,text/plain
,application/json
, ...) - responsecode: the HTTP status code of the response as defined in the HTTP
standard. Common values for actions that succeed are
200
or201
. Please refer to the HTTP standard for more information. - body: the actual response data
To set or modify arbitrary headers of the response object, the headers
property can be used. For example, to add a user-defined header to the response,
the following code will do:
res.headers = res.headers || { }; // headers might or might not be present
res.headers['X-Test'] = 'someValue'; // set header X-Test to "someValue"
This will set the additional HTTP header X-Test
to value someValue
. Other
headers can be set as well. Note that ArangoDB might change the case of the
header names to lower case when assembling the overall response that is sent to
the caller.
It is not necessary to explicitly set a Content-Length
header for the response
as ArangoDB will calculate the content length automatically and add this header
itself. ArangoDB might also add a Connection
header itself to handle HTTP
keep-alive.
ArangoDB also supports automatic transformation of the body data to another format. Currently, the only supported transformations are base64-encoding and base64-decoding. Using the transformations, an action can create a base64 encoded body and still let ArangoDB send the non-encoded version, for example:
res.body = 'VGhpcyBpcyBhIHRlc3Q=';
res.transformations = res.transformations || [ ]; // initialise
res.transformations.push('base64decode'); // will base64 decode the response body
When ArangoDB processes the response, it will base64-decode what's in res.body
and set the HTTP header Content-Encoding: binary
. The opposite can be achieved
with the base64encode
transformation: ArangoDB will then automatically
base64-encode the body and set a Content-Encoding: base64
HTTP header.
Writing dynamic action handlers
To write your own dynamic action handlers, you must put them into modules.
Modules are a means of organising action handlers and making them loadable under specific names.
To start, we'll define a simple action handler in a module /own/test
:
arangosh> db._modules.save({
........> path: "/own/test",
........> content: "exports.do = function(req, res, options, next) { res.body = 'test'; res.responseCode = 200; res.contentType = 'text/html'; };",
........> autoload: true });
This does nothing but register a do action handler in a module /own/test
. The
action handler is not yet callable, but must be mapped to a route first. To map
the action to the route /ourtest
, execute the following command:
arangosh> db._routing.save({
........> url: "/ourtest",
........> action: { controller: "/own/test" } });
In order to see the module in action, you must either restart the server or call the internal reload function.
arangosh> require("internal").reloadRouting()
Now use the browser and access
http://localhost:8529/ourtest
You will see that the module's do function has been executed.
Advanced Usages
For detailed information see the reference manual.
Redirects
Use the following for a permanent redirect:
arangosh> db._routing.save({
........> url: "/",
........> action: {
........> controller: "org/arangodb/actions",
........> do: "redirectRequest",
........> options: {
........> permanently: true,
........> destination: "http://somewhere.else/" } } });
Routing Bundles
Instead of adding all routes for package separately, you can specify a bundle.
{
routes: [
{ url: "/url1", content: "..." },
{ url: "/url2", content: "..." },
{ url: "/url3", content: "..." },
...
]
}
The advantage is, that you can put all your routes into one document and use a common prefix.
{
urlPrefix: "/test",
routes: [
{ url: "/url1", content: "..." },
{ url: "/url2", content: "..." },
{ url: "/url3", content: "..." },
...
]
}
will define the URL /test/url1
, /test/url2
, and /test/url3
.
Writing Middleware
Assume, you want to log every request. In this case you can easily define an
action for the whole url-space /
. This action simply logs the requests, calls
the next in line, and logs the response.
exports.logRequest = function (req, res, options, next) {
console.log("received request: %s", JSON.stringify(req));
next();
console.log("produced response: %s", JSON.stringify(res));
};
This function is available as org/arangodb/actions/logRequest
. You need to
tell ArangoDB that it is should use a prefix match and that the shortest match
should win in this case:
arangosh> db._routing.save({
........> middleware: [
........> { url: { match: "/*" }, action: { controller: "org/arangodb/actions", do: "logRequest" } }
........> ]
........> });
Application Deployment
Using single routes or @ref UserManualActionsAdvancedBundles "bundles" can be
become a bit messy in large applications. Therefore a deployment tool exists
inside arangosh to simplify the task. This tool was inspired by the ArangoDB
deployment tool https://github.com/kaerus/arangodep
written in node.js by
kaerus.
An application is a bunch of routes, static pages stored in collections, and small scriptlets stored modules.
In order to create an application, chose a suitable name, e. g. reverse domain
name plus the application name and call createApp
:
arangosh> var deploy = require("org/arangodb/deploy");
arangosh> var app = deploy.createApp("org.example.simple");
Normally content will either be stored in collections or dynamically calculated. But sometimes it is convenient to store the content directly in the routing table, e. g. to deliver a version number.
arangosh> app.mountStaticContent("/version", {
........> version: "1.2.3", major: 1, minor: 2, patch: 3 });
[ArangoApp "org.example.simple" at ""]
Save the application
arangosh> app.save();
[ArangoApp "org.example.simple" at ""]
and use the browser to check the result
http://localhost:8529/version
You can also specify the content-type
arangosh> app.mountStaticContent("/author",
........> "Frank Celler",
........> "text/plain").save();
[ArangoApp "org.example.simple" at ""]
and check at
http://localhost:8529/author
If you have more than one application, putting version under /
might lead to
conflicts. It is therefore possible to use a common prefix for the application.
arangosh> app.setPrefix("/example").save();
[ArangoApp "org.example.simple" at "/example"]
Now check
http://localhost:8529/example/version
http://localhost:8529/example/author
Deploying Static Pages
Most of the time, static html pages and JavaScript content will be delivered by your web-server. But sometimes it is convenient to deliver these directly from within ArangoDB. For example, to provide a small admin interface for you application.
Assume that all data is stored underneath a directory "/tmp/example" and we want to store the content in a collection "org_example_simple_content".
First connect the url path to the collection.
arangosh> app.mountStaticPages("/static", "org_example_simple_content").save();
[ArangoApp "org.example.simple" at "/example"]
Next create a file index.html
at `/tmp/example/index.html".
<html>
<body>
Hallo World!
</body>
</html>
Create the collection and upload this into the collection
arangosh> require("org/arangodb").db._createDocumentCollection("org_example_simple_content");
[ArangoCollection 224910918055, "org_example_simple_content" (type document, status loaded)]
arangosh> app.uploadStaticPages("/static", "/tmp/example");
imported '/index.html' of type 'text/html; charset=utf-8'
[ArangoApp "org.example.simple" at "/example"]
Check the index file
http://localhost:8529/example/static/index.html