Indexes

Source

When you create an index, you specify its source, which is one or more collections of documents. Once the index is active, queries that create, update, or delete a document in the source collections causes the index to be updated.

Terms

An index can define terms: these are zero or more term objects that define which document fields to use for searching.

terms are comparable to column=value predicates in an SQL WHERE clause. For example, if your documents have a name field, you can define terms to include that field, and then you can find all of the documents that match a name.

Only scalar Values are indexed. When a term targets a document field or index binding result that has an array, one index entry per array item is created, which makes it easy to search for an array item. Objects are not indexed. As a result, it is not possible to search for arrays or objects.

Be aware that when the terms definition includes multiple array fields, the number of index entries created is the Cartesian product of the number of array items. For example, when an index terms definition specifies two fields that are arrays, and a document is created including one array with 5 items and the second array with 11 items, 55 index entries are created. Index write operations are grouped together, so the billing impact depends on the overall size of the index entries.

When an index has one or more terms, the index is partitioned by the terms, allowing Fauna to efficiently scale indexes.

When a document is indexed, and all of the index’s defined terms evaluate to null, no index entry is stored for the document.

Values

An index can define values: these are zero or more scalar Values returned for each index entry that matches the terms when you query the index. values are comparable to the SQL SELECT clause.

values are also how indexes are sorted: each field value in values is sorted lexically according to the field type, and the order can be inverted by setting reverse: true.

Each index entry records the Reference of each document involved in the index. When there is no values definition, the index returns the Reference for each matching index entry. When values is defined, only the defined field values are returned.

When a document is indexed, and all of the index’s defined values evaluate to null, no index entry is stored for the document.

Values must refer to fields with scalar Values. Objects are not indexed, so when a values definition points to a document field or index binding result that has an Object, the index entry stores null because Objects cannot be sorted. When a values definition points to a document field or index binding result that has an Array, one index entry per array item is created.

Collection index

An index with no terms and values defined is known as a collection index: searching for documents is not possible, and all documents in the collection are included in the result set, and are sorted by their reference in ascending order.

Unique

You can use unique: true in an index definition. When you do, the index has only one entry for a document with the defined terms and values. Subsequently, creating, or updating, a document to have the same terms and values as an existing document would cause an error.

Avoid creating a unique index that does not define terms.

If you do create a "term-less" index, the index could cause performance issues. Every time a covered document is created or updated, the index (and its history) needs to be evaluated to decide whether the document is unique or not. As the index grows larger, the evaluation for uniqueness can cause your queries involving writes to exceed the query timeout.

Bindings

Fauna indexes can also specify bindings, which are Lambda functions that you can use to compute fields in the terms or values. For example, if your documents store a timestamp, and you want to search for documents by year, you could write a binding that converts the timestamp to a year and include the computed year as one of the terms. Similarly, if you want to report the month of a document, you could write a binding that converts the timestamp to a month, and include the computed month as one of the values.

For background, a Set is a sorted group of immutable data from a collection. An Index is a group of sets in a collection. Indexes are defined as documents in the system indexes collection.

Index details

Example index

The simplest index is called a "collection" index. A collection index has no terms or values defined. This means that the index includes all documents with no search terms, and that the index returns the Reference to each indexed document. Such an index can be created with a name and a source collection:

CreateIndex({
  name: 'new-index',
  source: Collection('spells'),
})

{
  ref: Index("new-index"),
  ts: 1624310362210000,
  active: true,
  serialized: true,
  name: 'new-index',
  source: Collection("spells"),
  partitions: 8
}

Query metrics:

bytesIn: 81
bytesOut: 252
computeOps: 1
readOps: 0
writeOps: 5
readBytes: 1,825
writeBytes: 855
queryTime: 9ms
retries: 0

Index fields

Field Name Field Type Definition and Requirements

name

String

The logical name of the index.

Cannot be events, sets, self, documents, or _. Cannot have the % character.

source

Reference or Array

A Collection reference, or an array of one or more source objects describing source collections and (optional) binding fields.

The ts field can be used as a term or value for an index but should not be used in a binding because it is not known at the time index bindings are computed.

terms

Array

Optional - An array of Term objects describing the fields that should be searchable. Indexed terms can be used to search for field values using the Match function. The default is an empty Array.

values

Array

Optional - An array of Value objects describing the fields that should be reported in search results. The default is an empty Array. When there is no values definition, search results include the Reference of each matching document.

unique

Boolean

Optional - If true, maintains a unique constraint on combined terms and values. The default is false.

serialized

Boolean

Optional - If true, writes to this index are serialized with concurrent reads and writes. Subsequent reads or writes must wait until the writes for this index have completed.

If false, writes to this index occur asynchronously and do not block subsequent reads or writes. This is better performance-wise, but the lack of serialization makes it notably harder to use this index to read your own writes.

The default is true.

See Isolation levels for details.

permissions

Object

Optional - Indicates who is allowed to read the index. The default is everyone can read the index.

data

Object

Optional - This is user-defined metadata for the index. It is provided for the developer to store information at the index level. The default is an empty object with no data.

The maximum size of an index entry, which is comprised of the terms and values content (and some overhead to distinguish multiple fields), must not exceed 64k bytes. If an index entry is too large, the query that created/updated the index entry fails.

Source objects

Source objects describe the source collection of index entries and, optionally, bindings. A binding must be a pure Lambda function (it must not create side effects, such as reads or writes) that emits values to be used as a term and/or value.

An index cannot be created in the same transaction that creates its source collections.

The collection field can be a single collection reference or an array of collection references. Documents in collections matching the collection field apply the associated bindings to be used in the index terms or values definitions. A collection reference can only exist in one source object.

Field Type Definition and Requirements

Field	Type	Definition and Requirements
`collection`	Collection Reference, or Array of collection references	The collection or collections to be indexed.
`fields`	Object	An object mapping a binding name to a `Lambda` function.

collection

Collection Reference, or Array of collection references

The collection or collections to be indexed.

fields

Object

An object mapping a binding name to a Lambda function.

The following examples demonstrates the structure of a source object, which includes an example binding object:

{
  source: {
    collection: Collection("collection"),
    fields: {
      binding1: Query(
        Lambda(
          "document",
          Select(["data", "field"], Var("document")),
        )
      ),
    },
  },
}

{
  source: {
    collection: Collection("collection"),
    fields: {
      binding1: Query(Lambda("document", Select(["data", "field"], Var("document"))))
    }
  }
}

Query metrics:

bytesIn: 201
bytesOut: 244
computeOps: 1
readOps: 0
writeOps: 0
readBytes: 0
writeBytes: 0
queryTime: 7ms
retries: 0

Binding objects

A binding object binds a field name to a pure, single-argument Lambda function (it must not create side effects, such as reads or writes). The function must take the document to be indexed and emit a single scalar value or an array of scalar values. Binding functions are not permitted to use reads or writes.

Once defined, bindings can be used in the index terms or values definitions as if they were document fields.

Functions that cannot be used in bindings include:

{
  binding1: Query(
    Lambda('document', Select(['data', 'field'], Var('document')))
  )
}

{
  binding1: Query(Lambda("document", Select(["data", "field"], Var("document"))))
}

Query metrics:

bytesIn: 116
bytesOut: 137
computeOps: 1
readOps: 0
writeOps: 0
readBytes: 0
writeBytes: 0
queryTime: 2ms
retries: 0

Term objects

Term objects describe the fields whose values are used to search for entries in the index.

When a terms field is missing from an indexed document, the field value in the index is null. If all defined terms fields evaluate to null, no index entry is stored for the document.

If no term objects are defined, passing term values to Match is not required. The resulting set has all documents in the source collection.

A value can be from a field in the document or a binding defined by the source object.

Terms must refer to fields with scalar Values. When a term points to a document field or index binding result that has an array, one index entry per array item is created. Objects are not indexed.

Be aware that when the terms definition includes multiple array fields, the number of index entries created is the Cartesian product of the number of array items. For example, when an index terms definition specifies two fields that are arrays, and a document is created including one array with 5 items and the second array with 11 items, 55 index entries are created. Index write operations are grouped together, so the billing impact depends on the overall size of the index entries.

Field Type Definition

Field	Type	Definition
`field`	Array or String	The field name path or field name in the document to be indexed. The path targets a field value. For this example object: `{ "ref": Ref(Collection("pet_owners"), "12345"), "data": { "name": "Alice", "pets": [ { "type": "dog", "name": "Luna" }, { "type": "cat", "name": "Fluffy" }, ], } }` The following paths can be used to target fields: `"ref"` targets the value for the `ref` field. `["data", "name"]` targets the value for the `data.name` field, "Alice." `["data", "pets", "name"]` targets the `name` field of objects in the `pets` array. Recall that one index item is created per array item during indexing, which flattens the array so that no array offset exists in the index entry.
`binding`	String	The name of a binding from a source object.

field

Array or String

The field name path or field name in the document to be indexed.

The path targets a field value. For this example object:

{
  "ref": Ref(Collection("pet_owners"), "12345"),
  "data": {
    "name": "Alice",
    "pets": [
      { "type": "dog", "name": "Luna" },
      { "type": "cat", "name": "Fluffy" },
    ],
  }
}

The following paths can be used to target fields:

"ref" targets the value for the ref field.
["data", "name"] targets the value for the data.name field, "Alice."
["data", "pets", "name"] targets the name field of objects in the pets array.

Recall that one index item is created per array item during indexing, which flattens the array so that no array offset exists in the index entry.

binding

String

The name of a binding from a source object.

The following example demonstrates an index terms definition with two term objects, the first specifies a binding, the second specifies a document field:

{
  terms: [
    { binding: 'binding1' },
    { field: ['data', 'field'] },
  ]
}

{
  terms: [ { binding: 'binding1' }, { field: [ 'data', 'field' ] } ]
}

Query metrics:

bytesIn: 94
bytesOut: 74
computeOps: 1
readOps: 0
writeOps: 0
readBytes: 0
writeBytes: 0
queryTime: 4ms
retries: 0

Value objects

Value objects describe the fields whose values should be used to sort the index, and whose values should be reported in query results. By default, indexes have no values defined, and return the References of indexed documents.

When a values field is missing from an indexed document, the field value in the index is null. If all defined values fields evaluate to null, no index entry is stored for the document.

A value can be from a field in the document, or a binding function defined in a Source objects.

Values must refer to fields with scalar Values. Objects are not indexed, so when a values definition points to a document field or index binding result that has an Object, the index entry stores null because Objects cannot be sorted. When a values definition points to a document field or index binding result with an Array, one index entry per array item is created.

Field Type Definition

Field	Type	Definition
`field`	Array or String	The field name path or field name in the document to be indexed. The path targets a field value. For this example object: `{ "ref": Ref(Collection("pet_owners"), "12345"), "data": { "name": "Alice", "pets": [ { "type": "dog", "name": "Luna" }, { "type": "cat", "name": "Fluffy" }, ], } }` The following paths can be used to target fields: `"ref"` targets the value for the `ref` field. `["data", "name"]` targets the value for the `data.name` field, "Alice." `["data", "pets", "name"]` targets the `name` field of objects in the `pets` array. Recall that one index item is created per array item during indexing, which flattens the array so that no array offset exists in the index entry.
`binding`	String	The name of a binding from a Source objects.
`reverse`	Boolean	Whether this field value should sort reversed. Defaults to `false`.

field

Array or String

The field name path or field name in the document to be indexed.

The path targets a field value. For this example object:

{
  "ref": Ref(Collection("pet_owners"), "12345"),
  "data": {
    "name": "Alice",
    "pets": [
      { "type": "dog", "name": "Luna" },
      { "type": "cat", "name": "Fluffy" },
    ],
  }
}

The following paths can be used to target fields:

"ref" targets the value for the ref field.
["data", "name"] targets the value for the data.name field, "Alice."
["data", "pets", "name"] targets the name field of objects in the pets array.

Recall that one index item is created per array item during indexing, which flattens the array so that no array offset exists in the index entry.

binding

String

The name of a binding from a Source objects.

reverse

Boolean

Whether this field value should sort reversed. Defaults to false.

The document reference is included in before and after cursors when paging through an index with the Paginate function, even if the reference is not included as a values field. Pagination uses the covered document reference stored in each index entry to stabilize pagination.

All document fields may be indexed. The value of field in a Term or Value object indicates the position in a document for a field. For example, the field ref refers to the top-level ref field. The field ["data", "address", "street"] refers to the street field contained in an address object in the data object of the document.

The following example demonstrates an index values definition with two term objects, the first specifies a binding, the second specifies a document field that should be sorted in reverse:

{
  values: [
    { binding: 'binding1' },
    { field: ['data', 'field'], reverse: true },
  ]
}

{
  values: [
    { binding: 'binding1' },
    { field: [ 'data', 'field' ], reverse: true }
  ]
}

Query metrics:

bytesIn: 110
bytesOut: 90
computeOps: 1
readOps: 0
writeOps: 0
readBytes: 0
writeBytes: 0
queryTime: 2ms
retries: 0

Indexes

Source

Terms

Values

Collection index

Unique

Bindings

Index details

Example index

Index fields

Source objects

Binding objects

Term objects

Value objects

Related topics

Keyboard shortcuts

While reading

While searching

URL fragments