Skip to content

Latest commit

 

History

History
188 lines (130 loc) · 6.04 KB

identifiers.rst

File metadata and controls

188 lines (130 loc) · 6.04 KB

Identifiers

Table of contents

Identifiers are used for naming your database objects, such as index name, field name, alias etc. Basically there are two types of identifiers: regular identifiers and delimited identifiers.

A regular identifier is a string of characters that must start with ASCII letter (lower or upper case). The subsequent character can be a combination of letter, digit, underscore (_). It cannot be a reversed key word. And whitespace and other special characters are not allowed.

For OpenSearch, the following identifiers are supported extensionally:

  1. Identifiers prefixed by dot .: this is called hidden index in OpenSearch, for example .opensearch_dashboards.
  2. Identifiers prefixed by at sign @: this is common for meta fields generated in Logstash ingestion.
  3. Identifiers with - in the middle: this is mostly the case for index name with date information.
  4. Identifiers with star * present: this is mostly an index pattern for wildcard match.

Index name with date suffix separated by dash or dots, such as cwl-2020.01.11 or logs-7.0-2020.01.11, is common for those created by Logstash or FileBeat ingestion. So, this kind of identifier used as index name is also supported without the need of being quoted for user convenience. In this case, wildcard within date pattern is also allowed to search for data across indices of different date range. For example, you can use logs-2020.1* to search in indices for October, November and December 2020.

Here are examples for using index pattern directly without quotes:

os> source=accounts | fields account_number, firstname, lastname;
fetched rows / total rows = 4/4
+----------------+-----------+----------+
| account_number | firstname | lastname |
|----------------+-----------+----------|
| 1              | Amber     | Duke     |
| 6              | Hattie    | Bond     |
| 13             | Nanette   | Bates    |
| 18             | Dale      | Adams    |
+----------------+-----------+----------+

A delimited identifier is an identifier enclosed in back ticks `. In this case, the identifier enclosed is not necessarily a regular identifier. In other words, it can contain any special character not allowed by regular identifier.

Here are typical examples of the use of delimited identifiers:

  1. Identifiers of reserved key word name
  2. Identifiers with dot . present: similarly as - in index name to include date information, it is required to be quoted so parser can differentiate it from identifier with qualifiers.
  3. Identifiers with other special character: OpenSearch has its own rule which allows more special character, for example Unicode character is supported in index name.

Here are examples for quoting an index name by back ticks:

os> source=`accounts` | fields `account_number`;
fetched rows / total rows = 4/4
+----------------+
| account_number |
|----------------|
| 1              |
| 6              |
| 13             |
| 18             |
+----------------+

A cross-cluster index identifier is an index identifier with a prefix <cluster identifier>:. The cluster identifier could contain star *. This is mostly an cluster pattern for wildcard match.

It is used to identify an index on a remote cluster for cross-cluster search.

For example, if you setup a connection between the local cluster and a remote cluster my_cluster, then you can run source=my_cluster:accounts to query the accounts index at my_cluster.

Identifiers are treated in case sensitive manner. So it must be exactly same as what is stored in OpenSearch.

For example, if you run source=Accounts, it will end up with an index not found exception from our plugin because the actual index name is under lower case.

To query multiple indices, you could

  1. Include * in index name, this is an index pattern for wildcard match.
  2. Include multiple indices and seperated them by ,.
  3. Delimited multiple indices and seperated them by ,. Note: no space allowed between each index.

Query wildcard indices:

os> source=acc* | stats count();
fetched rows / total rows = 1/1
+---------+
| count() |
|---------|
| 5       |
+---------+

Query multiple indices seperated by ,:

os> source=accounts, account2 | stats count();
fetched rows / total rows = 1/1
+---------+
| count() |
|---------|
| 5       |
+---------+

Query delimited multiple indices seperated by ,:

os> source=`accounts,account2` | stats count();
fetched rows / total rows = 1/1
+---------+
| count() |
|---------|
| 5       |
+---------+

One can also provide meta-field name(s) to retrieve reserved-fields (beginning with underscore) from OpenSearch documents. Meta-fields are not output as default field list (search source=<index>) and must be explicitly included to be returned.

Query metadata fields:

os> source=accounts | fields firstname, lastname, _index, _sort;
fetched rows / total rows = 4/4
+-----------+----------+----------+-------+
| firstname | lastname | _index   | _sort |
|-----------+----------+----------+-------|
| Amber     | Duke     | accounts | -2    |
| Hattie    | Bond     | accounts | -2    |
| Nanette   | Bates    | accounts | -2    |
| Dale      | Adams    | accounts | -2    |
+-----------+----------+----------+-------+