/ /

Text Search Operators (Self-Managed Deployments)

/ /

Text Search

Text Search Operators (Self-Managed Deployments)

Text Search Operators (Self-Managed Deployments)

$text (Self-Managed Deployments)

Note

This page describes text query capabilities for self-managed (non-Atlas) deployments. For data hosted on MongoDB, MongoDB also offers an improved full-text query solution, MongoDB Search and a vector search solution, Vector Search.

This page describes the $text operator for self-managed deployments.

Definition

$text: $text performs a text query on the content of the fields indexed with a text index.

Compatibility

You can use $text for deployments hosted in the following environments:

MongoDB Atlas: The fully managed service for MongoDB deployments in the cloud

MongoDB Enterprise: The subscription-based, self-managed version of MongoDB
MongoDB Community: The source-available, free-to-use, and self-managed version of MongoDB

Syntax

A $text expression has the following syntax:

{
 $text: {
 $search: <string>,
 $language: <string>,
 $caseSensitive: <boolean>,
 $diacriticSensitive: <boolean>
 }
}

The $text operator accepts a text query document with the following fields:

Field	Type	Description
`$search`	string	A string of terms that MongoDB parses and uses to query the text index. MongoDB performs a logical `OR` query on the terms unless you specify the terms as an exact string. For more information on the field, see Behavior.
`$language`	string	Optional. The language that determines the list of stop words for the query and the rules for the stemmer and tokenizer. If unspecified, MongoDB uses the default language of the index. For supported languages, see Text Search Languages on Self-Managed Deployments. If you specify a `default_language` value of `none`, the text index parses through each word in the field, including stop words, and ignores suffix stemming.
`$caseSensitive`	boolean	Optional. A boolean flag to enable or disable case sensitivity. Defaults to `false`. If unspecified, MongoDB defers to the case insensitivity of the text index. For more information, see Case Insensitivity.
`$diacriticSensitive`	boolean	Optional. A boolean flag to enable or disable diacritic sensitivity against version 3 text indexes. Defaults to `false`. If unspecified, MongoDB defers to the diacritic insensitivity of the text index. Text queries against earlier versions of the text index are inherently diacritic sensitive and cannot be diacritic insensitive. As such, the `$diacriticSensitive` option has no effect with earlier versions of the `text` index. For more information, see Diacritic Insensitivity.

The $text operator, by default, does not return results sorted in terms of the results' scores. For more information on sorting by the results' scores, see the Text Score documentation.

Behavior

Restrictions

A query can specify, at most, one $text expression.
$text cannot appear in $nor expressions.
$text cannot appear in $elemMatch query expressions or $elemMatch projection expressions.
To use $text in an $or expression, all clauses in the $or array must be indexed.
If a query includes a $text expression, you cannot use hint() to specify which index to use for the query.
You cannot specify the $natural sort order if the query includes a $text expression.
You cannot combine the $text expression, which requires a special text index, with a query operator that requires a different type of special index. For example you cannot combine $text expression with the $near operator.
Views do not support $text.
$text is unsupported for creating indexes using the Stable API V1.

If using the $text operator in aggregation, the following restrictions also apply.

The $match stage that includes a $text must be the first stage in the pipeline.
A $text operator can only occur once in the stage.
The $text operator expression cannot appear in $or or $not expressions.
$text, by default, does not return the matching documents in order of matching scores. To sort by descending score, use the $meta aggregation expression in the $sort stage.

`$search` Field

In the $search field, specify a string of words that the $text operator parses and uses to query the text index.

The $text operator treats most punctuation in the string as delimiters, except a hyphen-minus (-) that negates term or an escaped double quotes \" that specifies an exact string.

Note

The $search field for the $text expression is different than the $search aggregation stage provided by Atlas Search. The $search aggregation stage performs a full-text search on specified fields and is only available on MongoDB Atlas.

Exact Strings

To match on an exact multi-word string, as opposed to individual terms, enclose the string in escaped double quotes (\"), as in:

"\"ssl certificate\""

If the $search string of a $text operation includes a multi-word string and individual terms, $text only matches the documents that include the multi-word string.

For example, the following $search string returns documents that include the exact string "ssl certificate":

"\"ssl certificate\" authority key"

Negations

Prefixing a word with a hyphen-minus (-) negates a word:

The negated word excludes documents that contain the negated word from the result set.
When passed a string that only contains negated words, $text does not match any documents.
A hyphenated word, such as pre-market, is not a negation. If used in a hyphenated word, the $text operator treats the hyphen-minus (-) as a delimiter. To negate the word market in this instance, include a space between pre and -market, i.e., pre -market.

The $text operator adds all negations to the operation with the logical AND operator.

Match Operation

Stop Words

The $text operator ignores language-specific stop words, such as the and and in English.

Stemmed Words

When you use case insensitivity and diacritic insensitivity, the $text operator matches on the complete stemmed word. If a document field contains the word blueberry, a $text operation with a $search term of blue does not match. However, blueberry or blueberries match.

Case Sensitivity and Stemmed Words

When you use case sensitivity ($caseSensitive: true), if the suffix stem contains uppercase letters, the $text operator matches on the exact word.

Diacritic Sensitivity and Stemmed Words

When you use diacritic sensitivity ($diacriticSensitive: true), if the suffix stem contains the diacritic mark or marks, the $text operator matches on the exact word.

Case Insensitivity

The $text operator defaults to the case insensitivity of the text index:

The version 3 text index is case insensitive for Latin characters with or without diacritics and characters from non-Latin alphabets, such as the Cyrillic alphabet. See text index for details.
Earlier versions of the text index are case insensitive for Latin characters without diacritic marks; i.e. for [A-z].

`$caseSensitive` Option

To support case sensitivity where the text index is case insensitive, specify $caseSensitive: true.

Case Sensitivity Process

If $caseSensitive: true and the text index is case insensitive, the $text operator:

First queries the text index for case insensitive and diacritic matches.
Then, to return just the documents that match the case of the specified terms, the $text operation includes an additional stage to filter out the documents that do not match the specified case.

If $caseSensitive: true and if the suffix stem contains uppercase letters, the $text operator matches on the exact word.

Specifying $caseSensitive: true may impact performance.

Tip

Stemmed Words

Diacritic Insensitivity

The $text operator defaults to the diacritic insensitivity of the text index:

The version 3 text index is diacritic insensitive. That is, the index does not distinguish between characters that contain diacritical marks and their non-marked counterpart, such as é, ê, and e.
Earlier versions of the text index are diacritic sensitive.

`$diacriticSensitive` Option

To support diacritic sensitivity with the text index, specify $diacriticSensitive: true.

Text queries against earlier versions of the text index are inherently diacritic sensitive and cannot be diacritic insensitive. As such, the $diacriticSensitive option for the $text operator has no effect with earlier versions of the text index.

Diacritic Sensitivity Process

To use diacritic sensitivity ($diacriticSensitive: true) with a version 3 text index, the $text operator:

First queries the text index, which is diacritic insensitive.
Then, to return just the documents that match the diacritic marked characters of the specified terms, the $text operation includes an additional stage to filter out the documents that do not match.

Specifying $diacriticSensitive: true may impact performance.

If you use $diacriticSensitive: true with an earlier version of the text index, the $text operator queries the text index, which is diacritic sensitive.

If $diacriticSensitive: true and if the suffix stem contains the diacritic mark or marks, the $text operator matches on the exact word.

Tip

Stemmed Words

Text Score

The $text operator assigns a score to each result document. The score represents the relevance of a document to a given query. The score can be part of a sort() method specification as well as part of the projection expression. The { $meta: "textScore" } expression provides information on the processing of the $text operation. See $meta projection operator for details on accessing the score for projection or sort.

Examples

The following examples assume a collection articles that has a version 3 text index on the field subject:

db.articles.createIndex( { subject: "text" } )

Populate the collection with the following documents:

db.articles.insertMany( [
 { _id: 1, subject: "coffee", author: "xyz", views: 50 },
 { _id: 2, subject: "Coffee Shopping", author: "efg", views: 5 },
 { _id: 3, subject: "Baking a cake", author: "abc", views: 90 },
 { _id: 4, subject: "baking", author: "xyz", views: 100 },
 { _id: 5, subject: "Café Con Leche", author: "abc", views: 200 },
 { _id: 6, subject: "Сырники", author: "jkl", views: 80 },
 { _id: 7, subject: "coffee and cream", author: "efg", views: 10 },
 { _id: 8, subject: "Cafe con Leche", author: "xyz", views: 10 }
] )

`$text` with a Single Word

The following example specifies a $search string of coffee:

db.articles.find( { $text: { $search: "coffee" } } )

This operation returns the documents that contain the term coffee in the indexed subject field, or more precisely, the stemmed version of the word:

{ _id: 1, subject: 'coffee', author: 'xyz', views: 50 },
{ _id: 7, subject: 'coffee and cream', author: 'efg', views: 10 },
{ _id: 2, subject: 'Coffee Shopping', author: 'efg', views: 5 }

Tip

Match Any of the `$search` Terms

If the $search string is a space-delimited string, $text performs a logical OR operation on each term and returns documents that contain any of the terms.

The following example specifies a $search string of three terms delimited by space, "bake coffee cake":

db.articles.find( { $text: { $search: "bake coffee cake" } } )

This operation returns documents that contain either bake or coffee or cake in the indexed subject field, or more precisely, the stemmed version of these words:

{ "_id" : 2, "subject" : "Coffee Shopping", "author" : "efg", "views" : 5 }
{ "_id" : 7, "subject" : "coffee and cream", "author" : "efg", "views" : 10 }
{ "_id" : 1, "subject" : "coffee", "author" : "xyz", "views" : 50 }
{ "_id" : 3, "subject" : "Baking a cake", "author" : "abc", "views" : 90 }
{ "_id" : 4, "subject" : "baking", "author" : "xyz", "views" : 100 }

Tip

`$text` with an Exact String

To match an exact multi-word string as a single term, escape the quotes.

The following example matches the exact string coffee shop:

db.articles.find( { $text: { $search: "\"coffee shop\"" } } )

This operation returns documents that contain the string coffee shop:

{ "_id" : 2, "subject" : "Coffee Shopping", "author" : "efg", "views" : 5 }

The following example matches the strings coffee shop and Cafe con Leche. This is a logical OR of the two strings.

db.articles.find( { $text: { $search: "\'coffee shop\' \'Cafe con Leche\'" } } )

This operation returns documents that contain both the strings, including documents that contain terms from both the strings:

[
 { _id: 8, subject: 'Cafe con Leche', author: 'xyz', views: 10 },
 { _id: 5, subject: 'Café Con Leche', author: 'abc', views: 200 },
 { _id: 1, subject: 'coffee', author: 'xyz', views: 50 },
 { _id: 7, subject: 'coffee and cream', author: 'efg', views: 10 },
 { _id: 2, subject: 'Coffee Shopping', author: 'efg', views: 5 }
]

Tip

Exact Strings

Exclude Documents That Contain a Term

A negated term is a term that is prefixed by a minus sign -. If you negate a term, the $text operator excludes the documents that contain those terms from the results.

The following example matches documents that contain the word coffee but do not contain the term shop, or more precisely the stemmed version of the words:

db.articles.find( { $text: { $search: "coffee -shop" } } )

The operation returns the following documents:

{ "_id" : 7, "subject" : "coffee and cream", "author" : "efg", "views" : 10 }
{ "_id" : 1, "subject" : "coffee", "author" : "xyz", "views" : 50 }

Tip

Query a Different Language

Use the optional $language field in the $text expression to specify a language that determines the list of stop words and the rules for the stemmer and tokenizer for the $search string.

If you specify a default_language value of none, the text index parses through each word in the field, including stop words, and ignores suffix stemming.

The following example specifies es, i.e. Spanish, as the language that determines the tokenization, stemming, and stop words:

db.articles.find(
 { $text: { $search: "leche", $language: "es" } }
)

The example returns the following documents:

{ "_id" : 5, "subject" : "Café Con Leche", "author" : "abc", "views" : 200 }
{ "_id" : 8, "subject" : "Cafe con Leche", "author" : "xyz", "views" : 10 }

The $text expression can also accept the language by name, spanish. See Text Search Languages on Self-Managed Deployments for the supported languages.

Tip

Case Insensitivity

Case and Diacritic Insensitivity

The $text operator defers to the case and diacritic insensitivity of the text index. The version 3 text index is diacritic insensitive and expands its case insensitivity to include the Cyrillic alphabet as well as characters with diacritics. For details, see text Index Case Insensitivity and text Index Diacritic Insensitivity.

The following example performs a case and diacritic insensitive text query for the terms сы́рники or CAFÉS:

db.articles.find( { $text: { $search: "сы́рники CAFÉS" } } )

Using the version 3 text index, the operation matches the following documents.

{ "_id" : 6, "subject" : "Сырники", "author" : "jkl", "views" : 80 }
{ "_id" : 5, "subject" : "Café Con Leche", "author" : "abc", "views" : 200 }
{ "_id" : 8, "subject" : "Cafe con Leche", "author" : "xyz", "views" : 10 }

With previous versions of the text index, the query would not match any document.

Tip

Case Sensitivity

To enable case sensitivity, specify $caseSensitive: true. Specifying $caseSensitive: true may impact performance.

Case Sensitivity with a Term

The following example performs a case sensitive query for the term Coffee:

db.articles.find( { $text: { $search: "Coffee", $caseSensitive: true } } )

The operation matches just the following document:

{ "_id" : 2, "subject" : "Coffee Shopping", "author" : "efg", "views" : 5 }

Tip

Case Sensitivity with an Exact String

The following example performs a case sensitive query for the exact multi-word string Café Con Leche:

db.articles.find( {
 $text: { $search: "\"Café Con Leche\"", $caseSensitive: true }
} )

The operation matches just the following document:

{ "_id" : 5, "subject" : "Café Con Leche", "author" : "abc", "views" : 200 }

Tip

Case Sensitivity with Negated Term

A negated term is a term that is prefixed by a minus sign -. If you negate a term, the $text operator will exclude the documents that contain those terms from the results. You can also specify case sensitivity for negated terms.

The following example performs a case sensitive query for documents that contain the word Coffee but do not contain the lower-case term shop, or more precisely the stemmed version of the words:

db.articles.find( { $text: { $search: "Coffee -shop", $caseSensitive: true } } )

The operation matches the following document:

{ "_id" : 2, "subject" : "Coffee Shopping", "author" : "efg" }

Tip

Diacritic Sensitivity

To enable diacritic sensitivity with a version 3 text index, specify $diacriticSensitive: true. Specifying $diacriticSensitive: true may impact performance.

Diacritic Sensitivity with a Term

The following example performs a diacritic sensitive text query on the term CAFÉ, or more precisely the stemmed version of the word:

db.articles.find( { $text: { $search: "CAFÉ", $diacriticSensitive: true } } )

The operation only matches the following document:

{ "_id" : 5, "subject" : "Café Con Leche", "author" : "abc" }

Tip

Diacritic Sensitivity with Negated Term

The $diacriticSensitive option applies also to negated terms. A negated term is a term that is prefixed by a minus sign -. If you negate a term, the $text operator will exclude the documents that contain those terms from the results.

The following example performs a diacritic sensitive text query for documents that contain the term leches but not the term cafés, or more precisely the stemmed version of the words:

db.articles.find(
 { $text: { $search: "leches -cafés", $diacriticSensitive: true } }
)

The operation matches the following document:

{ "_id" : 8, "subject" : "Cafe con Leche", "author" : "xyz" }

Tip

Relevance Score Examples

Return the Relevance Score

The following example performs a text query for the term cake and uses the $meta operator in the projection document to append the relevance score to each matching document:

db.articles.find(
 { $text: { $search: "cake" } },
 { score: { $meta: "textScore" } }
)

The returned document includes an additional field score that contains the document's relevance score:

{ "_id" : 3, "subject" : "Baking a cake", "author" : "abc", "views" : 90, "score" : 0.75 }

Tip

$meta

Sort by Relevance Score

You can specify the { $meta: "textScore" } expression in the sort() without also specifying the expression in the projection. For example:
```
db.articles.find(
 { $text: { $search: "cake" } }
).sort( { score: { $meta: "textScore" } } )
```
As a result, you can sort the resulting documents by their relevance without projecting the textScore.
If you include the { $meta: "textScore" } expression in both the projection and sort(), the projection and sort documents can have different field names for the expression.
For example, in the following operation, the projection uses a field named score for the expression and the sort() uses the field named ignoredName.
```
db.articles.find(
 { $text: { $search: "cake" } } ,
 { score: { $meta: "textScore" } }
).sort( { ignoredName: { $meta: "textScore" } } )
```

Tip

$meta

Return Top 2 Matching Documents

Use the limit() method in conjunction with a sort() to return the top n matching documents.

The following example queries for the term coffee and sorts the results by the descending score, limiting the results to the top two matching documents:

db.articles.find(
 { $text: { $search: "coffee" } },
 { score: { $meta: "textScore" } }
).sort( { score: { $meta: "textScore" } } ).limit(2)

Tip

$meta

$text with Additional Query and Sort Expressions

The following example matches documents where the author equals "xyz" and the indexed field subject contains the terms coffee or bake. The operation also specifies a sort order of ascending date, then descending relevance score:

db.articles.find(
 { author: "xyz", $text: { $search: "coffee bake" } },
 { score: { $meta: "textScore" } }
).sort( { date: 1, score: { $meta: "textScore" } } )

Tip

$text in the Aggregation Pipeline on Self-Managed Deployments

Back

Text Search Operators (Self-Managed Deployments)

Text Search in the Aggregation Pipeline

Note

Definition

Compatibility

Syntax

Behavior

Restrictions

$search Field

Note

Exact Strings

Negations

Match Operation

Stop Words

Stemmed Words

Case Sensitivity and Stemmed Words

Diacritic Sensitivity and Stemmed Words

Case Insensitivity

$caseSensitive Option

Case Sensitivity Process

Tip

Diacritic Insensitivity

$diacriticSensitive Option

Diacritic Sensitivity Process

Tip

Text Score

Examples

$text with a Single Word

Tip

Match Any of the $search Terms

Tip

$text with an Exact String

Tip

Exclude Documents That Contain a Term

Tip

Query a Different Language

Tip

Case and Diacritic Insensitivity

Tip

Case Sensitivity

Case Sensitivity with a Term

Tip

Case Sensitivity with an Exact String

Tip

Case Sensitivity with Negated Term

Tip

Diacritic Sensitivity

Diacritic Sensitivity with a Term

Tip

Diacritic Sensitivity with Negated Term

Tip

Relevance Score Examples

Return the Relevance Score

Tip

Sort by Relevance Score

Tip

Return Top 2 Matching Documents

Tip

$text with Additional Query and Sort Expressions

Tip

`$search` Field

`$caseSensitive` Option

`$diacriticSensitive` Option

`$text` with a Single Word

Match Any of the `$search` Terms

`$text` with an Exact String