You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* ELSER works best on small-to-medium sized fields that contain natural
431
-
language. For connector or web crawler use cases, this aligns best with fields
432
-
like _title_, _description_, _summary_, or _abstract_. As ELSER encodes the
433
-
first 512 tokens of a field, it may not provide as relevant of results for large
434
-
fields. For example, `body_content` on web crawler documents, or body fields
435
-
resulting from extracting text from office documents with connectors. For larger
436
-
fields like these, consider "chunking" the content into multiple values, where
437
-
each chunk can be under 512 tokens.
438
-
* Larger documents take longer at ingestion time, and {infer} time per
439
-
document also increases the more fields in a document that need to be processed.
440
-
* The more fields your pipeline has to perform inference on, the longer it takes
441
-
per document to ingest.
430
+
* ELSER works best on small-to-medium sized fields that contain natural language.
431
+
For connector or web crawler use cases, this aligns best with fields like _title_, _description_, _summary_, or _abstract_.
432
+
As ELSER encodes the first 512 tokens of a field, it may not provide as relevant of results for large fields.
433
+
For example, `body_content` on web crawler documents, or body fields resulting from extracting text from office documents with connectors.
434
+
For larger fields like these, consider "chunking" the content into multiple values, where each chunk can be under 512 tokens.
435
+
* Larger documents take longer at ingestion time, and {infer} time per document also increases the more fields in a document that need to be processed.
436
+
* The more fields your pipeline has to perform inference on, the longer it takes per document to ingest.
442
437
443
438
To learn more about ELSER performance, refer to the <<elser-benchmarks>>.
444
439
@@ -460,15 +455,21 @@ Always review and clean your input text before ingestion to eliminate any irrele
460
455
461
456
To gain the biggest value out of ELSER trained models, consider to follow this list of recommendations.
462
457
463
-
* Use two ELSER {infer} endpoints: one optimized for ingest and one optimized for search.
464
458
* If quick response time is important for your use case, keep {ml} resources available at all times by setting `min_allocations` to `1`.
465
459
* Setting `min_allocations` to `0` can save on costs for non-critical use cases or testing environments.
460
+
* Enabling <<ml-nlp-auto-scale,autoscaling>> through adaptive allocations or adaptive resources makes it possible for {es} to scale up or down the available resources of your ELSER deployment based on the load on the process.
461
+
462
+
* Use two ELSER {infer} endpoints: one optimized for ingest and one optimized for search.
463
+
** In {kib}, you can select for which case you want to optimize your ELSER deployment.
464
+
** If you use the {infer} API and want to optimize your ELSER endpoint for ingest, set the number of threads to `1` (`"num_threads": 1`).
465
+
** If you use the {infer} API and want to optimize your ELSER endpoint for search, set the number of threads to greater than `1`.
466
466
467
467
468
468
[discrete]
469
469
[[further-readings]]
470
470
== Further reading
471
471
472
+
* {ref}/semantic-search-semantic-text.html[Perform semantic search with `semantic_text` using the ELSER endpoint]
472
473
* {ref}/semantic-search-elser.html[Perform semantic search with ELSER]
0 commit comments