text analyzer stemmer filter language support #41998
faileon
started this conversation in
Ideas & Feature requests
Replies: 2 comments 3 replies
-
|
Wait @aoiasd to comment. |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
I just discussed with @aoiasd. So far, we don't have the stemming filter for Czech. The milvus analyzer is mainly powered by Tantivy, and the snowball project provides stemmers for tantivy. Czech is not in the list of the stemmers of snowball: https://github.com/snowballstem/snowball/blob/master/libstemmer/modules.txt |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
are there any plans to include more languages for the stemming filter step in text analyzer? It would greatly improve keyword/hybrid search in my local language (czech). Will it eventually be possible to add custom stemmers?
The roadmap has the following section for CY25:
Analyzer Enhancement
Enhance Analyzer with expanded tokenizer support and improved observability
Perhaps it will be included with this feature?
Beta Was this translation helpful? Give feedback.
All reactions