Controlling the order in which token filters are applied in ElasticSearch
I know from the documentation that the tokenizer is applied first, then the token filters, but it does not mention how the order of the token filters is determined.
Here is a YAML snippet from my analysis configuration:
KeywordNameIndexAnalyzer:
  type: custom
  tokenizer: whitespace
  filter: [my_word_concatenator, keyword_ngram]
I would have thought my_word_concatenator would be applied before keyword_ngram, but that does not seem to be the case. Does anyone know how (or if) the order of these filters can be controlled?
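For anyone who wants to reproduce this, here is a minimal sketch of how the output of each stage could be inspected. It only builds the request for the _analyze API and does not send it. It assumes a version of ElasticSearch whose _analyze endpoint accepts a JSON body with an "explain" flag. The index name "my_index", the sample text, and the helper function are hypothetical and not part of my setup.

```python
import json


def build_analyze_request(index, analyzer, text):
    """Build the path and JSON body for an _analyze call (hypothetical helper).

    With "explain" enabled, the response is expected to list the tokens
    after the tokenizer and after each token filter, which makes it
    possible to see the order in which the filters actually ran.
    """
    path = "/{}/_analyze".format(index)
    body = {
        "analyzer": analyzer,  # analyzer name from the YAML snippet
        "text": text,          # sample input, chosen only for illustration
        "explain": True,       # ask for per-stage token output
    }
    return path, json.dumps(body)


# "my_index" is an assumed index name, not one from my configuration.
path, body = build_analyze_request("my_index", "KeywordNameIndexAnalyzer", "foo bar")
print("GET", path)
print(body)
```

Sending that body to the printed path with any HTTP client should show which filter produced which tokens.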
Thank you so much!