java - Elasticsearch lowercase tokenizer quirk? -
I am testing mapping for URLs in elasticsearch.
I want to find a domain name with TLD (like example.com
) and with TLD (for example, example
) and returning For complete domain documentation (e.g., http: //example.com
and www.example.com
and so on)
I used this mapping In the ES - Sense:
PUT / en_docs {"mappings": {"url": {"properties": {"content": {"type": "string" "" "": "" "" "" "" "" "" "" "" "" "" Sto {"Stopwords_filter": {"type": "stop", "stopwounds": ["http", "https", "ftp", "www"]}}}}}
Now, when I store a URL document, eg
POST / en_docs / url {"content": "http://example.com"}
I search it for example.com
but simply example
does not return anything Docks is called, and my analysts view example
and as a direct test. Com
, but my analyzer uses the lowercase
tokenizer, but when I search for the indexed document, example
nothing returns :
Get / en_docs / url / _search? Q = Example
No results, but if the query is example.com
, the result will be returned.
What am I missing?
Comments
Post a Comment