
What values to use for Min Similarity in a fuzzy query

Hi, I'm working on Find 13.0.1, CMS 11.9.1.

We have added fuzzy logic to our Find query setup, including the MinSimilarity setting. However, we can't find anything online that says what values to use. We notice differences when setting MinSimilarity to a whole number such as 0, 1, or 2, but setting it to a decimal brings back the same results as the closest whole number. We are not 100% sure what should be returned. If I search for "Nursing" but misspell it, e.g. "Nrsing", I would expect it to return "Nursing" results if MinSimilarity is set to 1. If I get two letters wrong in the search term, I would expect no results unless I set MinSimilarity to 2. Am I right about how this should work? Also, do the mistakes need to be right next to each other?

Any help is much appreciated.

#197082
Sep 21, 2018 5:13
+1 

I'm also looking for documentation about this here on Episerver World to guide me when setting the MinSimilarity value.

There's documentation for Elasticsearch that could possibly be relevant, I guess.
If so, it would be nice if there were a link to it from Episerver World.

This is what I've found. 

Note! I'm not sure which version of Elasticsearch is currently used by Episerver Find.

Elasticsearch Fuzzy Query

https://www.elastic.co/guide/en/elasticsearch/reference/6.5/query-dsl-fuzzy-query.html

Fuzziness (MinSimilarity ???)

https://www.elastic.co/guide/en/elasticsearch/reference/6.5/common-options.html#fuzziness
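The Elasticsearch page above describes fuzziness as a Levenshtein edit distance, i.e. the number of one-character changes needed to turn one string into another. If MinSimilarity maps to that setting (an assumption, I haven't seen it confirmed for Find), the behaviour described in the question can be illustrated with a small Python sketch. The function names `edit_distance` and `matches` are made up for illustration and are not part of Find or Elasticsearch. The sketch also uses plain Levenshtein distance, so a swap of two adjacent letters counts as two edits here.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: the minimum number of single-character
    insertions, deletions, or substitutions needed to turn a into b."""
    # prev[j] holds the distance between the first i-1 chars of a
    # and the first j chars of b.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute (or keep) the character
            ))
        prev = curr
    return prev[-1]


def matches(term: str, candidate: str, max_edits: int) -> bool:
    """Hypothetical helper: would candidate match term within max_edits?"""
    return edit_distance(term.lower(), candidate.lower()) <= max_edits


if __name__ == "__main__":
    for typo in ("Nursing", "Nrsing", "Nrsng"):
        print(typo, edit_distance(typo, "Nursing"))
```

Under that assumption, "Nrsing" is one edit away from "Nursing" and "Nrsng" is two edits away, and the two mistakes in "Nrsng" are not next to each other. Edit distance only counts the changes, it does not care where in the word they occur.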

#200381
Jan 10, 2019 9:04