All About Programming: Custom Per-Field Similarity in Solr4.1

<similarity>com.sdudhara.MyCustomSImilarity</similarity>

 <fieldType name="text_dfr" class="solr.TextField">
    <analyzer class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
    <similarity class="solr.DFRSimilarityFactory">
      <str name="basicModel">I(F)</str>
      <str name="afterEffect">B</str>
      <str name="normalization">H2</str>
    </similarity>
 </fieldType>
 <fieldType name="text_ib" class="solr.TextField">
    <analyzer class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
    <similarity class="solr.IBSimilarityFactory">
      <str name="distribution">SPL</str>
      <str name="lambda">DF</str>
      <str name="normalization">H2</str>
    </similarity>
 </fieldType>

You will also need to delete existing data, reindex the data, since Similarity class is also used during index times. So, unless you reindex the data, you wont be able to get the custom similarities take effect

Labels

Popular Posts

Custom Per-Field Similarity in Solr4.1

Solr provides a way to override the default Lucene Similarity class by specifying under schema.xml as mentioned below:

In the MyCustomSImilarity class that extends from DefaultSimilarity (or any other Lucene Similarity class) , you can override the methods in the DefaultSimilarity class e.g. tf(). This will however impact all the fields and use same logic to calculate tf() score for all the fields.

To do this at the field level, in the schema.xml, go to the fieldType where you have defined the fieldType for the given field. Within the fieldType, you can add one more line with <similiarity>com.sdudhara.MyCustomSimilarity</similarity>

You will also need to delete existing data, reindex the data, since Similarity class is also used during index times. So, unless you reindex the data, you wont be able to get the custom similarities take effect

A related StackOverflow link that I had posted to resolve the issue:

http://stackoverflow.com/questions/15751766/solr-4-1-dismax-pf-not-returning-expected-results/15868556#15868556

No comments:

Post a Comment

Labels

Popular Posts