Our Solution to Solr Multiterm Synonyms: The Match Query Parser
You have probably heard us talk about Solr multiterm synonyms a lot! It's a big problem that prevents a lot of organizations from getting reasonable search relevance out of Solr. The problem has been described as the "sea biscuit" problem. Because, if you have a synonyms.txt file like:
sea biscuit => seabiscuit
… you unfortunately won't get what you expect at query time. This is because most Solr query parsers break up query strings on spaces before running query-time analysis. If you search for "sea biscuit" Solr sees this first as [sea]
OR [biscuit]
. The required analysis step then happens on each individual clause – first on just "sea" then on just "biscuit." Without analysis seeing a "sea" right before a "biscuit", query time analysis doesn't recognize the synonym listed above. Bummer.
Read full article from Our Solution to Solr Multiterm Synonyms: The Match Query Parser
No comments:
Post a Comment