by Mark Shead on May 26, 2011 I was recently working on a simple application where the user will enter famous quotations . Obviously we want to avoid duplicates so I needed a way to check for quotations that were substantially similar before a new quote was added to the database. The idea was to show the top 5 most similar quotes before letting the user save the new quotation to the db. I used Lucene for this which allowed me to punt on the more difficult task of figuring out if two quotes were similar or not.
Read full article from Lucene MoreLikeThis Example Code — Mark Shead
No comments:
Post a Comment