Recently, I had a client using the LucidWorks search engine who needed to integrate with the Nutch crawler. This sounds simple, as both products have been around for a while and are officially integrated. Even better, there are already some great “getting started in x minutes” tutorials out there for Nutch, Solr, and LucidWorks. But a few gotchas kept those tutorials from working for me out of the box. This blog post documents my process of getting Nutch up and running on an Ubuntu server.

0) Install Java

Included as step 0,
Read full article from Crawling with Nutch | OpenSource Connections | Solr, Big Data, and NoSQL consultants