Our search solutions are built on top of mature, enterprise-quality, widely-used open source frameworks, libraries, and components. Data collection Apache ManifoldCF Apache ManifoldCF provides a framework for connecting source content repositories like file systems, DB, CMIS ... to target repositories or indexes, such as Apache Solr. http://manifoldcf.apache.org/ Apache Nutch Apache Nutch is a mature, highly scalable web crawler which provides extensible interfaces for parsing (for example for Tika), indexing (for example through Solr, SolrCloud, ...), and filters for custom implementations.
Read full article from Technologies
No comments:
Post a Comment