Blog Categorisation using Encog, ROME, JSoup and Google Guava

Continuing with Programming Collective Intelligence (PCI), the next exercise was to use distance scores to pigeonhole a list of blogs based on the words used within each blog. I had already settled on Encog as the framework for the AI / machine learning algorithms; for this exercise I also needed an RSS reader and an HTML parser. The two libraries I ended up using were ROME (RSS reader) and JSoup (HTML parser).

Blogs Used:
http://blog.guykawasaki.com/index.rdf
http://blog.outer-court.com/rss.xml
http://flagrantdisregard.com/index.php/feed/
http://gizmodo.com/index.
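To make the idea of a distance score between blogs concrete, here is a minimal sketch of my own, not the code from the full article. It assumes the blog text has already been fetched and stripped of HTML (the job ROME and JSoup do in the post), and it uses only the JDK rather than Guava or Encog. The class name BlogDistanceSketch and both method names are hypothetical. The distance is 1 minus the Pearson correlation of the two word-count vectors, which is the measure PCI uses for its blog clustering exercise; whether the original post used exactly this measure is an assumption.

```java
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical helper class: illustrates word counting and a Pearson-based distance.
class BlogDistanceSketch {

    // Count lower-cased alphabetic words in plain text (HTML already removed).
    static Map<String, Integer> wordCounts(String text) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String token : text.toLowerCase().split("[^a-z]+")) {
            if (!token.isEmpty()) {
                counts.merge(token, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Pearson distance (1 - r) over the combined vocabulary of both blogs.
    // 0 means the word usage is perfectly correlated, 2 means perfectly opposed.
    static double pearsonDistance(Map<String, Integer> a, Map<String, Integer> b) {
        Set<String> vocab = new TreeSet<>(a.keySet());
        vocab.addAll(b.keySet());
        int n = vocab.size();
        if (n == 0) {
            return 1.0; // nothing to compare: treat as uncorrelated
        }

        double sumA = 0, sumB = 0, sumASq = 0, sumBSq = 0, sumAB = 0;
        for (String word : vocab) {
            double x = a.getOrDefault(word, 0);
            double y = b.getOrDefault(word, 0);
            sumA += x;
            sumB += y;
            sumASq += x * x;
            sumBSq += y * y;
            sumAB += x * y;
        }

        double numerator = sumAB - (sumA * sumB / n);
        double denominator = Math.sqrt(
                (sumASq - sumA * sumA / n) * (sumBSq - sumB * sumB / n));
        if (!(denominator > 0)) {
            return 1.0; // no variation in one vector: treat as uncorrelated
        }
        return 1.0 - numerator / denominator;
    }
}
```

With scores like this for every pair of blogs, a clustering step can group the blogs whose distances are smallest.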
Read full article from Zen in the art of IT: Blog Categorisation using Encog, ROME, JSoup and Google Guava