Today for my 30 day challenge, I decided to learn how to do text and image extraction from web links using the Java programming language. This is a very common requirement in most of the content discovery websites like Prismatic. In this blog, we will learn how we can use a Java library called boilerpipe to accomplish this task.
Read full article from Day 18: BoilerPipe--Article Extraction for Java Developers | Openshift Blog
No comments:
Post a Comment