Upe Blog: Java Wrapper for Tesseract OCR Library
Tesseract is a very popular OCR library written in C++. It can be used to recognize the characters in an image that contains text, and it can also report the position of each word or character. Tesseract offers a command line tool and a C++ API, but there is no implementation that lets Java users call Tesseract directly from their applications.

As part of my GSoC project in Apache PDFBox, I implemented a Java wrapper for the Tesseract C++ API so that Java users can use Tesseract directly in their applications. The code repository can be found here.

To use the Java API, simply import Tesseract-JNI-Wrapper-1.0.0.jar into your project. If you are using Maven, add this to your pom...
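For comparison, here is a rough sketch of what calling the Tesseract command line tool from Java looks like without a wrapper. It is not part of the wrapper described in this post. The class and method names (TesseractCli, buildCommand, run) are made up for illustration, and it assumes a tesseract executable is on the PATH, a Tesseract version recent enough to ship the tsv config (3.05 or later), and Java 9 or later.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of the JNI wrapper: it shells out to the
// tesseract command line tool instead of calling the C++ API.
final class TesseractCli {

    // Builds: tesseract <image> stdout -l <language> [tsv]
    // "stdout" makes tesseract print the result instead of writing a file.
    // The "tsv" config (assumed available, Tesseract 3.05+) adds word positions.
    static List<String> buildCommand(Path image, String language, boolean wordPositions) {
        List<String> command = new ArrayList<>();
        command.add("tesseract");
        command.add(image.toString());
        command.add("stdout");
        command.add("-l");
        command.add(language);
        if (wordPositions) {
            command.add("tsv");
        }
        return command;
    }

    // Runs the command and returns whatever tesseract printed on stdout.
    // Assumes the tesseract executable can be found on the PATH.
    static String run(Path image, String language, boolean wordPositions)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(image, language, wordPositions))
                .redirectError(ProcessBuilder.Redirect.INHERIT) // pass warnings through to the console
                .start();
        byte[] output = process.getInputStream().readAllBytes();
        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("tesseract exited with code " + exitCode);
        }
        return new String(output, StandardCharsets.UTF_8);
    }
}
```

Something like TesseractCli.run(Paths.get("scan.png"), "eng", false) would return the recognized text. Every call starts a new process and passes data as text through pipes, which is the kind of overhead a direct wrapper around the C++ API avoids.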