Machine Learning Tutorial: The Naive Bayes Text Classifier | Datumbox
Update: The Datumbox Machine Learning Framework is now open-source and free to download . Check out the package com.datumbox.framework.machinelearning.classification to see the implementation of Naive Bayes Classifier in Java. Note that some of the techniques described below are used on Datumbox's Text Analysis service and they power up our API . What is the Naive Bayes Classifier? The Naive Bayes classifier is a simple probabilistic classifier which is based on Bayes theorem with strong and naïve independence assumptions. It is one of the most basic text classification techniques with various applications in email spam detection, personal email sorting, document categorization, sexually explicit content detection, language detection and sentiment detection. Despite the naïve design and oversimplified assumptions that this technique uses, Naive Bayes performs well in many complex real-world problems. Even though it is often outperformed by other techniques such as boosted trees,Read full article from Machine Learning Tutorial: The Naive Bayes Text Classifier | Datumbox
No comments:
Post a Comment