Mallet

Mallet (MAchine Learning for LanguagE Toolkit) "is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text". Its tools cover five functions: importing data, classifying documents, sequence tagging, topic modelling, and numerical optimization. Mallet also offers an add-on package, GRMM, which adds support for general graphical models. Each of these categories functions as a toolkit in its own right, equipped with several applications and resources that may be useful to scholars conducting that kind of research.
