MALLET

A free Java library for machine learning applied to text
Download

MALLET Ranking & Summary

Advertisement

  • Rating:
  • License:
  • Freeware
  • Price:
  • FREE
  • Publisher Name:
  • Andrew McCallum
  • Publisher web site:
  • http://www.cs.umass.edu/~mccallum/
  • Operating Systems:
  • Mac OS X
  • File Size:
  • 21.9 MB

MALLET Tags


MALLET Description

A free Java library for machine learning applied to text MALLET is a free Java-based package for statistical natural language processing, topic modeling, information extraction, document classification, clustering, and other machine learning applications to text.MALLET includes sophisticated tools for document classification: efficient routines for converting text to "features", a wide variety of algorithms (including Naïve Bayes, Maximum Entropy, and Decision Trees), and code for evaluating classifier performance using several commonly used metrics.MALLET provides facilities not only for document classification, but also information extraction, part-of-speech tagging, noun phrase segmentation, and much more. The development of the library is quite mature, however it does not yet have as polished front-ends or documentation as rainbow.NOTE: MALLET is licensed and released under the terms of the Common Public License. Requirements: · Java What's New in This Release: Major updates: · An implementation of generalized expectation criteria training of MaxEnt classifiers and methods for obtaining constraints (c.f. Gregory Druck, Gideon Mann, Andrew McCallum "Learning from Labeled Features using Generalized Expectation Criteria.") · PagedInstanceList has been substantially rewritten by Mike Bond. · Bug fixes to topic model hyperparameter optimization and topic inference.


MALLET Related Software