Whoosh

Fast, pure-Python full text indexing, search, and spell checking library

Whoosh Ranking & Summary


  • License: Apache
  • Price: FREE
  • Publisher Name: Matt Chaput
  • Publisher web site: http://whoosh.ca/
  • Operating Systems: Mac OS X 10.0 or later
  • File Size: 154 KB

Whoosh Description

Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.

Whoosh is as fast as or faster at indexing than other Python search libraries that wrap C or Java libraries (e.g. PyLucene, Xappy), and not that much slower at searching. Bottom line, Whoosh is fast enough for most applications and should be useful in the following circumstances:

  • Anywhere a pure-Python solution is desirable to avoid having to build/compile native libraries (or force users to build/compile them).
  • As a research platform (at least for programmers who find Python easier to read and work with than Java ;)
  • When an easy-to-use Pythonic interface is more important to you than raw speed.

Whoosh takes much inspiration (and sometimes translates code) from other search engines. The fundamental design is similar to Lucene (although it does not work entirely like Lucene and is not compatible with it), but it uses some scoring algorithms from Terrier, KinoSearch's indexing algorithm, and the English morphological variation generator from Minion.

Here are some key features of "Whoosh":

  • Pythonic API.
  • Pure-Python. No compilation or binary packages needed, no mysterious crashes.
  • Fielded indexing and search.
  • Fast indexing and retrieval, much faster than any other pure-Python solution.
  • Pluggable scoring algorithm (including BM25F), text analysis, storage, posting format, etc.
  • Powerful query language parsed by pyparsing.
  • Pure-Python spell-checker (as far as I know, the only one).

Requirements:

  • Python

What's New in This Release:

  • Added Weighting.final() method, which can change the final score of a document after all term scores have been accumulated.
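To make "fielded indexing", "pluggable scoring" and the idea of a final-score hook more concrete, here is a minimal standard-library sketch. It is not Whoosh's API or implementation: the names ToyFieldedIndex, ToyBM25, BoostedBM25 and search are hypothetical and exist only for this illustration. The scorer is plain per-field BM25 rather than BM25F, and its final() method only imitates the role the release note describes for Weighting.final().

```python
import math
import re
from collections import Counter, defaultdict


class ToyFieldedIndex:
    """Toy in-memory inverted index keyed by (field, term). Hypothetical, not Whoosh."""

    def __init__(self):
        self.postings = defaultdict(dict)  # (field, term) -> {doc_id: term frequency}
        self.field_lengths = {}            # (field, doc_id) -> token count
        self.doc_ids = set()

    @staticmethod
    def tokenize(text):
        # Lowercase and split on anything that is not a letter or digit.
        return re.findall(r"[a-z0-9]+", text.lower())

    def add_document(self, doc_id, **fields):
        self.doc_ids.add(doc_id)
        for field, text in fields.items():
            tokens = self.tokenize(text)
            self.field_lengths[(field, doc_id)] = len(tokens)
            for term, tf in Counter(tokens).items():
                self.postings[(field, term)][doc_id] = tf

    def avg_field_length(self, field):
        lengths = [n for (f, _), n in self.field_lengths.items() if f == field]
        return sum(lengths) / len(lengths) if lengths else 0.0


class ToyBM25:
    """Plain per-field BM25 with a final() hook (an assumption for illustration)."""

    def __init__(self, k1=1.2, b=0.75):
        self.k1 = k1
        self.b = b

    def score(self, index, field, term, doc_id, tf):
        n_docs = len(index.doc_ids)
        df = len(index.postings.get((field, term), {}))
        idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
        length = index.field_lengths[(field, doc_id)]
        norm = 1 - self.b + self.b * length / index.avg_field_length(field)
        return idf * tf * (self.k1 + 1) / (tf + self.k1 * norm)

    def final(self, doc_id, score):
        # Called once per document after all term scores are accumulated.
        return score


class BoostedBM25(ToyBM25):
    """Swappable scorer that rescales accumulated scores in final()."""

    def __init__(self, boosts, **kwargs):
        super().__init__(**kwargs)
        self.boosts = boosts

    def final(self, doc_id, score):
        return score * self.boosts.get(doc_id, 1.0)


def search(index, weighting, field, query):
    """Accumulate per-term scores, then let the weighting adjust the totals."""
    totals = defaultdict(float)
    for term in index.tokenize(query):
        for doc_id, tf in index.postings.get((field, term), {}).items():
            totals[doc_id] += weighting.score(index, field, term, doc_id, tf)
    ranked = [(doc_id, weighting.final(doc_id, s)) for doc_id, s in totals.items()]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


index = ToyFieldedIndex()
index.add_document("a", title="Whoosh", body="pure python search library")
index.add_document("b", title="Other", body="search engine written in java")
index.add_document("c", title="Recipes", body="python cooking recipes")

plain = search(index, ToyBM25(), "body", "python search")
boosted = search(index, BoostedBM25({"b": 100.0}), "body", "python search")
title_hits = search(index, ToyBM25(), "title", "whoosh")
```

Because the scorer is passed to search() as an ordinary object, swapping ToyBM25 for BoostedBM25 changes the ranking without touching the index, which is the general idea behind a pluggable scoring design.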

