Hounder

An open source, simple and complete search system
Download

Hounder Ranking & Summary

Advertisement

  • Rating:
  • License:
  • Freeware
  • Price:
  • FREE
  • Publisher Name:
  • Flaptor
  • Publisher web site:
  • http://www.flaptor.com/
  • Operating Systems:
  • Mac OS X
  • File Size:
  • 28.5 MB

Hounder Tags


Hounder Description

An open source, simple and complete search system Out of the box, Hounder will crawl the web targeting only those documents of interest, and will automatically present them through a simple search web page. Hounder is:· configurable: Hounder can be customized for your particular needs. Use it as a standalone solution or as a building block for your existing application.· scalable: Hounder grows with your needs. Start with a single machine, add more as needed.· robust: The big bad web is no place for puppies. Hounder's searcher has been designed from the beginning to survive traffic surges.· easy to integrate: Hounder can be used from various languages, such as Java, PHP, Python, etc.· trainable: point Hounder's attention to the information you want by feeding it with training sets, then sit back and watch it fetch the right documents for you.NOTE: Hounder is provided and licensed under the terms of the Apache 2.0 License. Here are some key features of "Hounder": Designed for flexibility: · Works as a complete solution (from crawling the web to providing a search interface). · Works as a complement to existing solutions (feeding a content system or indexing a data stream). · Crawl specific sites in depth or the web at large, searching for relevant pages and automatically classifying them · Add custom modules to indexer, crawler and searcher to add functionality. Installation: · GUI and Command line installer. · Configuration wizard for most common uses. Integration: · Searcher supports XML-RPC, RMI and OpenSearch. · Indexer supports XML-RPC and RMI. Indexing: · Document processing pipeline defined by modules, defined as plugins: create and add your own. · Existent modules include: filtering spam, adding certain fields, logging, etc. · Manage when and how index updates are submitted to the searcher. Crawler: · Bayesian filter to determine if a page is of interest or to which category it belongs. · Politeness. · Detects page content change and adapts frequency of recrawling. · Document processing pipeline defined by modules, defined as plugins: create and add your own. · Existent modules include: whitelisting, blacklisting, boosting, classifying, caching, indexing, etc. Search results: · Snippet generation. · Results grouping. · Boosting. Queries: · Define fields for your documents and search on the fields you want. · Operators: Or, And, Not. · Phrase recognition. Performance: · Results caching. · Smart query execution and queue size management. Monitoring and controlling: · Monitor and control all nodes of Hounder with the clustering web application. What's New in This Release: · Switched the fetcher from Nutch 0.7.2 to Nutch 0.9 · The Nutch9 http plugin can now work distributed · The crawler IndexerModule can now talk to multiple indexers · The crawler now makes progress reports · There is now spam detection support · The MultiSearcher returns stats about each searcher · There is a new indexer for batch re-indexations on multi-cored hardware


Hounder Related Software