DataparkSearch

DataparkSearch Engine is a full-featured open sources web-based search engine released under the GNU General Public License and designed to organize search within a website, group of websites, intrane
Download

DataparkSearch Ranking & Summary

Advertisement

  • Rating:
  • License:
  • Freeware
  • Language:
  • English
  • Publisher Name:
  • Datapark Corp.
  • Publisher web site:
  • Operating Systems:
  • Any Linux Distribution
  • File Size:
  • 2.1 MB

DataparkSearch Tags


DataparkSearch Description

DataparkSearch Engine is a full-featured open sources web-based search engine released under the GNU General Public License and designed to organize search within a website, group of websites, intranet or local system. Key features * Support for http, https, ftp, nntp and news URL schemes. * htdb virtual URL scheme for indexing SQL databases. * Indexes text/html, text/xml, text/plain, audio/mpeg (mp3) and image/gif mime types natively. * External parsers support for other document types, including Microsoft Word, Excel, RTF, PowerPoint, Adobe Acrobat PDF and Flash. * Can index multilingual sites using content negotiation. * Can search all of the word forms using ispell affixes and dictionaries. * Synonym, acronym and abbreviation query expansion based on editable dictionaries, specified by language and charset. * Stop-words, synonyms and Acronyms lists. * Options to query with all words, all words near to each others, any words, or Boolean queries. A subset of VQL (Verity Query Language) is supported. * Popularity Rank based on a neural network model. * Results can be sorted by relevancy (using vector calculation), popularity rank as "Goo" (adding weight for incoming links), and "Neo" (neural network model), last modified time, and by "importance" (a Combination of relevancy and popularity rank). * Supports wide range of character sets support with automated character set and language detection. * Offers an accent insensitive search option. * Provides phrase segmenting (tokenizing) for Chinese, Japanese, Korean and Thai. * Includes an indexer and a web CGI front-end, as well as a search module for Apache Web Server (mod_dpsearch). * Handles Internationalized Domain Names (IDN). * Summary Extraction Algorithm automatically sums up each document in several sentences. * Uses If-Modified-Since for efficient transfer of only changed files.


DataparkSearch Related Software