DataparkSearch

DataparkSearch Engine is a full-featured, open-source, web-based search engine released under the GNU General Public License. It is designed to provide search within a website, a group of websites, an intranet or a local system.

DataparkSearch consists of two parts. The first is the indexing mechanism (the indexer), which follows HTML hyperlinks and stores the words and new links it finds in a database. The second is a web CGI front end that runs searches against the data collected by the indexer.
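
The two-part design described above can be illustrated with a minimal sketch. This is not DataparkSearch's code: the names `PageParser`, `build_index`, `search` and the sample `PAGES` data are hypothetical, the "database" is an in-memory dictionary, and pages are read from a dictionary rather than fetched over the network.

```python
import re
from collections import defaultdict, deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class PageParser(HTMLParser):
    """Collects the words and the href targets found in one HTML page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.words = []
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    # Resolve relative references against the page URL.
                    self.links.append(urljoin(self.base_url, value))

    def handle_data(self, data):
        self.words.extend(w.lower() for w in re.findall(r"\w+", data))


def build_index(pages, start_url):
    """Indexer part: walk links breadth-first, return word -> set of URLs."""
    index = defaultdict(set)
    seen = {start_url}
    queue = deque([start_url])
    while queue:
        url = queue.popleft()
        html = pages.get(url)
        if html is None:  # unknown page, nothing to index
            continue
        parser = PageParser(url)
        parser.feed(html)
        for word in parser.words:
            index[word].add(url)
        for link in parser.links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return index


def search(index, query):
    """Front-end part: return the URLs that contain every query word."""
    words = re.findall(r"\w+", query.lower())
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Hypothetical sample site standing in for real HTTP fetches.
PAGES = {
    "http://example.test/": '<body>Welcome to the <a href="/docs.html">docs</a></body>',
    "http://example.test/docs.html": '<p>Search engine docs</p><a href="/">home</a>',
}

index = build_index(PAGES, "http://example.test/")
print(sorted(search(index, "search engine")))
```

A real indexer would also persist the index, honor robots rules and handle many document types; the sketch only shows how the crawl feeds the search side.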

Key features:

Support for http, https, ftp, nntp and news URL schemes;

htdb virtual URL scheme support for indexing SQL databases;

Built-in support for the text/html, text/xml, text/plain, audio/mpeg (MP3) and image/gif MIME types;

External parser support for other document types;

Ability to index multilingual sites using content negotiation;

Searching all of the word forms using ispell affixes and dictionaries;

Fuzzy searching based on acronyms and abbreviations;

Stopword and synonym lists;

Boolean query language support;

Results sorting by relevance, popularity rank, last modified time and importance (the product of relevance and popularity rank);

Support for various character sets;

Accent-insensitive search;

Phrase segmentation for Chinese, Japanese, Korean and Thai;

mod_dpsearch - search module for Apache web server;

Internationalized Domain Names support;

The Summary Extraction Algorithm.
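
Two of the features above, accent-insensitive search and sorting by importance, can be sketched briefly. This is an illustration under stated assumptions, not DataparkSearch's implementation: `fold_accents`, `sort_by_importance` and the sample `RESULTS` scores are hypothetical, and accent folding is done here with Unicode decomposition, which may differ from what the engine does.

```python
import unicodedata


def fold_accents(text):
    """Strip combining marks so that 'Crème' and 'creme' compare equal."""
    decomposed = unicodedata.normalize("NFKD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


def sort_by_importance(results):
    """Order results by relevance multiplied by popularity rank, highest first."""
    return sorted(
        results,
        key=lambda r: r["relevance"] * r["popularity"],
        reverse=True,
    )


# Hypothetical scores, chosen only to show that the product decides the order.
RESULTS = [
    {"url": "a", "relevance": 0.9, "popularity": 0.1},
    {"url": "b", "relevance": 0.5, "popularity": 0.4},
    {"url": "c", "relevance": 0.7, "popularity": 0.2},
]

for result in sort_by_importance(RESULTS):
    print(result["url"], result["relevance"] * result["popularity"])
```

Sorting by the product means a highly relevant but unpopular page (such as `a` here) can rank below a moderately relevant page that is more popular.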

System Requirements:

Unix, Linux

Version:

4.53

Last updated:

2010-05-06 05:35:28

Publisher:

Datapark Corp.

Homepage:

http://www.datapark.ru/

File name:

dpsearch-4.53.tar.bz2

File size:

2.02MB

License:

GNU General Public License

Price:

0
