A powerful free and open source search engine!
ASPseek is an Internet search engine written in C++ using the STL. It consists of an indexing robot, a search daemon, and a search front end (CGI or Apache module). It can index up to a few million URLs and search them for words and phrases, with support for wildcards and Boolean queries. Search results can be limited to a given time period, a site, or a Web space (a set of sites), and sorted by relevance (using PageRank) or by date. ASPseek can handle many languages and encodings, including multibyte encodings such as Chinese. It is optimized for multiple sites (multithreaded indexing, asynchronous DNS lookups, grouping of results by site, Web spaces), but it can also be used to search a single site. Other features include stopword and ispell support, a character set and language guesser, HTML templates for search results, excerpts, and query word highlighting. Detailed documentation is available.
ASPseek is Internet search engine software developed by SWsoft and licensed as free software under the GNU GPL.
ASPseek consists of an indexing robot, a search daemon, and a CGI search frontend. It can index as many as a few million URLs and search for words and phrases, use wildcards, and do a Boolean search. Search results can be limited to a given time period, site, or Web space (a set of sites) and sorted by relevance (PageRank is used) or date.
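To make the idea of a Boolean search over indexed words concrete, here is a minimal sketch of an inverted index in C++ using the STL. It is not taken from ASPseek's source: the names `InvertedIndex`, `PostingList`, `add_document`, `lookup`, `boolean_and`, and `boolean_or` are hypothetical, and the whitespace tokenizer is a deliberate simplification.

```cpp
#include <algorithm>
#include <cassert>
#include <iterator>
#include <map>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical toy index: maps each word to a sorted list of document ids.
using PostingList = std::vector<int>;
using InvertedIndex = std::map<std::string, PostingList>;

// Add the whitespace-separated words of one document to the index.
// Documents must be added in non-decreasing id order so that every
// posting list stays sorted (the set algorithms below rely on that).
void add_document(InvertedIndex& index, int doc_id, const std::string& text) {
    std::istringstream words(text);
    std::string word;
    while (words >> word) {
        PostingList& postings = index[word];
        assert(postings.empty() || postings.back() <= doc_id);
        if (postings.empty() || postings.back() != doc_id) {
            postings.push_back(doc_id);  // record each document once per word
        }
    }
}

// Return the posting list for a word, or an empty list if it is unknown.
PostingList lookup(const InvertedIndex& index, const std::string& word) {
    auto it = index.find(word);
    return it == index.end() ? PostingList{} : it->second;
}

// Boolean AND: documents that appear in both posting lists.
PostingList boolean_and(const PostingList& a, const PostingList& b) {
    PostingList out;
    std::set_intersection(a.begin(), a.end(), b.begin(), b.end(),
                          std::back_inserter(out));
    return out;
}

// Boolean OR: documents that appear in at least one posting list.
PostingList boolean_or(const PostingList& a, const PostingList& b) {
    PostingList out;
    std::set_union(a.begin(), a.end(), b.begin(), b.end(),
                   std::back_inserter(out));
    return out;
}
```

A query such as "free AND search" then becomes `boolean_and(lookup(index, "free"), lookup(index, "search"))`. Keeping posting lists sorted lets both operations run in a single linear merge pass.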
ASPseek is optimized for multiple sites (threaded indexing, async DNS lookups, grouping results by site, Web spaces), but can be used for searching one site as well. ASPseek can work with multiple languages/encodings at once (including multibyte encodings such as Chinese) thanks to its Unicode storage mode. Other features include stopwords and ispell support, a charset and language guesser, HTML templates for search results, excerpts, and query word highlighting.
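As an illustration of how stopwords and query word highlighting can interact, the following C++ sketch wraps whole-word matches in `<b>` tags while skipping stopwords. It is an assumption-laden example, not ASPseek's implementation: the function names `to_lower` and `highlight`, the `<b>` markup, and the ASCII-only word splitting are all hypothetical choices, and query words are assumed to be supplied in lowercase.

```cpp
#include <cctype>
#include <cstddef>
#include <set>
#include <string>

// Lowercase an ASCII word so that matching is case-insensitive.
std::string to_lower(std::string word) {
    for (char& c : word) {
        c = static_cast<char>(std::tolower(static_cast<unsigned char>(c)));
    }
    return word;
}

// Wrap every whole-word occurrence of a query word in <b>...</b>.
// Words listed in `stopwords` are never highlighted, even if queried.
// Assumes `query_words` and `stopwords` contain lowercase ASCII words.
std::string highlight(const std::string& text,
                      const std::set<std::string>& query_words,
                      const std::set<std::string>& stopwords) {
    std::string out;
    std::size_t i = 0;
    while (i < text.size()) {
        if (!std::isalnum(static_cast<unsigned char>(text[i]))) {
            out += text[i++];  // copy separators and punctuation unchanged
            continue;
        }
        // Find the end of the current run of alphanumeric characters.
        std::size_t j = i;
        while (j < text.size() &&
               std::isalnum(static_cast<unsigned char>(text[j]))) {
            ++j;
        }
        const std::string word = text.substr(i, j - i);
        const std::string key = to_lower(word);
        if (query_words.count(key) && !stopwords.count(key)) {
            out += "<b>" + word + "</b>";  // keep the original casing
        } else {
            out += word;
        }
        i = j;
    }
    return out;
}
```

Matching on whole words rather than substrings avoids highlighting "search" inside "research". A real engine that stores text as Unicode would need encoding-aware word splitting instead of `std::isalnum`.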
ASPseek is written in C++ using the STL library, and uses a mix of an SQL database and binary files for storage.