ht://Dig
The ht://Dig system is a complete WWW indexing and searching system for a domain or intranet. The system is developed aiming the search needs for a single company, campus, or even a particular sub section of a website.
The ht://Dig system has the ability to search through many servers on a network by acting as a WWW browser. It searches using various configurable algorithms such as, exact, soundex, metaphone, stemming (common word endings), synonyms, accent stripping, and substring and prefix. This software can search both HTML documents and plain text files. You can add any number of keywords and special meta information inside the HTML page of the document to optimize it for search result.
ht://Dig was developed at San Diego State University as a way to search the different web servers on the campus network. The software was developed under UNIX system using C++. Therefore, to run ht://Dig you need a UNIX machine and C++ compiler. However, to compile some of the GNU libraries you require C compiler along with C++.
The ht://Dig system is free and the full source code of this search engine is released under GNU General Public License version 2.0. You can use and distribute the source code under the terms and conditions of this license.