Active Data Online SearchEngine is a hypertextual search engine software suite. It comprises three Windows Forms applications (a data management console, a crawler and an indexer), a console application (a refresher bot) and an ASP.NET search engine module (.aspx's, .ascx's, dll's etc). SearchEngine can crawl, index and store thousands of web pages, and provide accurate search results based on regular expression keyword matching. SearchEngine supports most languages. It is optimised for English and other European languages including French and German. The crawler can fetch web pages encoded in alphabets other than Western Latin. And the data store can accept words in any language supported by UTF-8.
SearchEngine uses its own data repository as opposed to a relational database such as SQL Server. Its file and data structures have been designed to keep operational costs as low as possible.
The software architecture of SearchEngine is modular but centered around the data store. A key feature of SearchEngine is the use of queues and the separation of the crawler and indexer. This design enables the processes of crawling and indexing to be separated and in some cases distributed or shared over a number of computers within a Local Area Network.
SearchEngine includes a DOS console application for refreshing a search index periodically and automatically via Windows scheduled tasks.
The suite also comes with its own data management console. Aside from providing a means to manage the various queues, stacks and lists, this tool can also be used to cut out gibberish from a search repository or index, where necessary, and therefore keep the search engine useful.
SearchEngine is available in two versions: the full, unrestricted version, which can handle thousands of pages or more; and a lite version, which is restricted to 100 pages. |