NAME GlimpseHTTP WHAT IS GLIMPSE Glimpse (which stands for GLobal IMPlicit SEarch) is an indexing and query system that allows you to search through lots of files in many (possibly nested) directories very quickly. Glimpseindex, which you run by saying glimpseindex builds a very small index (2-5% of the text). With it, glimpse can search through all the files in these directories much the same way as grep, except that you don't have to specify file names. Glimpse supports most of agrep's options (agrep is our powerful version of grep, and it is part of glimpse) including approximate matching (e.g., finding misspelled words), Boolean queries, and even some limited forms of regular expressions. DESCRIPTION GlimpseHTTP is a collection of tools that allows you to incorporate glimpse in WWW documents. With it, you can provide general search capabilities to any user without incurring too much space overhead. Furthermore, these tools allow you to integrate search with browsing. If you have several nested directories which the user may browse, you can include the glimpse interface in each document such that only the relevant directories will be included in the search. More details are given below. The current version of GlimpseHTTP was tested under httpd 1.2 HTML server from NCSA and Glimpse currently works on many Unix platforms. To search and browse the information any HTML browser can be used (this includes NCSA Mosaic for X-Windows, MS-Windows and Macintosh, Lynx and other browsers. For maximum convenience your browser should support forms, although minimal functionality can be achieved with any browser). Since GlimpseHTTP uses Glimpse, this provides some unique features - A very small index (3-5% of the total text). - Reasonably fast search. - Search for approximate match allowing errors. In addition, GlimpseHTTP provides you with the following capabilities: - You can use a combination of browsing and searching: first, you locate the directory where the relevant information can be stored, then you can use search to locate specific files. - The result of the search is a nicely formatted hypertext with hyperlinks to matching documents. - Following the hyperlink leads you not only to a particular file, but also to the exact place where the match occured. - Hyperlinks in the documents are converted on the fly to actual hyperlinks, which you can follow immediately. This makes the GlimpseHTTP particularily suitable for searching meta-information (Internet directories etc.). - Similar tools are provided for archiving and searching USENET newsgroups. You can maintain the archive of news articles and allow people to search your archive using the same interface. Features supported include kill-file for articles and fast search for particular posters. Since news archiver uses NNTP interface, you can archive news articles from remote news servers. (Browse and search for news is yet to be implemented: browsing in this case means selection of pertinent newsgroup(s), currently supported is only the search within one newsgroup a time) Among the possible applications of GlimpseHTTP we envision: - FTP sites with search possibilities; - news archiving sites; - any search application which should be accessed over local or global network where searching for approximate match and/or saving of disk space for indices is an issue. GlimpseHTTP components: 1. aglimpse - "Archive Glimpse" - a tool for searching file hierarchies indexed for Glimpse. aglimpse is a CGI-compliant program which performs the search and formats the output as HTML document with hyperlinks to the matches. 2. Administrative tools which facilitate maintaining and indexing of Glimpse archives. One of the programs is the HTML indexer which prepares hypertext indices for each searchable directory - this supports the concept of combined browsing and searching. 3. GlimpseNews - a collection of tools for archiving and searching newsgroups archives. SEE ALSO http://glimpse.cs.arizona.edu:1994/glimpsehttp.html - GlimpseHTTP home page. http://glimpse.cs.arizona.edu:1994 - Glimpse developers home page. README.install - directions on installing GlimpseHTTP on your server. README.amgr - description of Archive Manager. README.indexing - descriptioN of HTML indexer. AUTHORS Paul Klark (GlimpseHTTP) Udi Manber, Sun Wu, and Burra Gopal (Glimpse) University of Arizona, Department of Computer Science To be put on glimpse mailing list, send mail to glimpse-request@cs.arizona.edu