
Diff of /trunk/README


revision 12 by dpavlin, Sun Mar 16 21:20:22 2003 UTC
revision 32 by dpavlin, Wed Apr 30 12:40:09 2003 UTC
# Line 1  Line 1 
1  This project is side-effect of webpac indexer -- I learned so much about  This project is side-effect of WebPac indexer -- I learned so much about
2  perl and swish-e during working on webpac so I decided to make smaller  perl and swish-e during working on WebPac so I decided to make smaller
3  cgi interface to swish for general purpose web crawling and searching.  cgi interface to swish for general purpose web crawling and searching.
4    
5    
# Line 11  ln -s swish.cgi rot13.cgi Line 11  ln -s swish.cgi rot13.cgi
11    
12  Then, rot13.cgi will use rot13.xml file as configuration.  Then, rot13.cgi will use rot13.xml file as configuration.
13    
14  Dobrica Pavlinusic <dpavlin@rot13.org> 2003-03-16  
15    This perl cgi front-end has the following interesting features:
16    1. old (easy to understand :-) swish spider modified to support a "no parent"
17       URL (the URL above which it will stop spidering -- useful if you want to
18       spider just your personal pages under some URL)
19    2. separate xml configuration file for each index (all use the same cgi script)
20    3. no need to design html pages (but that limits you to one rather ugly
21       design with some fill-in words -- someone could change that to use
22       templates, as I probably will in the future)
23    4. support for searching using the Lingua::Spelling::Alternative module, which
24       uses ispell affix or findaffix data to create variations of entered words
25    5. support for converting swish UTF-8 output to some other encoding on the web
26       using iconv.
27    
28    
29    For an example of how to create a useful configuration file, take a look at
30    the included Makefile. In short, make_config.pl can create the configuration
31    file for swish, the xml configuration for the cgi script and the necessary symlink
32    with just:
33    
34    $ make_config.pl name_of_config http://host/url_to_index/ [strip from url]
35    
36    The optional "strip from url" argument removes that part of the path when storing
37    URLs in the swish index. That lets you create indexes which can later be merged
38    into one combined index easily. It's also useful if you want to index your
39    whole /doc directory and don't want that prefix in each and every entry in
40    the index (which should save some space on disk too!).
41    
42    
43    Dobrica Pavlinusic <dpavlin@rot13.org> 2003-04-26
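
The two conventions described in the README above (a symlinked cgi name
selecting its own xml file, and "strip from url" shortening stored URLs) can
be sketched in shell roughly as follows. This is only an illustration and is
not part of the diff: the function names config_for and strip_from_url are
hypothetical, and the sketch assumes the strip argument is removed as a
literal substring at its first occurrence, which may differ from what
swish.cgi and make_config.pl actually do.

```shell
#!/bin/sh
# Illustrative helpers only; these names do not come from swish.cgi
# or make_config.pl.

# Map the name a cgi script was invoked under to its configuration file,
# e.g. rot13.cgi (a symlink to swish.cgi) -> rot13.xml
config_for() {
    name=$(basename "$1" .cgi)
    printf '%s.xml\n' "$name"
}

# Remove the first literal occurrence of a "strip from url" string.
# Assumption: plain substring removal; the URL is returned unchanged
# when the string does not occur in it.
strip_from_url() {
    url=$1
    strip=$2
    case $url in
        *"$strip"*)
            head=${url%%"$strip"*}   # everything before the first match
            tail=${url#*"$strip"}    # everything after the first match
            printf '%s%s\n' "$head" "$tail"
            ;;
        *)
            printf '%s\n' "$url"
            ;;
    esac
}

config_for /var/www/cgi-bin/rot13.cgi
# prints rot13.xml
strip_from_url http://host/doc/manual/index.html /doc
# prints http://host/manual/index.html
```

With the /doc prefix stripped this way, entries from separately built indexes
share the same shorter form, which is what makes a later merge
straightforward.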
