1 |
This project is side-effect of webpac indexer -- I learned so much about |
This project is side-effect of WebPac indexer -- I learned so much about |
2 |
perl and swish-e during working on webpac so I decided to make smaller |
perl and swish-e during working on WebPac so I decided to make smaller |
3 |
cgi interface to swish for general purpose web crawling and searching. |
cgi interface to swish for general purpose web crawling and searching. |
4 |
|
|
5 |
|
|
11 |
|
|
12 |
Then, rot13.cgi will use rot13.xml file as configuration. |
Then, rot13.cgi will use rot13.xml file as configuration. |
13 |
|
|
14 |
|
|
15 |
|
This perl cgi font-end has following interesting features: |
16 |
|
1. old (easy to understand :-) swish spider modified to support "no parent" |
17 |
|
URL (url above which it will stop spidering -- useful if you want to |
18 |
|
spider just your personal pages under some URL) |
19 |
|
2. separate xml configuration file for one index (all uses same cgi script) |
20 |
|
3. no need to design html pages (but, that limits you to one rather ugly |
21 |
|
design with some fill-in words -- someone could change that to use |
22 |
|
templates as I will probably in future) |
23 |
|
4. support for searching using Lingua::Spelling::Alternative module which |
24 |
|
uses ispell affix or findaffix data to create variations of entered words |
25 |
|
5. support for converting swish UTF-8 output to some other encoding on web |
26 |
|
using iconv. |
27 |
|
|
28 |
|
|
29 |
Dobrica Pavlinusic <dpavlin@rot13.org> 2003-03-16 |
Dobrica Pavlinusic <dpavlin@rot13.org> 2003-03-16 |