This is a repository of my old source code, which is no longer updated. Go to git.rot13.org for current projects!

Log of /trunk/spider


Revision 40
Modified Sun Jun 1 11:45:19 2003 UTC (16 years, 10 months ago) by dpavlin
- support for listing of files in .tar.gz; decompressing of .gz and .bz2
  content
- changed order of arguments for swishspider: now baseurl,url (but it's
  backwards compatible, so your old configurations will work)
- do html fixup just on html files (to prevent binary archive corruption)
- crawl sites that have frames


Revision 32
Modified Wed Apr 30 12:40:09 2003 UTC (16 years, 11 months ago) by dpavlin
added make_config.pl which creates swish config file
added checkbox to hide document properties (like content, size etc)
remove comments between <html> and <head> which confuse swish


Revision 30
Modified Mon Mar 24 09:57:44 2003 UTC (17 years ago) by dpavlin
added instructions about formatting of html before indexing it (and added
ability to unroll wrongly split tags into a form which is acceptable to swish)


Revision 15
Modified Sun Mar 16 21:31:55 2003 UTC (17 years ago) by dpavlin
support for image map and skip pictures (speedup)


Revision 1
Added Tue Jun 4 06:39:53 2002 UTC (17 years, 10 months ago) by dpavlin
Initial revision

