/[swish]/trunk/spider
This is repository of my old source code which isn't updated any more. Go to git.rot13.org for current projects!
ViewVC logotype

Log of /trunk/spider

View Directory Listing Directory Listing


Sticky Revision:

Revision 40 - Directory Listing
Modified Sun Jun 1 11:45:19 2003 UTC (20 years, 10 months ago) by dpavlin
- support for listing of files in .tar.gz; decompressing of .gz and .bz2
  content
- changed order of arguments for swishspider: now baseurl,url (but it's
  backwards compatibile, so your old configurations will work)
- do html fixup just on html files (to prevent binary archive corruption)
- crawl sites that have frames


Revision 32 - Directory Listing
Modified Wed Apr 30 12:40:09 2003 UTC (20 years, 11 months ago) by dpavlin
added make_config.pl which creates swish config file
added checkbox to hide document properties (like content, size etc)
remove comments between <html> and <head> which confuse swish


Revision 30 - Directory Listing
Modified Mon Mar 24 09:57:44 2003 UTC (21 years, 1 month ago) by dpavlin
added instructions about formating of html before indexing it (and added
ability to unroll wrongly splited tags in form which is acceptable to swish)


Revision 15 - Directory Listing
Modified Sun Mar 16 21:31:55 2003 UTC (21 years, 1 month ago) by dpavlin
support for image map and skip pictures (speedup)


Revision 1 - Directory Listing
Added Tue Jun 4 06:39:53 2002 UTC (21 years, 10 months ago) by dpavlin
Initial revision


  ViewVC Help
Powered by ViewVC 1.1.26