/[webpac]/trunk/all2xml.pl
This is repository of my old source code which isn't updated any more. Go to git.rot13.org for current projects!
ViewVC logotype

Log of /trunk/all2xml.pl

Parent Directory Parent Directory | Revision Log Revision Log


Links to HEAD: (view) (annotate)
Sticky Revision:

Revision 180 - (view) (annotate) - [select for diffs]
Modified Tue Nov 25 20:04:24 2003 UTC (20 years, 3 months ago) by dpavlin
File length: 23064 byte(s)
Diff to previous 178 , to selected 35
invalidate memory cache when needed


Revision 178 - (view) (annotate) - [select for diffs]
Modified Mon Nov 24 21:54:19 2003 UTC (20 years, 3 months ago) by dpavlin
File length: 23001 byte(s)
Diff to previous 177 , to selected 35
major improvements: you can select order of scanning in each topic tag
to be eather by line (which is default, repeatable fields in one line will
be unrolled) or page-by-page (using new interate_by_page="1" attribute).
New page-by-page mode is really useful with lookups (because you can
append fields with lookups in same line, but using two tags), but it will
create multiple rows in html output.


Revision 177 - (view) (annotate) - [select for diffs]
Modified Mon Nov 24 01:19:15 2003 UTC (20 years, 3 months ago) by dpavlin
File length: 19332 byte(s)
Diff to previous 170 , to selected 35
support for lookup fields. Implemented using GDBM or TDB (which I recommend
because it's fastest implementation)


Revision 170 - (view) (annotate) - [select for diffs]
Modified Sun Nov 23 15:42:16 2003 UTC (20 years, 3 months ago) by dpavlin
File length: 17207 byte(s)
Diff to previous 164 , to selected 35
Re-wrote parsing for ISO-type data (isis, marc) to use in-memory cache of
format... 10% speed improvement and cleaner code. Include filter functions
just once.


Revision 164 - (view) (annotate) - [select for diffs]
Modified Sat Nov 22 22:04:05 2003 UTC (20 years, 4 months ago) by dpavlin
File length: 16888 byte(s)
Diff to previous 163 , to selected 35
implemented filter which can replace (or be used together with) unac_string
from Text::Unaccent


Revision 163 - (view) (annotate) - [select for diffs]
Modified Thu Nov 20 21:23:40 2003 UTC (20 years, 4 months ago) by dpavlin
File length: 16781 byte(s)
Diff to previous 153 , to selected 35
Added type="swish_exact" to save data into swish index with boundaries
xxbxx data xxexxx. This is helpful to implement exact match from beginning
of query and exact match to full query which are defined using e[nr] field
in web user interface (with same [nr] as f[nr] and v[nr] fields) which
have to have value 1 (from beginning) 2 (from end, not that useful...) or
3 (1+2 - exact match)


Revision 153 - (view) (annotate) - [select for diffs]
Modified Sun Nov 16 22:42:41 2003 UTC (20 years, 4 months ago) by dpavlin
File length: 16190 byte(s)
Diff to previous 144 , to selected 35
implemented formats which can be used to produce links between records
in WebPac (documented in README.links)


Revision 144 - (view) (annotate) - [select for diffs]
Modified Sun Nov 16 11:55:18 2003 UTC (20 years, 4 months ago) by dpavlin
File length: 15286 byte(s)
Diff to previous 138 , to selected 35
fixed filters (again)


Revision 138 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 23:10:51 2003 UTC (20 years, 4 months ago) by dpavlin
File length: 15252 byte(s)
Diff to previous 137 , to selected 35
Aargh! I should really go to sleep or make PostgeSQL replication or something...


Revision 137 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 22:57:43 2003 UTC (20 years, 4 months ago) by dpavlin
File length: 15112 byte(s)
Diff to previous 136 , to selected 35
I removed too much: this always added delimiter before first element


Revision 136 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 22:46:49 2003 UTC (20 years, 4 months ago) by dpavlin
File length: 15074 byte(s)
Diff to previous 135 , to selected 35
another fix for repeatable fields


Revision 135 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 21:27:00 2003 UTC (20 years, 4 months ago) by dpavlin
File length: 15165 byte(s)
Diff to previous 109 , to selected 35
fix repeatable fields in index data


Revision 109 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 18:50:39 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 15165 byte(s)
Diff to previous 108 , to selected 35
erase also *.PTR files


Revision 108 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 18:20:27 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 14925 byte(s)
Diff to previous 106 , to selected 35
Overcome limit of 32 open databases. Unfortunatly, OpenIsis in current
version (0.9.0) doesn't support close call, so you need patch from:
http://www.rot13.org/~dpavlin/projects/openisis-0.9.0-perl_close.diff


Revision 106 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 17:09:36 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 14721 byte(s)
Diff to previous 104 , to selected 35
check for bogus *.TXT databases (with zero length or 0 records) and
erase them to force OpenIsis to use binary files


Revision 104 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 10:55:35 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 13949 byte(s)
Diff to previous 102 , to selected 35
remove fake progress bar also


Revision 102 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 10:54:34 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 13917 byte(s)
Diff to previous 101 , to selected 35
removed debugging


Revision 101 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 10:52:13 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 13977 byte(s)
Diff to previous 98 , to selected 35
- better error reporing from OpenIsis
- added show_progress in global.conf to turn off progress bar


Revision 98 - (view) (annotate) - [select for diffs]
Modified Sun Jul 13 22:29:14 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 13585 byte(s)
Diff to previous 97 , to selected 35
fixed ordering


Revision 97 - (view) (annotate) - [select for diffs]
Modified Sun Jul 13 21:57:12 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 13602 byte(s)
Diff to previous 90 , to selected 35
ability to join repeatable fields before inseting into index


Revision 90 - (view) (annotate) - [select for diffs]
Modified Sun Jul 13 13:22:50 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 13300 byte(s)
Diff to previous 81 , to selected 35
repeatable fields (broken when other input formats where introduced) work
again


Revision 81 - (view) (annotate) - [select for diffs]
Modified Tue Jul 8 22:13:56 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 13117 byte(s)
Diff to previous 74 , to selected 35
the great rename: isis2xml.* -> all2xml.*


Revision 74 - (view) (annotate) - [select for diffs]
Modified Sat Jul 5 22:37:30 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 12774 byte(s)
Diff to previous 67 , to selected 35
support for new feed format which have decimal number of field, semicolumn
and space at beginning of each line (like: 0: data)


Revision 67 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 23:29:27 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 12739 byte(s)
Diff to previous 62 , to selected 35
implemented feed method which calls external program that returns
data line-by-line


Revision 62 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 20:11:48 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 11691 byte(s)
Diff to previous 59 , to selected 35
added MARC file import


Revision 59 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 17:57:11 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 10549 byte(s)
Diff to previous 58 , to selected 35
added config tag which can read any variable from isis2xml.conf file for
that library


Revision 58 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 16:56:40 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 9895 byte(s)
Diff to previous 57 , to selected 35
support type and sub-types (in form type_subtype)


Revision 57 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 15:05:23 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 9738 byte(s)
Diff to previous 56 , to selected 35
don't choke on input which iconv can't convert


Revision 56 - (view) (annotate) - [select for diffs]
Modified Wed Jun 25 12:09:27 2003 UTC (20 years, 8 months ago) by dpavlin
File length: 9727 byte(s)
Diff to previous 54 , to selected 35
use start_row from excel.xml


Revision 54 - (view) (annotate) - [select for diffs]
Modified Mon Jun 23 20:20:32 2003 UTC (20 years, 9 months ago) by dpavlin
File length: 9734 byte(s)
Diff to previous 50 , to selected 35
added Microsoft Excel file import


Revision 50 - (view) (annotate) - [select for diffs]
Modified Sun Jun 1 13:46:42 2003 UTC (20 years, 9 months ago) by dpavlin
File length: 7470 byte(s)
Diff to previous 44 , to selected 35
move database arguments to .conf file


Revision 44 - (view) (annotate) - [select for diffs]
Modified Sat Mar 22 22:51:48 2003 UTC (21 years ago) by dpavlin
File length: 7223 byte(s)
Diff to previous 43 , to selected 35
fix


Revision 43 - (view) (annotate) - [select for diffs]
Modified Sat Mar 22 22:43:05 2003 UTC (21 years ago) by dpavlin
File length: 7232 byte(s)
Diff to previous 42 , to selected 35
fixed alphabet soup -- characters encoding should really work now!


Revision 42 - (view) (annotate) - [select for diffs]
Modified Sat Mar 15 21:48:48 2003 UTC (21 years ago) by dpavlin
File length: 7188 byte(s)
Diff to previous 40 , to selected 35
filter fix && optimisation


Revision 40 - (view) (annotate) - [select for diffs]
Modified Sat Mar 15 21:33:36 2003 UTC (21 years ago) by dpavlin
File length: 7153 byte(s)
Diff to previous 35
major de-mungling of different codepages: use same codepage inside perl
(as opposed to UTF-8) and in files on disk


Revision 35 - (view) (annotate) - [selected]
Modified Sun Feb 23 15:47:40 2003 UTC (21 years ago) by dpavlin
File length: 8276 byte(s)
Diff to previous 34
last changes; completly broken charsets


Revision 34 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 08:06:07 2003 UTC (21 years ago) by dpavlin
File length: 8189 byte(s)
Diff to previous 32 , to selected 35
append="1" fix


Revision 32 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 07:53:01 2003 UTC (21 years ago) by dpavlin
File length: 8151 byte(s)
Diff to previous 29 , to selected 35
display fields using order="" attribute


Revision 29 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 07:08:54 2003 UTC (21 years ago) by dpavlin
File length: 7880 byte(s)
Diff to previous 21 , to selected 35
repeatable field support, filter functions added, broken charset (again!)


Revision 21 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 00:00:51 2003 UTC (21 years ago) by dpavlin
File length: 7225 byte(s)
Diff to previous 20 , to selected 35
fix


Revision 20 - (view) (annotate) - [select for diffs]
Modified Sat Feb 22 23:49:22 2003 UTC (21 years ago) by dpavlin
File length: 7245 byte(s)
Diff to previous 17 , to selected 35
add filter="name" for fields (to correct strane input data or make variations
for indexing)


Revision 17 - (view) (annotate) - [select for diffs]
Modified Sat Feb 22 21:33:04 2003 UTC (21 years ago) by dpavlin
File length: 6454 byte(s)
Diff to previous 13 , to selected 35
fix index insertion


Revision 13 - (view) (annotate) - [select for diffs]
Modified Sun Feb 16 22:41:37 2003 UTC (21 years, 1 month ago) by dpavlin
File length: 6528 byte(s)
Diff to previous 10 , to selected 35
added configuration file with database descriptions,
moved isis.xml definition file in separate directory (in preparation for MARK),
support for different encodings in different files,
various fixes, improvements and badly written parts which will change ;-)


Revision 10 - (view) (annotate) - [select for diffs]
Modified Thu Jan 16 17:35:54 2003 UTC (21 years, 2 months ago) by dpavlin
File length: 5683 byte(s)
Diff to previous 9 , to selected 35
bunch of changes: make design more modular, implement index (partial
implementation) and other small and big changes


Revision 9 - (view) (annotate) - [select for diffs]
Modified Sat Jan 11 19:55:30 2003 UTC (21 years, 2 months ago) by dpavlin
File length: 6713 byte(s)
Diff to previous 7 , to selected 35
renamed "old" index to swish, and introduced index which is -- index;
implemented using PostgreSQL for now.


Revision 7 - (view) (annotate) - [select for diffs]
Modified Sat Jan 11 16:44:03 2003 UTC (21 years, 2 months ago) by dpavlin
File length: 5543 byte(s)
Diff to previous 5 , to selected 35
major modifications to produce first (non-working) version of Web CGI
interface.


Revision 5 - (view) (annotate) - [select for diffs]
Modified Sat Jan 11 06:14:48 2003 UTC (21 years, 2 months ago) by dpavlin
File length: 5107 byte(s)
Diff to previous 4 , to selected 35
require 1.02 version of Text::Unaccent (1.01 can't pass 'make test' here!)


Revision 4 - (view) (annotate) - [select for diffs]
Modified Sun Dec 1 22:51:29 2002 UTC (21 years, 3 months ago) by dpavlin
File length: 5065 byte(s)
Diff to previous 3 , to selected 35
remove subfield definition from values which are displayed and indexed


Revision 3 - (view) (annotate) - [select for diffs]
Modified Sat Nov 30 00:36:34 2002 UTC (21 years, 3 months ago) by dpavlin
File length: 4996 byte(s)
Diff to previous 1 , to selected 35
first really working version -- creates xml file for swish + swish config


Revision 1 - (view) (annotate) - [select for diffs]
Added Sun Nov 24 20:52:11 2002 UTC (21 years, 3 months ago) by dpavlin
File length: 1483 byte(s)
Diff to selected 35
Initial revision


This form allows you to request diffs between any two revisions of this file. For each of the two "sides" of the diff, enter a numeric revision.

  Diffs between and
  Type of Diff should be a

  ViewVC Help
Powered by ViewVC 1.1.26