This is a repository of my old source code, which isn't updated any more. Go to git.rot13.org for current projects!
Log of /trunk/all2xml.pl

Revision 775 - (view) (annotate) - [select for diffs]
Modified Sun Aug 3 06:31:00 2008 UTC (11 years, 4 months ago) by dpavlin
File length: 30252 byte(s)
Diff to previous 750 , to selected 50
switch back to GDBM_File which is included in recent
Debian, and TDB_File won't compile against tdb-dev


Revision 750 - (view) (annotate) - [select for diffs]
Modified Sun Oct 29 16:34:25 2006 UTC (13 years, 1 month ago) by dpavlin
File length: 30252 byte(s)
Diff to previous 747 , to selected 50
use join_subfields_with from Biblio::Isis 0.23 to handle repeatable subfields


Revision 747 - (view) (annotate) - [select for diffs]
Modified Tue Jun 6 12:34:25 2006 UTC (13 years, 6 months ago) by dpavlin
File length: 30208 byte(s)
Diff to previous 730 , to selected 50
added check for broken CDATA at beginning, fix also broken delimiters


Revision 730 - (view) (annotate) - [select for diffs]
Modified Thu Apr 13 19:44:51 2006 UTC (13 years, 8 months ago) by dpavlin
File length: 29849 byte(s)
Diff to previous 726 , to selected 50
support distinct flag for field to show just unique values if field is
repeatable
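
A minimal Python sketch of what a distinct flag on a repeatable field amounts to: duplicates are dropped and the first-seen order is kept. The function name and sample values are hypothetical; the log does not show how all2xml.pl implements this.

```python
def distinct_values(values):
    """Return values with duplicates removed, keeping first-seen order."""
    # dict preserves insertion order, so fromkeys() acts as an ordered set
    return list(dict.fromkeys(values))


# Hypothetical values of one repeatable field
repeated = ["Zagreb", "Split", "Zagreb", "Rijeka", "Split"]
unique = distinct_values(repeated)
print(unique)
```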


Revision 726 - (view) (annotate) - [select for diffs]
Modified Thu Apr 13 19:35:19 2006 UTC (13 years, 8 months ago) by dpavlin
File length: 29105 byte(s)
Diff to previous 679 , to selected 50
fix start_row when using excel import


Revision 679 - (view) (annotate) - [select for diffs]
Modified Mon Feb 28 10:01:34 2005 UTC (14 years, 9 months ago) by dpavlin
File length: 29093 byte(s)
Diff to previous 678 , to selected 50
added import_xml_file and import_xml_tag into configuration file,
documentation for .dbf import


Revision 678 - (view) (annotate) - [select for diffs]
Modified Sun Feb 27 23:07:35 2005 UTC (14 years, 9 months ago) by dpavlin
File length: 28947 byte(s)
Diff to previous 673 , to selected 50
Experimental support for dBase .dbf files. Usage like this in all2xml.conf:

[hda]
       dbf_file=/data/drustvene/hda/ISO.DBF
       type=dbf
       dbf_codepage=cp852
       dbf_mapping=<<_END_OF_MAP_
ID_BROJ        001
ISBN_BROJ      010
SKUPINA1       200
SKUPINA2       205
SKUPINA4       210
SKUPINA5       215
SKUPINA6       225
SKUPINA7       300
ANOTACIJA      330
PREDMET1       610
PREDMET2       610
PREDMET3       510
UDK            675
REDALICA       700
SIGNATURA      990
_END_OF_MAP_

dbf type will use <isis> tag in import_xml and dbf_codepage will
override codepage specified in import_xml file.

Small code refactoring.
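
The dbf_mapping block above pairs a .dbf column name with a field number on each line. A rough Python sketch of reading such a block follows; it is an illustration only, not the script's Perl code. The function names are hypothetical, and grouping columns that share a field number (as PREDMET1 and PREDMET2 do with 610) is an assumption about how such a mapping might be used.

```python
from collections import defaultdict

# Shortened copy of the mapping from the log message
MAPPING = """\
ID_BROJ        001
ISBN_BROJ      010
PREDMET1       610
PREDMET2       610
UDK            675
"""


def parse_dbf_mapping(text):
    """Parse 'COLUMN FIELD' lines into a list of (column, field) pairs."""
    pairs = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        pairs.append((parts[0], parts[1]))
    return pairs


def columns_by_field(pairs):
    """Group column names by target field number (assumed behaviour)."""
    grouped = defaultdict(list)
    for column, field in pairs:
        grouped[field].append(column)
    return dict(grouped)


pairs = parse_dbf_mapping(MAPPING)
fields = columns_by_field(pairs)
print(fields["610"])
```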



Revision 673 - (view) (annotate) - [select for diffs]
Modified Thu Feb 17 18:11:57 2005 UTC (14 years, 9 months ago) by dpavlin
File length: 26574 byte(s)
Diff to previous 672 , to selected 50
bugfix: check Isis database error in correct place


Revision 672 - (view) (annotate) - [select for diffs]
Modified Thu Feb 17 18:08:14 2005 UTC (14 years, 9 months ago) by dpavlin
File length: 26581 byte(s)
Diff to previous 650 , to selected 50
bugfix: database names are now always transferred to the filtering function
(previously this worked only with Isis databases)


Revision 650 - (view) (annotate) - [select for diffs]
Modified Thu Jan 27 20:23:36 2005 UTC (14 years, 10 months ago) by dpavlin
File length: 26458 byte(s)
Diff to previous 647 , to selected 50
bugfixes and improvements: skip MARC databases that can't be opened,
fix progress bar on MARC databases (so that it finishes at 100%),
minor pod update


Revision 647 - (view) (annotate) - [select for diffs]
Modified Thu Jan 27 17:55:09 2005 UTC (14 years, 10 months ago) by dpavlin
File length: 26411 byte(s)
Diff to previous 643 , to selected 50
don't die if ISIS database is not found, just go to next one


Revision 643 - (view) (annotate) - [select for diffs]
Modified Sun Jan 23 15:18:03 2005 UTC (14 years, 10 months ago) by dpavlin
File length: 26355 byte(s)
Diff to previous 642 , to selected 50
add filtering to index (using parameter filter, for now single)


Revision 642 - (view) (annotate) - [select for diffs]
Modified Sun Jan 23 14:31:02 2005 UTC (14 years, 10 months ago) by dpavlin
File length: 26331 byte(s)
Diff to previous 641 , to selected 50
renamed tag to finger to avoid confusion (I tried to explain why I use the term
tag and failed -- it's too similar to tags used in import_xml)


Revision 641 - (view) (annotate) - [select for diffs]
Modified Sun Jan 23 02:02:10 2005 UTC (14 years, 10 months ago) by dpavlin
File length: 26325 byte(s)
Diff to previous 632 , to selected 50
New implementation of indexes: now it uses only two tables (index for all
data and tags for all tags). Currently, it doesn't enforce relation between
them on RDBMS level (I have to test this code against SQLite and MySQL
before enforcing that).
Removed swish-e output while indexing, database is used as default tag to
enable filtering by database (there isn't a possibility to set tag to something
else yet!). Output usage count in index.


Revision 632 - (view) (annotate) - [select for diffs]
Modified Sun Jan 16 18:35:24 2005 UTC (14 years, 11 months ago) by dpavlin
File length: 26337 byte(s)
Diff to previous 628 , to selected 50
use Biblio::Isis instead of IsisDB (same module, but name is changed for CPAN
upload)


Revision 628 - (view) (annotate) - [select for diffs]
Modified Sun Jan 2 22:09:01 2005 UTC (14 years, 11 months ago) by dpavlin
File length: 26330 byte(s)
Diff to previous 626 , to selected 50
yet another progress bar fix


Revision 626 - (view) (annotate) - [select for diffs]
Modified Sun Jan 2 00:53:33 2005 UTC (14 years, 11 months ago) by dpavlin
File length: 26031 byte(s)
Diff to previous 623 , to selected 50
fix for progress bar (don't fake slowdown)


Revision 623 - (view) (annotate) - [select for diffs]
Modified Sat Jan 1 19:09:53 2005 UTC (14 years, 11 months ago) by dpavlin
File length: 25982 byte(s)
Diff to previous 622 , to selected 50
cleanup and documentation updates


Revision 622 - (view) (annotate) - [select for diffs]
Modified Sat Jan 1 19:01:55 2005 UTC (14 years, 11 months ago) by dpavlin
File length: 25984 byte(s)
Diff to previous 620 , to selected 50
use IsisDB module instead of OpenIsis -- this will fix various problems in
index generation because IsisDB doesn't have the problems that the OpenIsis Perl
bindings do.


Revision 620 - (view) (annotate) - [select for diffs]
Modified Sat Jan 1 18:16:21 2005 UTC (14 years, 11 months ago) by dpavlin
File length: 27480 byte(s)
Diff to previous 619 , to selected 50
use newer MARC::File::USMARC instead of MARC


Revision 619 - (view) (annotate) - [select for diffs]
Modified Fri Dec 31 04:22:49 2004 UTC (14 years, 11 months ago) by dpavlin
File length: 25832 byte(s)
Diff to previous 618 , to selected 50
important change: use IsisDB instead of OpenIsis


Revision 618 - (view) (annotate) - [select for diffs]
Modified Fri Dec 31 03:35:43 2004 UTC (14 years, 11 months ago) by dpavlin
File length: 27323 byte(s)
Diff to previous 599 , to selected 50
new nicer progress bar (back-ported from v2)


Revision 599 - (view) (annotate) - [select for diffs]
Modified Wed Dec 8 18:23:25 2004 UTC (15 years ago) by dpavlin
File length: 26943 byte(s)
Diff to previous 488 , to selected 50
fix repeatable field names which have non 7-bit ascii characters


Revision 488 - (view) (annotate) - [select for diffs]
Modified Wed Sep 29 17:22:24 2004 UTC (15 years, 2 months ago) by dpavlin
File length: 26915 byte(s)
Diff to previous 379 , to selected 50
changes to support UTF-8 encoding from
SpreadSheet::ParseExcel::FmtDefault.

You will have to modify line 69 from
	return pack('C*', unpack('n*', $sTxt));
to following which returns utf-8:
	return pack('U*', unpack('n*', $sTxt));
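
The effect of that one-character change can be sketched in Python (an illustration, not the Perl module's code). The input is treated as big-endian 16-bit values, as unpack('n*', ...) does. Packing each value as a single byte keeps only its low 8 bits, while treating each value as a code point preserves the character. The helper names are hypothetical, and the byte-wrapping shown for 'C*' is an approximation of Perl's behaviour for values above 255.

```python
import struct


def code_units(raw):
    """Split big-endian 16-bit data into integers, like unpack('n*', ...)."""
    return struct.unpack(">%dH" % (len(raw) // 2), raw)


def pack_as_bytes(units):
    """Approximate pack('C*', ...): one byte per value, high bits dropped."""
    return bytes(u & 0xFF for u in units)


def pack_as_utf8(units):
    """Approximate pack('U*', ...): each value is a code point, UTF-8 encoded."""
    return "".join(chr(u) for u in units).encode("utf-8")


# "\u010ca\u0161a" is a short Croatian word with two non-ASCII letters
raw = "\u010ca\u0161a".encode("utf-16-be")
units = code_units(raw)
print(pack_as_bytes(units))                 # non-ASCII letters are mangled
print(pack_as_utf8(units).decode("utf-8"))  # text survives intact
```

Like the Perl one-liner, this treats every 16-bit value as a complete character, so it only holds for text without surrogate pairs.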



Revision 379 - (view) (annotate) - [select for diffs]
Modified Wed Jul 7 09:55:45 2004 UTC (15 years, 5 months ago) by dpavlin
File length: 26741 byte(s)
Diff to previous 333 , to selected 50
create missing lookup files


Revision 333 - (view) (annotate) - [select for diffs]
Modified Tue May 18 18:15:19 2004 UTC (15 years, 6 months ago) by dpavlin
File length: 26617 byte(s)
Diff to previous 320 , to selected 50
print warning if type is not handled (probably a typo)


Revision 320 - (view) (annotate) - [select for diffs]
Modified Sun Apr 18 00:57:39 2004 UTC (15 years, 8 months ago) by dpavlin
File length: 26547 byte(s)
Diff to previous 298 , to selected 50
implement my_unac_string function, and my_unac_filter option in global.conf
which you *REALLY* want to use if you don't have only clean 7-bit characters 
in your data


Revision 298 - (view) (annotate) - [select for diffs]
Modified Fri Apr 2 23:31:25 2004 UTC (15 years, 8 months ago) by dpavlin
File length: 26333 byte(s)
Diff to previous 290 , to selected 50
You can now specify configuration file as command-line option, and
if you don't do that, it will use default one called all2xml.conf


Revision 290 - (view) (annotate) - [select for diffs]
Modified Sun Mar 14 20:19:42 2004 UTC (15 years, 9 months ago) by dpavlin
File length: 26159 byte(s)
Diff to previous 279 , to selected 50
delimiter and append now works as expected


Revision 279 - (view) (annotate) - [select for diffs]
Modified Sun Mar 14 14:59:43 2004 UTC (15 years, 9 months ago) by dpavlin
File length: 26152 byte(s)
Diff to previous 274 , to selected 50
Implemented new form of delimiters like this:

<tag>
	<delimiter>, </delimiter>
	<value>200a</value>
</tag>

which is equivalent to the following old mark-up:

<tag delimiter=", ">200a</tag>

but it won't lose spaces in attribute values (which
are invalid by XML specification and XML::Simple removes
them so WebPac never gets them)
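
The two mark-up forms carry the same delimiter and value, which the Python sketch below reads with the standard library's ElementTree. The function name is hypothetical. ElementTree happens to keep the trailing space in both forms, so the sketch says nothing about the XML::Simple behaviour described in the message; it only shows that the element form can be read the same way as the attribute form.

```python
import xml.etree.ElementTree as ET

NEW_FORM = "<tag><delimiter>, </delimiter><value>200a</value></tag>"
OLD_FORM = '<tag delimiter=", ">200a</tag>'


def read_tag(xml_text):
    """Return (delimiter, value) for either mark-up form."""
    elem = ET.fromstring(xml_text)
    child = elem.find("delimiter")
    if child is not None:
        # New form: delimiter and value are child elements
        return child.text or "", elem.findtext("value", default="")
    # Old form: delimiter is an attribute, value is the element text
    return elem.get("delimiter", ""), elem.text or ""


delimiter, value = read_tag(NEW_FORM)
print(repr(delimiter), value)
```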


Revision 274 - (view) (annotate) - [select for diffs]
Modified Sun Mar 14 11:50:29 2004 UTC (15 years, 9 months ago) by dpavlin
File length: 25529 byte(s)
Diff to previous 263 , to selected 50
<config> tags (which use values from all2xml.conf) are now properly handled
if there is more than one in same swish tag. However, to use <config
type="index"> is useless IMHO, and <config type="index_lookup"> is not
implemented.


Revision 263 - (view) (annotate) - [select for diffs]
Modified Fri Mar 12 15:06:58 2004 UTC (15 years, 9 months ago) by dpavlin
File length: 25532 byte(s)
Diff to previous 259 , to selected 50
ported r260 from hidra branch: moved eval to parse_format.pm where it
belongs. Also changed eval format to: eval{v901^a eq "Mikrotezaurus"}
(please note same format as in ISIS formating language)


Revision 259 - (view) (annotate) - [select for diffs]
Modified Thu Mar 11 18:23:59 2004 UTC (15 years, 9 months ago) by dpavlin
File length: 25885 byte(s)
Diff to previous 256 , to selected 50
ported 257:258 from hidra branch
all2xml.pl - fix for swish without filter
openisis/perl/OpenIsis.pm - removed warning


Revision 256 - (view) (annotate) - [select for diffs]
Modified Tue Mar 9 12:18:17 2004 UTC (15 years, 9 months ago) by dpavlin
File length: 25878 byte(s)
Diff to previous 255 , to selected 50
ported r254 from hidra branch


Revision 255 - (view) (annotate) - [select for diffs]
Modified Tue Mar 9 12:17:05 2004 UTC (15 years, 9 months ago) by dpavlin
File length: 26092 byte(s)
Diff to previous 234 , to selected 50
ported r248:252 from hidra branch:

r248: much improved installation instructions, especially for Debian
      GNU/Linux distributions
r249: changed use of Spreadsheet::ParseExcel and MARC to require/import so
      that dependency on those modules can be resolved in runtime.
r250: finished installation documentation
r251: removing dependency on HTML::Parser would ease installation
r252: smaller eval{} fixes. eval{} logic should really move to
      parse_format.pm


Revision 234 - (view) (annotate) - [select for diffs]
Modified Sun Mar 7 22:51:14 2004 UTC (15 years, 9 months ago) by dpavlin
File length: 25755 byte(s)
Diff to previous 233 , to selected 50
eval{...} now works for type="swish" also...


Revision 233 - (view) (annotate) - [select for diffs]
Modified Fri Mar 5 23:33:19 2004 UTC (15 years, 9 months ago) by dpavlin
File length: 25796 byte(s)
Diff to previous 231 , to selected 50
lookup_key and lookup_val types now support filters


Revision 231 - (view) (annotate) - [select for diffs]
Modified Fri Mar 5 22:53:30 2004 UTC (15 years, 9 months ago) by dpavlin
File length: 25556 byte(s)
Diff to previous 224 , to selected 50
clear memory cache when opening new file lookup


Revision 224 - (view) (annotate) - [select for diffs]
Modified Sun Feb 8 20:16:54 2004 UTC (15 years, 10 months ago) by dpavlin
File length: 25490 byte(s)
Diff to previous 218 , to selected 50
important bug fix for bug introduced in 1.57: it might eat your data
if you are not using filter. This one was hard to find...


Revision 218 - (view) (annotate) - [select for diffs]
Modified Thu Feb 5 10:55:58 2004 UTC (15 years, 10 months ago) by dpavlin
File length: 25489 byte(s)
Diff to previous 215 , to selected 50
Changed never-used format configuration option for import_xml to
marc_format to prevent clash with format for output. If you don't
specify it (as I never do) it will default to 'usmarc' which is probably
the right thing (tm).


Revision 215 - (view) (annotate) - [select for diffs]
Modified Sun Feb 1 22:06:00 2004 UTC (15 years, 10 months ago) by dpavlin
File length: 25462 byte(s)
Diff to previous 207 , to selected 50
brown-bag bug: I was using MARC.pm wrong: now whole file will be loaded
at start of indexing, changing memory usage to much more step-like, but
that enables real progress indicator and few seconds gain in indexing
speed.


Revision 207 - (view) (annotate) - [select for diffs]
Modified Sat Jan 31 21:03:06 2004 UTC (15 years, 10 months ago) by dpavlin
File length: 25315 byte(s)
Diff to previous 199 , to selected 50
thesaurus is finally working... It contains recursive entries to parent
term, and we actually needed to display narrower terms, so mem_lookup was
created. Important changes:
- you can write eval{"901a" eq "Mikrotezaurus"} within <isis>
  tag and if expression evaluates to false, no content will be output
  (It's used to hide microthesaurus terms from lower level descriptors)
- mem_lookup.pm now supports formats: you can write something like
  [a:5614];;[d:[a:5614]] and it will correctly embed values


Revision 199 - (view) (annotate) - [select for diffs]
Modified Wed Jan 7 12:29:11 2004 UTC (15 years, 11 months ago) by dpavlin
File length: 25019 byte(s)
Diff to previous 197 , to selected 50
fixed filter delimiter bug


Revision 197 - (view) (annotate) - [select for diffs]
Modified Sun Dec 21 03:27:02 2003 UTC (15 years, 11 months ago) by dpavlin
File length: 25019 byte(s)
Diff to previous 196 , to selected 50
Changed behaviour of creating data for swish_exact when using type="index".
Now every line is separate entry in swish_exact. That will create additional
clutter in index (fields which wouldn't be used because we are not inserting
them in index), but you will have to bear with this for now.


Revision 196 - (view) (annotate) - [select for diffs]
Modified Mon Dec 15 00:12:16 2003 UTC (16 years ago) by dpavlin
File length: 25247 byte(s)
Diff to previous 195 , to selected 50
correct support for swish_exact when there are repeatable fields


Revision 195 - (view) (annotate) - [select for diffs]
Modified Sun Dec 14 20:50:03 2003 UTC (16 years ago) by dpavlin
File length: 24995 byte(s)
Diff to previous 188 , to selected 50
don't repeat field name if same as last, support format_name and
format_delimiter on field level if using iterate_by_page (without this, it's
really hard to get useful formating when using iterate_by_page), don't warn
on rare occasion (which is faulty import_xml definition, but anyway...) when
using append="1"


Revision 188 - (view) (annotate) - [select for diffs]
Modified Sat Nov 29 19:07:00 2003 UTC (16 years ago) by dpavlin
File length: 24159 byte(s)
Diff to previous 182 , to selected 50
implemented index_delimiter which enables you to format index entries in format
(values to be inserted in index);;(values to be displayed) if there is
definition of index_delimiter=";;". This will allow you to index (and
search) through values from original database and still have ability to
display lookup fields.
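
As a sketch of the entry format described above, the Python snippet below splits an entry into the value to be indexed and the value to be displayed. The function name and sample entry are hypothetical, and falling back to the same text for both parts when the delimiter is absent is an assumption, not something the log states.

```python
def split_index_entry(entry, index_delimiter=";;"):
    """Split '(indexed);;(displayed)' into an (indexed, displayed) pair."""
    indexed, sep, displayed = entry.partition(index_delimiter)
    if not sep:
        # Assumed fallback: no delimiter means one value serves both purposes
        return entry, entry
    return indexed, displayed


indexed, displayed = split_index_entry("5614;;Name shown to the user")
print(indexed, "->", displayed)
```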


Revision 182 - (view) (annotate) - [select for diffs]
Modified Sat Nov 29 15:59:19 2003 UTC (16 years ago) by dpavlin
File length: 23692 byte(s)
Diff to previous 181 , to selected 50
make index with lookup field working with iterate on page


Revision 181 - (view) (annotate) - [select for diffs]
Modified Tue Nov 25 20:19:03 2003 UTC (16 years ago) by dpavlin
File length: 23088 byte(s)
Diff to previous 180 , to selected 50
fix swish_exact fields so that they don't show up in display


Revision 180 - (view) (annotate) - [select for diffs]
Modified Tue Nov 25 20:04:24 2003 UTC (16 years ago) by dpavlin
File length: 23064 byte(s)
Diff to previous 178 , to selected 50
invalidate memory cache when needed


Revision 178 - (view) (annotate) - [select for diffs]
Modified Mon Nov 24 21:54:19 2003 UTC (16 years ago) by dpavlin
File length: 23001 byte(s)
Diff to previous 177 , to selected 50
major improvements: you can select order of scanning in each topic tag
to be either by line (which is default, repeatable fields in one line will
be unrolled) or page-by-page (using new interate_by_page="1" attribute).
New page-by-page mode is really useful with lookups (because you can
append fields with lookups in same line, but using two tags), but it will
create multiple rows in html output.


Revision 177 - (view) (annotate) - [select for diffs]
Modified Mon Nov 24 01:19:15 2003 UTC (16 years ago) by dpavlin
File length: 19332 byte(s)
Diff to previous 170 , to selected 50
support for lookup fields. Implemented using GDBM or TDB (which I recommend
because it's fastest implementation)


Revision 170 - (view) (annotate) - [select for diffs]
Modified Sun Nov 23 15:42:16 2003 UTC (16 years ago) by dpavlin
File length: 17207 byte(s)
Diff to previous 164 , to selected 50
Re-wrote parsing for ISO-type data (isis, marc) to use in-memory cache of
format... 10% speed improvement and cleaner code. Include filter functions
just once.


Revision 164 - (view) (annotate) - [select for diffs]
Modified Sat Nov 22 22:04:05 2003 UTC (16 years ago) by dpavlin
File length: 16888 byte(s)
Diff to previous 163 , to selected 50
implemented filter which can replace (or be used together with) unac_string
from Text::Unaccent


Revision 163 - (view) (annotate) - [select for diffs]
Modified Thu Nov 20 21:23:40 2003 UTC (16 years ago) by dpavlin
File length: 16781 byte(s)
Diff to previous 153 , to selected 50
Added type="swish_exact" to save data into swish index with boundaries
xxbxx data xxexxx. This is helpful to implement exact match from beginning
of query and exact match to full query which are defined using e[nr] field
in web user interface (with same [nr] as f[nr] and v[nr] fields) which
have to have value 1 (from beginning) 2 (from end, not that useful...) or
3 (1+2 - exact match)
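
The boundary idea can be sketched in Python as follows. Stored data is wrapped in begin and end markers, and the search phrase gets the matching marker added depending on the e[nr] value (1, 2, or 3). The marker strings are copied as written in the message above; the function names and the way the phrase is assembled are assumptions, not the actual WebPac query code.

```python
BEGIN_MARK = "xxbxx"
END_MARK = "xxexxx"  # spelled as in the log message


def wrap_exact(value):
    """Wrap indexed data in boundary markers."""
    return f"{BEGIN_MARK} {value} {END_MARK}"


def exact_phrase(query, mode):
    """Build a phrase for mode 1 (from beginning), 2 (from end) or 3 (both)."""
    words = [query]
    if mode & 1:
        words.insert(0, BEGIN_MARK)
    if mode & 2:
        words.append(END_MARK)
    return " ".join(words)


stored = wrap_exact("war and peace")
print(stored)
print(exact_phrase("war and", 1))
print(exact_phrase("war and peace", 3))
```

A phrase built with mode 1 can only match at the start of the stored data, because the begin marker occurs nowhere else; mode 3 matches only when the whole stored value equals the query.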


Revision 153 - (view) (annotate) - [select for diffs]
Modified Sun Nov 16 22:42:41 2003 UTC (16 years, 1 month ago) by dpavlin
File length: 16190 byte(s)
Diff to previous 144 , to selected 50
implemented formats which can be used to produce links between records
in WebPac (documented in README.links)


Revision 144 - (view) (annotate) - [select for diffs]
Modified Sun Nov 16 11:55:18 2003 UTC (16 years, 1 month ago) by dpavlin
File length: 15286 byte(s)
Diff to previous 138 , to selected 50
fixed filters (again)


Revision 138 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 23:10:51 2003 UTC (16 years, 1 month ago) by dpavlin
File length: 15252 byte(s)
Diff to previous 137 , to selected 50
Aargh! I should really go to sleep or make PostgreSQL replication or something...


Revision 137 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 22:57:43 2003 UTC (16 years, 1 month ago) by dpavlin
File length: 15112 byte(s)
Diff to previous 136 , to selected 50
I removed too much: this always added delimiter before first element


Revision 136 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 22:46:49 2003 UTC (16 years, 1 month ago) by dpavlin
File length: 15074 byte(s)
Diff to previous 135 , to selected 50
another fix for repeatable fields


Revision 135 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 21:27:00 2003 UTC (16 years, 1 month ago) by dpavlin
File length: 15165 byte(s)
Diff to previous 109 , to selected 50
fix repeatable fields in index data


Revision 109 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 18:50:39 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 15165 byte(s)
Diff to previous 108 , to selected 50
erase also *.PTR files


Revision 108 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 18:20:27 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 14925 byte(s)
Diff to previous 106 , to selected 50
Overcome limit of 32 open databases. Unfortunately, OpenIsis in current
version (0.9.0) doesn't support close call, so you need patch from:
http://www.rot13.org/~dpavlin/projects/openisis-0.9.0-perl_close.diff


Revision 106 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 17:09:36 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 14721 byte(s)
Diff to previous 104 , to selected 50
check for bogus *.TXT databases (with zero length or 0 records) and
erase them to force OpenIsis to use binary files


Revision 104 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 10:55:35 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 13949 byte(s)
Diff to previous 102 , to selected 50
remove fake progress bar also


Revision 102 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 10:54:34 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 13917 byte(s)
Diff to previous 101 , to selected 50
removed debugging


Revision 101 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 10:52:13 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 13977 byte(s)
Diff to previous 98 , to selected 50
- better error reporting from OpenIsis
- added show_progress in global.conf to turn off progress bar


Revision 98 - (view) (annotate) - [select for diffs]
Modified Sun Jul 13 22:29:14 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 13585 byte(s)
Diff to previous 97 , to selected 50
fixed ordering


Revision 97 - (view) (annotate) - [select for diffs]
Modified Sun Jul 13 21:57:12 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 13602 byte(s)
Diff to previous 90 , to selected 50
ability to join repeatable fields before inserting into index


Revision 90 - (view) (annotate) - [select for diffs]
Modified Sun Jul 13 13:22:50 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 13300 byte(s)
Diff to previous 81 , to selected 50
repeatable fields (broken when other input formats were introduced) work
again


Revision 81 - (view) (annotate) - [select for diffs]
Modified Tue Jul 8 22:13:56 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 13117 byte(s)
Diff to previous 74 , to selected 50
the great rename: isis2xml.* -> all2xml.*


Revision 74 - (view) (annotate) - [select for diffs]
Modified Sat Jul 5 22:37:30 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 12774 byte(s)
Diff to previous 67 , to selected 50
support for new feed format which has a decimal field number, a colon,
and a space at the beginning of each line (like: 0: data)
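
Going by the `0: data` example, one line of that feed format could be parsed as in the Python sketch below. The function name is hypothetical, and returning None for lines that lack the prefix is an assumption about error handling.

```python
import re

# Decimal field number, then ": ", then the rest of the line
FEED_LINE = re.compile(r"([0-9]+): (.*)$")


def parse_feed_line(line):
    """Return (field_number, data) or None if the line has no such prefix."""
    match = FEED_LINE.match(line)
    if match is None:
        return None
    return int(match.group(1)), match.group(2)


print(parse_feed_line("0: data"))
```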


Revision 67 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 23:29:27 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 12739 byte(s)
Diff to previous 62 , to selected 50
implemented feed method which calls external program that returns
data line-by-line


Revision 62 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 20:11:48 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 11691 byte(s)
Diff to previous 59 , to selected 50
added MARC file import


Revision 59 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 17:57:11 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 10549 byte(s)
Diff to previous 58 , to selected 50
added config tag which can read any variable from isis2xml.conf file for
that library


Revision 58 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 16:56:40 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 9895 byte(s)
Diff to previous 57 , to selected 50
support type and sub-types (in form type_subtype)


Revision 57 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 15:05:23 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 9738 byte(s)
Diff to previous 56 , to selected 50
don't choke on input which iconv can't convert


Revision 56 - (view) (annotate) - [select for diffs]
Modified Wed Jun 25 12:09:27 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 9727 byte(s)
Diff to previous 54 , to selected 50
use start_row from excel.xml


Revision 54 - (view) (annotate) - [select for diffs]
Modified Mon Jun 23 20:20:32 2003 UTC (16 years, 5 months ago) by dpavlin
File length: 9734 byte(s)
Diff to previous 50
added Microsoft Excel file import


Revision 50 - (view) (annotate) - [selected]
Modified Sun Jun 1 13:46:42 2003 UTC (16 years, 6 months ago) by dpavlin
File length: 7470 byte(s)
Diff to previous 44
move database arguments to .conf file


Revision 44 - (view) (annotate) - [select for diffs]
Modified Sat Mar 22 22:51:48 2003 UTC (16 years, 8 months ago) by dpavlin
File length: 7223 byte(s)
Diff to previous 43 , to selected 50
fix


Revision 43 - (view) (annotate) - [select for diffs]
Modified Sat Mar 22 22:43:05 2003 UTC (16 years, 8 months ago) by dpavlin
File length: 7232 byte(s)
Diff to previous 42 , to selected 50
fixed alphabet soup -- characters encoding should really work now!


Revision 42 - (view) (annotate) - [select for diffs]
Modified Sat Mar 15 21:48:48 2003 UTC (16 years, 9 months ago) by dpavlin
File length: 7188 byte(s)
Diff to previous 40 , to selected 50
filter fix && optimisation


Revision 40 - (view) (annotate) - [select for diffs]
Modified Sat Mar 15 21:33:36 2003 UTC (16 years, 9 months ago) by dpavlin
File length: 7153 byte(s)
Diff to previous 35 , to selected 50
major de-mungling of different codepages: use same codepage inside perl
(as opposed to UTF-8) and in files on disk


Revision 35 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 15:47:40 2003 UTC (16 years, 9 months ago) by dpavlin
File length: 8276 byte(s)
Diff to previous 34 , to selected 50
last changes; completely broken charsets


Revision 34 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 08:06:07 2003 UTC (16 years, 9 months ago) by dpavlin
File length: 8189 byte(s)
Diff to previous 32 , to selected 50
append="1" fix


Revision 32 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 07:53:01 2003 UTC (16 years, 9 months ago) by dpavlin
File length: 8151 byte(s)
Diff to previous 29 , to selected 50
display fields using order="" attribute


Revision 29 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 07:08:54 2003 UTC (16 years, 9 months ago) by dpavlin
File length: 7880 byte(s)
Diff to previous 21 , to selected 50
repeatable field support, filter functions added, broken charset (again!)


Revision 21 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 00:00:51 2003 UTC (16 years, 9 months ago) by dpavlin
File length: 7225 byte(s)
Diff to previous 20 , to selected 50
fix


Revision 20 - (view) (annotate) - [select for diffs]
Modified Sat Feb 22 23:49:22 2003 UTC (16 years, 9 months ago) by dpavlin
File length: 7245 byte(s)
Diff to previous 17 , to selected 50
add filter="name" for fields (to correct strange input data or make variations
for indexing)


Revision 17 - (view) (annotate) - [select for diffs]
Modified Sat Feb 22 21:33:04 2003 UTC (16 years, 9 months ago) by dpavlin
File length: 6454 byte(s)
Diff to previous 13 , to selected 50
fix index insertion


Revision 13 - (view) (annotate) - [select for diffs]
Modified Sun Feb 16 22:41:37 2003 UTC (16 years, 10 months ago) by dpavlin
File length: 6528 byte(s)
Diff to previous 10 , to selected 50
added configuration file with database descriptions,
moved isis.xml definition file in separate directory (in preparation for MARC),
support for different encodings in different files,
various fixes, improvements and badly written parts which will change ;-)


Revision 10 - (view) (annotate) - [select for diffs]
Modified Thu Jan 16 17:35:54 2003 UTC (16 years, 11 months ago) by dpavlin
File length: 5683 byte(s)
Diff to previous 9 , to selected 50
bunch of changes: make design more modular, implement index (partial
implementation) and other small and big changes


Revision 9 - (view) (annotate) - [select for diffs]
Modified Sat Jan 11 19:55:30 2003 UTC (16 years, 11 months ago) by dpavlin
File length: 6713 byte(s)
Diff to previous 7 , to selected 50
renamed "old" index to swish, and introduced index which is -- index;
implemented using PostgreSQL for now.


Revision 7 - (view) (annotate) - [select for diffs]
Modified Sat Jan 11 16:44:03 2003 UTC (16 years, 11 months ago) by dpavlin
File length: 5543 byte(s)
Diff to previous 5 , to selected 50
major modifications to produce first (non-working) version of Web CGI
interface.


Revision 5 - (view) (annotate) - [select for diffs]
Modified Sat Jan 11 06:14:48 2003 UTC (16 years, 11 months ago) by dpavlin
File length: 5107 byte(s)
Diff to previous 4 , to selected 50
require 1.02 version of Text::Unaccent (1.01 can't pass 'make test' here!)


Revision 4 - (view) (annotate) - [select for diffs]
Modified Sun Dec 1 22:51:29 2002 UTC (17 years ago) by dpavlin
File length: 5065 byte(s)
Diff to previous 3 , to selected 50
remove subfield definition from values which are displayed and indexed


Revision 3 - (view) (annotate) - [select for diffs]
Modified Sat Nov 30 00:36:34 2002 UTC (17 years ago) by dpavlin
File length: 4996 byte(s)
Diff to previous 1 , to selected 50
first really working version -- creates xml file for swish + swish config


Revision 1 - (view) (annotate) - [select for diffs]
Added Sun Nov 24 20:52:11 2002 UTC (17 years ago) by dpavlin
File length: 1483 byte(s)
Diff to selected 50
Initial revision

