/[webpac]/trunk2/all2all.pl
This is repository of my old source code which isn't updated any more. Go to git.rot13.org for current projects!
ViewVC logotype

Log of /trunk2/all2all.pl

Parent Directory Parent Directory | Revision Log Revision Log


Links to HEAD: (view) (annotate)
Sticky Revision:

Revision 708 - (view) (annotate) - [select for diffs]
Modified Wed Jul 13 23:38:04 2005 UTC (14 years ago) by dpavlin
File length: 10353 byte(s)
Diff to previous 591 , to selected 153
added HyperEstraier back-end indexer


Revision 591 - (view) (annotate) - [select for diffs]
Modified Sat Nov 6 22:46:30 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 10163 byte(s)
Diff to previous 587 , to selected 153
added crovoc simbols (using lookup)


Revision 587 - (view) (annotate) - [select for diffs]
Modified Sat Nov 6 21:00:14 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 10103 byte(s)
Diff to previous 579 , to selected 153
bugfix: first decode entities and unaccent string, and then remove all
non-word characters from beggining or end of word (because it would eat &
and ; from entity)


Revision 579 - (view) (annotate) - [select for diffs]
Modified Mon Nov 1 22:15:41 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 10103 byte(s)
Diff to previous 575 , to selected 153
make iframes after first level of tree


Revision 575 - (view) (annotate) - [select for diffs]
Modified Mon Nov 1 18:00:26 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 10103 byte(s)
Diff to previous 574 , to selected 153
first iframe implementation of tree, shuffled a bit paramters to
WebPAC::Tree (so, node dir is now parametar to new and not to output)


Revision 574 - (view) (annotate) - [select for diffs]
Modified Mon Nov 1 17:19:48 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 10141 byte(s)
Diff to previous 573 , to selected 153
fix paths to data in tree


Revision 573 - (view) (annotate) - [select for diffs]
Modified Mon Nov 1 15:29:17 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 10116 byte(s)
Diff to previous 572 , to selected 153
added detail_url code ref and base_path to unroll


Revision 572 - (view) (annotate) - [select for diffs]
Modified Mon Nov 1 14:55:16 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 10006 byte(s)
Diff to previous 566 , to selected 153
convert WebPAC::Tree to use Template Toolkit,
started adding support for nodes in iframe


Revision 566 - (view) (annotate) - [select for diffs]
Modified Sun Oct 31 18:43:33 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 9832 byte(s)
Diff to previous 562 , to selected 153
remove HIDRA mark


Revision 562 - (view) (annotate) - [select for diffs]
Modified Sat Oct 30 23:56:57 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 9780 byte(s)
Diff to previous 561 , to selected 153
fix patch to icon image


Revision 561 - (view) (annotate) - [select for diffs]
Modified Sat Oct 30 23:24:54 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 9723 byte(s)
Diff to previous 560 , to selected 153
make icons in tree also


Revision 560 - (view) (annotate) - [select for diffs]
Modified Sat Oct 30 23:04:37 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 9688 byte(s)
Diff to previous 550 , to selected 153
removed upper case letters from encoding of index nodes (to support Windows
which is case-insensitive), added support for filters into TT templates,
changed crovoc symbol to #C# in data


Revision 550 - (view) (annotate) - [select for diffs]
Modified Tue Oct 26 21:44:57 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 9426 byte(s)
Diff to previous 549 , to selected 153
unaccent html entities from index. This fixes searching for words without
accented characters which are stored as entities in input data.
Unfortunately, searhing via copy/paste of accented result doesn't work in
Firefox (it produces UTF-8 encoded char which isn't handled by l2_unescape).


Revision 549 - (view) (annotate) - [select for diffs]
Modified Tue Oct 26 20:53:03 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 9301 byte(s)
Diff to previous 546 , to selected 153
Magic regexp to unaccent html entities in input data.


Revision 546 - (view) (annotate) - [select for diffs]
Modified Tue Oct 26 18:59:38 2004 UTC (14 years, 8 months ago) by dpavlin
File length: 9139 byte(s)
Diff to previous 530 , to selected 153
Removed all unaccented letters except in bfilter for characters in "part".
This solves problem of different browsers sorting differently according to
current locale setting. This way, only US ASCII characters are used and
everything is (hopefully) fine.


Revision 530 - (view) (annotate) - [select for diffs]
Modified Tue Oct 19 17:43:52 2004 UTC (14 years, 9 months ago) by dpavlin
File length: 8684 byte(s)
Diff to previous 520 , to selected 153
major restructuring of tree output: new function WebPAC::Tree::output which
produce output (so you can apply same output to more than one template)


Revision 520 - (view) (annotate) - [select for diffs]
Modified Mon Oct 18 20:03:42 2004 UTC (14 years, 9 months ago) by dpavlin
File length: 8523 byte(s)
Diff to previous 511 , to selected 153
stupid, stupid, stupid brown bag kind of bug: don't shift values from array
which you need later.


Revision 511 - (view) (annotate) - [select for diffs]
Modified Sun Oct 17 17:35:32 2004 UTC (14 years, 9 months ago) by dpavlin
File length: 8531 byte(s)
Diff to previous 504 , to selected 153
implemented optional name paramter to constructor, useful for debugging


Revision 504 - (view) (annotate) - [select for diffs]
Modified Sun Oct 10 17:36:43 2004 UTC (14 years, 9 months ago) by dpavlin
File length: 8507 byte(s)
Diff to previous 503 , to selected 153
generate headlines for jsFind index from 'index' type, then from 'display'
and finally fallback to <headline>


Revision 503 - (view) (annotate) - [select for diffs]
Modified Sun Oct 10 17:21:17 2004 UTC (14 years, 9 months ago) by dpavlin
File length: 8292 byte(s)
Diff to previous 501 , to selected 153
make CROVOC symbols working again


Revision 501 - (view) (annotate) - [select for diffs]
Modified Sun Oct 10 11:25:10 2004 UTC (14 years, 9 months ago) by dpavlin
File length: 8292 byte(s)
Diff to previous 500 , to selected 153
implement CROVOC marker for tree


Revision 500 - (view) (annotate) - [select for diffs]
Modified Sun Oct 10 11:04:52 2004 UTC (14 years, 9 months ago) by dpavlin
File length: 8135 byte(s)
Diff to previous 490 , to selected 153
implement filter{name} which can be embedded inside filed, just like
eval{...}. Code for filters is not compatibile with old implementation, but
it should be easier.
If you mix eval{} and filter{}, eval has to come first.


Revision 490 - (view) (annotate) - [select for diffs]
Modified Sat Oct 9 21:44:25 2004 UTC (14 years, 9 months ago) by dpavlin
File length: 7980 byte(s)
Diff to previous 471 , to selected 153
use locale so that \W (non-word character) won't eat local characters


Revision 471 - (view) (annotate) - [select for diffs]
Modified Sun Sep 26 15:11:36 2004 UTC (14 years, 9 months ago) by dpavlin
File length: 7968 byte(s)
Diff to previous 466 , to selected 153
minor changes to have_children_at_level (but it's not really used),
fixed lookup (which obsoleted have_children_at_level :-)


Revision 466 - (view) (annotate) - [select for diffs]
Modified Thu Sep 23 23:17:42 2004 UTC (14 years, 9 months ago) by dpavlin
File length: 7761 byte(s)
Diff to previous 460 , to selected 153
remove non-word characters from data before inserting into jsFind index


Revision 460 - (view) (annotate) - [select for diffs]
Modified Tue Sep 21 20:43:43 2004 UTC (14 years, 9 months ago) by dpavlin
File length: 7570 byte(s)
Diff to previous 454 , to selected 153
fixed tree generation (have_children_at_level still doesn't work)


Revision 454 - (view) (annotate) - [select for diffs]
Modified Wed Sep 15 22:36:23 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 7768 byte(s)
Diff to previous 448 , to selected 153
new simplier implementation of WebPAC::Tree with support for recursion
(still buggy)


Revision 448 - (view) (annotate) - [select for diffs]
Modified Wed Sep 15 16:53:51 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 7616 byte(s)
Diff to previous 441 , to selected 153
Major changes this time: updated for new bfilter (filtering on first
element), outline in sorted index when found in thesaurus.


Revision 441 - (view) (annotate) - [select for diffs]
Modified Tue Sep 14 17:07:59 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 7607 byte(s)
Diff to previous 440 , to selected 153
refactore tree generation into WebPAC::Tree


Revision 440 - (view) (annotate) - [select for diffs]
Modified Tue Sep 14 09:23:00 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 11512 byte(s)
Diff to previous 437 , to selected 153
missing rename mfn### -> id### because of IE bug


Revision 437 - (view) (annotate) - [select for diffs]
Modified Mon Sep 13 15:32:55 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 11513 byte(s)
Diff to previous 433 , to selected 153
command-line options


Revision 433 - (view) (annotate) - [select for diffs]
Modified Mon Sep 13 12:13:04 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 11121 byte(s)
Diff to previous 432 , to selected 153
don't create <a name="xx"> and <ul id="xx"> because getElementById in
Internet Exporer is stupid and searches *BOTH* name and id!


Revision 432 - (view) (annotate) - [select for diffs]
Modified Sun Sep 12 20:36:33 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 11132 byte(s)
Diff to previous 431 , to selected 153
navigation


Revision 431 - (view) (annotate) - [select for diffs]
Modified Sun Sep 12 20:31:34 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 10897 byte(s)
Diff to previous 430 , to selected 153
new option: start_mfn


Revision 430 - (view) (annotate) - [select for diffs]
Modified Sun Sep 12 19:15:54 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 10877 byte(s)
Diff to previous 429 , to selected 153
history support for JavaScript show/hide


Revision 429 - (view) (annotate) - [select for diffs]
Modified Sun Sep 12 17:21:47 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 10521 byte(s)
Diff to previous 428 , to selected 153
don't use body on load, use style in html (speed up first access to browse
tree)


Revision 428 - (view) (annotate) - [select for diffs]
Modified Sun Sep 12 02:31:52 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 10501 byte(s)
Diff to previous 427 , to selected 153
JavaScript to show/hide all items and return to default.


Revision 427 - (view) (annotate) - [select for diffs]
Modified Sun Sep 12 01:34:32 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 9410 byte(s)
Diff to previous 426 , to selected 153
more levels


Revision 426 - (view) (annotate) - [select for diffs]
Modified Sun Sep 12 01:25:55 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 8112 byte(s)
Diff to previous 425 , to selected 153
two more levels, lot more debug messages


Revision 425 - (view) (annotate) - [select for diffs]
Modified Sat Sep 11 23:32:41 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 7247 byte(s)
Diff to previous 424 , to selected 153
one more level in tree, nodes with children are no longer links.


Revision 424 - (view) (annotate) - [select for diffs]
Modified Sat Sep 11 22:32:06 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 6550 byte(s)
Diff to previous 423 , to selected 153
refactore tree output into data-driven one (and recursive)


Revision 423 - (view) (annotate) - [select for diffs]
Modified Sat Sep 11 20:21:28 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 5595 byte(s)
Diff to previous 422 , to selected 153
added show/hide to tree view


Revision 422 - (view) (annotate) - [select for diffs]
Modified Sat Sep 11 08:36:38 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 5267 byte(s)
Diff to previous 421 , to selected 153
better progress_bar, more documentation


Revision 421 - (view) (annotate) - [select for diffs]
Modified Fri Sep 10 22:24:42 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 5188 byte(s)
Diff to previous 420 , to selected 153
low_mem option for desktop class-machines


Revision 420 - (view) (annotate) - [select for diffs]
Modified Thu Sep 9 18:25:34 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 4925 byte(s)
Diff to previous 419 , to selected 153
warn, don't die


Revision 419 - (view) (annotate) - [select for diffs]
Modified Thu Sep 9 18:09:21 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 4754 byte(s)
Diff to previous 415 , to selected 153
create tree structure from input data


Revision 415 - (view) (annotate) - [select for diffs]
Modified Wed Sep 8 15:30:07 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 2786 byte(s)
Diff to previous 413 , to selected 153
clean old data before generating new, create JavaScript indexes,
fix inserting into index


Revision 413 - (view) (annotate) - [select for diffs]
Modified Tue Sep 7 18:02:06 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 2736 byte(s)
Diff to previous 411 , to selected 153
create index files for bfilter


Revision 411 - (view) (annotate) - [select for diffs]
Modified Sun Sep 5 22:22:37 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 2716 byte(s)
Diff to previous 410 , to selected 153
implemented filtered sorted indexes


Revision 410 - (view) (annotate) - [select for diffs]
Modified Sun Sep 5 21:40:57 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 2451 byte(s)
Diff to previous 407 , to selected 153
added sorted index using WebPAC::Index module


Revision 407 - (view) (annotate) - [select for diffs]
Modified Sun Sep 5 17:54:51 2004 UTC (14 years, 10 months ago) by dpavlin
File length: 2104 byte(s)
Diff to previous 398 , to selected 153
create index with much larger B, found jsFind bug.


Revision 398 - (view) (annotate) - [select for diffs]
Modified Sat Jul 24 13:48:08 2004 UTC (14 years, 11 months ago) by dpavlin
File length: 2090 byte(s)
Diff to previous 392 , to selected 153
moved headline information into $webpac->{'headline'} after data_structure is
called. This makes headline desapier from output templates, and namebles new
template veriable 'headline' to contain headline.


Revision 392 - (view) (annotate) - [select for diffs]
Modified Wed Jul 21 16:58:27 2004 UTC (15 years ago) by dpavlin
File length: 2112 byte(s)
Diff to previous 390 , to selected 153
fix path for index


Revision 390 - (view) (annotate) - [select for diffs]
Modified Wed Jul 21 16:18:45 2004 UTC (15 years ago) by dpavlin
File length: 2081 byte(s)
Diff to previous 389 , to selected 153
first cut into making jsFind-based search


Revision 389 - (view) (annotate) - [select for diffs]
Modified Tue Jul 20 17:15:48 2004 UTC (15 years ago) by dpavlin
File length: 1591 byte(s)
Diff to previous 374 , to selected 153
added progress_bar


Revision 374 - (view) (annotate) - [select for diffs]
Modified Sun Jun 20 16:57:52 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1591 byte(s)
Diff to previous 373 , to selected 153
save mfn as field v000, _get logger handles calls from main as it should,
support for <filename> tag


Revision 373 - (view) (annotate) - [select for diffs]
Modified Sun Jun 20 15:49:09 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1280 byte(s)
Diff to previous 372 , to selected 153
a lot more logging, lookups are now working as expected (and documented)


Revision 372 - (view) (annotate) - [select for diffs]
Modified Sat Jun 19 18:16:20 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1344 byte(s)
Diff to previous 370 , to selected 153
Log4perl implementation


Revision 370 - (view) (annotate) - [select for diffs]
Modified Thu Jun 17 17:25:12 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1093 byte(s)
Diff to previous 366 , to selected 153
method output using Template Toolkit to produce output


Revision 366 - (view) (annotate) - [select for diffs]
Modified Thu Jun 17 01:44:25 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1825 byte(s)
Diff to previous 363 , to selected 153
make in-memory data_structure


Revision 363 - (view) (annotate) - [select for diffs]
Modified Wed Jun 16 20:05:19 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1768 byte(s)
Diff to previous 362 , to selected 153
open_import_xml, debug option to new


Revision 362 - (view) (annotate) - [select for diffs]
Modified Wed Jun 16 16:50:30 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1730 byte(s)
Diff to previous 359 , to selected 153
fetch_rec method


Revision 359 - (view) (annotate) - [select for diffs]
Modified Wed Jun 16 15:41:16 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1843 byte(s)
Diff to previous 358 , to selected 153
implemeted eval{...}


Revision 358 - (view) (annotate) - [select for diffs]
Modified Wed Jun 16 14:31:33 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1653 byte(s)
Diff to previous 357 , to selected 153
format seems to work


Revision 357 - (view) (annotate) - [select for diffs]
Modified Wed Jun 16 13:39:17 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1550 byte(s)
Diff to previous 354 , to selected 153
implement limit_mfn


Revision 354 - (view) (annotate) - [select for diffs]
Modified Wed Jun 16 11:31:42 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1535 byte(s)
Diff to previous 353 , to selected 153
WebPac -> WebPAC


Revision 353 - (view) (annotate) - [select for diffs]
Modified Wed Jun 16 11:29:37 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1535 byte(s)
Diff to previous 352 , to selected 153
varous clenups


Revision 352 - (view) (annotate) - [select for diffs]
Modified Tue Jun 15 22:40:07 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 1610 byte(s)
Diff to previous 349 , to selected 153
Object-orineted design re-implementation: simple field substitution and
lookups are working well.
Added some documentation about new features.


Revision 349 - (view) (annotate) - [select for diffs]
Modified Sun Jun 13 20:39:44 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 2582 byte(s)
Diff to previous 348 , to selected 153
don't materialize hash values which are undef


Revision 348 - (view) (annotate) - [select for diffs]
Modified Sun Jun 13 20:20:29 2004 UTC (15 years, 1 month ago) by dpavlin
File length: 2643 byte(s)
Copied from: trunk2/all2xml.pl revision 337
Diff to previous 337 , to selected 153
first commit of new code


Revision 337 - (view) (annotate) - [select for diffs]
Modified Thu Jun 10 19:22:40 2004 UTC (15 years, 1 month ago) by dpavlin
Original Path: trunk2/all2xml.pl
File length: 26617 byte(s)
Diff to previous 333 , to selected 153
new trunk for webpac v2


Revision 333 - (view) (annotate) - [select for diffs]
Modified Tue May 18 18:15:19 2004 UTC (15 years, 2 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 26617 byte(s)
Diff to previous 320 , to selected 153
print warning if type is not handled (probably a typo)


Revision 320 - (view) (annotate) - [select for diffs]
Modified Sun Apr 18 00:57:39 2004 UTC (15 years, 3 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 26547 byte(s)
Diff to previous 298 , to selected 153
implement my_unac_string function, and my_unac_filter option in global.conf
which you *REALLY* want to use if you don't have only clean 7-bit characters 
in your data


Revision 298 - (view) (annotate) - [select for diffs]
Modified Fri Apr 2 23:31:25 2004 UTC (15 years, 3 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 26333 byte(s)
Diff to previous 290 , to selected 153
You can now specify configuration file as command-line option, and
if you don't do that, it will use default one called all2xml.conf


Revision 290 - (view) (annotate) - [select for diffs]
Modified Sun Mar 14 20:19:42 2004 UTC (15 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 26159 byte(s)
Diff to previous 279 , to selected 153
delimiter and append now works as expected


Revision 279 - (view) (annotate) - [select for diffs]
Modified Sun Mar 14 14:59:43 2004 UTC (15 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 26152 byte(s)
Diff to previous 274 , to selected 153
Implemented new form of delimiters like this:

<tag>
	<delimiter>, </delimiter>
	<value>200a</value>
</tag>

which is equivavelnt to following old mark-up:

<tag delimiter=", ">200a</tag>

but, it won't loose spaces in attribute values (which
are invalid by XML specification and XML::Simple removes
them so WebPac never get them)


Revision 274 - (view) (annotate) - [select for diffs]
Modified Sun Mar 14 11:50:29 2004 UTC (15 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25529 byte(s)
Diff to previous 263 , to selected 153
<config> tags (which use values from all2xml.conf) are now properly handled
if there is more than one in same swish tag. However, to use <config
type="index"> is useless IMHO, and <config type="index_lookup"> is not
implemented.


Revision 263 - (view) (annotate) - [select for diffs]
Modified Fri Mar 12 15:06:58 2004 UTC (15 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25532 byte(s)
Diff to previous 259 , to selected 153
ported r260 from hidra branch: moved eval to parse_format.pm where it
belongs. Also changed eval format to: eval{v901^a eq "Mikrotezaurus"}
(please note same format as in ISIS formating language)


Revision 259 - (view) (annotate) - [select for diffs]
Modified Thu Mar 11 18:23:59 2004 UTC (15 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25885 byte(s)
Diff to previous 256 , to selected 153
ported 257:258 from hidra branch
all2xml.pl - fix for swish without filter
openisis/perl/OpenIsis.pm - removed warning


Revision 256 - (view) (annotate) - [select for diffs]
Modified Tue Mar 9 12:18:17 2004 UTC (15 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25878 byte(s)
Diff to previous 255 , to selected 153
ported r254 from hidra branch


Revision 255 - (view) (annotate) - [select for diffs]
Modified Tue Mar 9 12:17:05 2004 UTC (15 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 26092 byte(s)
Diff to previous 234 , to selected 153
ported r248:252 from hidra branch:

r248: much improved installation instructions, especially for Debian
      GNU/Linux distributions
r249: changed use of Spreadsheet::ParseExcel and MARC to require/import so
      that dependency on those modules can be resolved in runtime.
r250: finished installation documentation
r251: removing dependency on HTML::Parser would ease installation
r252: smaller eval{} fiexes. eval{} logic should really move to
      parse_format.pm


Revision 234 - (view) (annotate) - [select for diffs]
Modified Sun Mar 7 22:51:14 2004 UTC (15 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25755 byte(s)
Diff to previous 233 , to selected 153
eval{...} now works for type="swish" also...


Revision 233 - (view) (annotate) - [select for diffs]
Modified Fri Mar 5 23:33:19 2004 UTC (15 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25796 byte(s)
Diff to previous 231 , to selected 153
lookup_key and lookup_val types now support filters


Revision 231 - (view) (annotate) - [select for diffs]
Modified Fri Mar 5 22:53:30 2004 UTC (15 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25556 byte(s)
Diff to previous 224 , to selected 153
clear memory cache when opening new file lookup


Revision 224 - (view) (annotate) - [select for diffs]
Modified Sun Feb 8 20:16:54 2004 UTC (15 years, 5 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25490 byte(s)
Diff to previous 218 , to selected 153
important bug fix for bug introduced in 1.57: it might eat your data
if you are not using filter. This one was hard do find...


Revision 218 - (view) (annotate) - [select for diffs]
Modified Thu Feb 5 10:55:58 2004 UTC (15 years, 5 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25489 byte(s)
Diff to previous 215 , to selected 153
Changed never userd format configuration option for import_xml to
marc_format to prevent clash with format for output. If you don't
specify it (as I never do) it will default to 'usmarc' which is probably
the right thing (tm).


Revision 215 - (view) (annotate) - [select for diffs]
Modified Sun Feb 1 22:06:00 2004 UTC (15 years, 5 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25462 byte(s)
Diff to previous 207 , to selected 153
brown-bag bug: I was using MARC.pm wrong: now whole file will be loaded
at start of indexing, changing memory usage to much more step-like, but
that enables real progress indicator and few seconds gain in indexing
speed.


Revision 207 - (view) (annotate) - [select for diffs]
Modified Sat Jan 31 21:03:06 2004 UTC (15 years, 5 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25315 byte(s)
Diff to previous 199 , to selected 153
thesaurus is finally working... It contains recursive entries to parnet
term, and we actually needed to display narrower terms, so mem_lookup was
created. Important changes:
- you can write eval{"901a" eq "Mikrotezaurus"} within <isis>
  tag and if expression evaluates to false, no content will be outputed
  (It's used to hide microtesarus terms from lover level descriptors)
- mem_lookup.pm now supports formats: you can write something like
  [a:5614];;[d:[a:5614]] and it will correctly embed values


Revision 199 - (view) (annotate) - [select for diffs]
Modified Wed Jan 7 12:29:11 2004 UTC (15 years, 6 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25019 byte(s)
Diff to previous 197 , to selected 153
fixed filter delimiter bug


Revision 197 - (view) (annotate) - [select for diffs]
Modified Sun Dec 21 03:27:02 2003 UTC (15 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25019 byte(s)
Diff to previous 196 , to selected 153
Changed behaviour of creating data for swish_exact when using type="index".
Now every line is separate entry in swish_exact. That will create additional
clutter in index (fields which wouldn't be used because we are not insering
them in index), but you will have to bare with this for now.


Revision 196 - (view) (annotate) - [select for diffs]
Modified Mon Dec 15 00:12:16 2003 UTC (15 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 25247 byte(s)
Diff to previous 195 , to selected 153
correct support for swish_exact when there are repeatable fields


Revision 195 - (view) (annotate) - [select for diffs]
Modified Sun Dec 14 20:50:03 2003 UTC (15 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 24995 byte(s)
Diff to previous 188 , to selected 153
don't repeat field name if same as last, support format_name and
format_delimiter on field level if using iterate_by_page (without this, it's
really hard to get useful formating when using iterate_by_page), don't warn
on rare occasion (which is faulty import_xml definition, but anyway...) when
using append="1"


Revision 188 - (view) (annotate) - [select for diffs]
Modified Sat Nov 29 19:07:00 2003 UTC (15 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 24159 byte(s)
Diff to previous 182 , to selected 153
implemented index_delimiter which enables to to format index entries in format
(values to be inserted in index);;(values to be displayed) if there is
definition of index_delimiter=";;". This will allow you to index (and
search) through values from original database and still have ability to
display lookup fields.


Revision 182 - (view) (annotate) - [select for diffs]
Modified Sat Nov 29 15:59:19 2003 UTC (15 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 23692 byte(s)
Diff to previous 181 , to selected 153
make index with lookup field working with iterate on page


Revision 181 - (view) (annotate) - [select for diffs]
Modified Tue Nov 25 20:19:03 2003 UTC (15 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 23088 byte(s)
Diff to previous 180 , to selected 153
fix swish_exact fields so that they don't show up in display


Revision 180 - (view) (annotate) - [select for diffs]
Modified Tue Nov 25 20:04:24 2003 UTC (15 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 23064 byte(s)
Diff to previous 178 , to selected 153
invalidate memory cache when needed


Revision 178 - (view) (annotate) - [select for diffs]
Modified Mon Nov 24 21:54:19 2003 UTC (15 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 23001 byte(s)
Diff to previous 177 , to selected 153
major improvements: you can select order of scanning in each topic tag
to be eather by line (which is default, repeatable fields in one line will
be unrolled) or page-by-page (using new interate_by_page="1" attribute).
New page-by-page mode is really useful with lookups (because you can
append fields with lookups in same line, but using two tags), but it will
create multiple rows in html output.


Revision 177 - (view) (annotate) - [select for diffs]
Modified Mon Nov 24 01:19:15 2003 UTC (15 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 19332 byte(s)
Diff to previous 170 , to selected 153
support for lookup fields. Implemented using GDBM or TDB (which I recommend
because it's fastest implementation)


Revision 170 - (view) (annotate) - [select for diffs]
Modified Sun Nov 23 15:42:16 2003 UTC (15 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 17207 byte(s)
Diff to previous 164 , to selected 153
Re-wrote parsing for ISO-type data (isis, marc) to use in-memory cache of
format... 10% speed improvement and cleaner code. Include filter functions
just once.


Revision 164 - (view) (annotate) - [select for diffs]
Modified Sat Nov 22 22:04:05 2003 UTC (15 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 16888 byte(s)
Diff to previous 163 , to selected 153
implemented filter which can replace (or be used together with) unac_string
from Text::Unaccent


Revision 163 - (view) (annotate) - [select for diffs]
Modified Thu Nov 20 21:23:40 2003 UTC (15 years, 8 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 16781 byte(s)
Diff to previous 153
Added type="swish_exact" to save data into swish index with boundaries
xxbxx data xxexxx. This is helpful to implement exact match from beginning
of query and exact match to full query which are defined using e[nr] field
in web user interface (with same [nr] as f[nr] and v[nr] fields) which
have to have value 1 (from beginning) 2 (from end, not that useful...) or
3 (1+2 - exact match)


Revision 153 - (view) (annotate) - [selected]
Modified Sun Nov 16 22:42:41 2003 UTC (15 years, 8 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 16190 byte(s)
Diff to previous 144
implemented formats which can be used to produce links between records
in WebPac (documented in README.links)


Revision 144 - (view) (annotate) - [select for diffs]
Modified Sun Nov 16 11:55:18 2003 UTC (15 years, 8 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 15286 byte(s)
Diff to previous 138 , to selected 153
fixed filters (again)


Revision 138 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 23:10:51 2003 UTC (15 years, 8 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 15252 byte(s)
Diff to previous 137 , to selected 153
Aargh! I should really go to sleep or make PostgeSQL replication or something...


Revision 137 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 22:57:43 2003 UTC (15 years, 8 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 15112 byte(s)
Diff to previous 136 , to selected 153
I removed too much: this always added delimiter before first element


Revision 136 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 22:46:49 2003 UTC (15 years, 8 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 15074 byte(s)
Diff to previous 135 , to selected 153
another fix for repeatable fields


Revision 135 - (view) (annotate) - [select for diffs]
Modified Wed Oct 29 21:27:00 2003 UTC (15 years, 8 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 15165 byte(s)
Diff to previous 109 , to selected 153
fix repeatable fields in index data


Revision 109 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 18:50:39 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 15165 byte(s)
Diff to previous 108 , to selected 153
erase also *.PTR files


Revision 108 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 18:20:27 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 14925 byte(s)
Diff to previous 106 , to selected 153
Overcome limit of 32 open databases. Unfortunatly, OpenIsis in current
version (0.9.0) doesn't support close call, so you need patch from:
http://www.rot13.org/~dpavlin/projects/openisis-0.9.0-perl_close.diff


Revision 106 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 17:09:36 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 14721 byte(s)
Diff to previous 104 , to selected 153
check for bogus *.TXT databases (with zero length or 0 records) and
erase them to force OpenIsis to use binary files


Revision 104 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 10:55:35 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 13949 byte(s)
Diff to previous 102 , to selected 153
remove fake progress bar also


Revision 102 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 10:54:34 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 13917 byte(s)
Diff to previous 101 , to selected 153
removed debugging


Revision 101 - (view) (annotate) - [select for diffs]
Modified Mon Jul 14 10:52:13 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 13977 byte(s)
Diff to previous 98 , to selected 153
- better error reporing from OpenIsis
- added show_progress in global.conf to turn off progress bar


Revision 98 - (view) (annotate) - [select for diffs]
Modified Sun Jul 13 22:29:14 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 13585 byte(s)
Diff to previous 97 , to selected 153
fixed ordering


Revision 97 - (view) (annotate) - [select for diffs]
Modified Sun Jul 13 21:57:12 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 13602 byte(s)
Diff to previous 90 , to selected 153
ability to join repeatable fields before inseting into index


Revision 90 - (view) (annotate) - [select for diffs]
Modified Sun Jul 13 13:22:50 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 13300 byte(s)
Diff to previous 81 , to selected 153
repeatable fields (broken when other input formats where introduced) work
again


Revision 81 - (view) (annotate) - [select for diffs]
Modified Tue Jul 8 22:13:56 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 13117 byte(s)
Diff to previous 74 , to selected 153
the great rename: isis2xml.* -> all2xml.*


Revision 74 - (view) (annotate) - [select for diffs]
Modified Sat Jul 5 22:37:30 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 12774 byte(s)
Diff to previous 67 , to selected 153
support for new feed format which have decimal number of field, semicolumn
and space at beginning of each line (like: 0: data)


Revision 67 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 23:29:27 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 12739 byte(s)
Diff to previous 62 , to selected 153
implemented feed method which calls external program that returns
data line-by-line


Revision 62 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 20:11:48 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 11691 byte(s)
Diff to previous 59 , to selected 153
added MARC file import


Revision 59 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 17:57:11 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 10549 byte(s)
Diff to previous 58 , to selected 153
added config tag which can read any variable from isis2xml.conf file for
that library


Revision 58 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 16:56:40 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 9895 byte(s)
Diff to previous 57 , to selected 153
support type and sub-types (in form type_subtype)


Revision 57 - (view) (annotate) - [select for diffs]
Modified Fri Jul 4 15:05:23 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 9738 byte(s)
Diff to previous 56 , to selected 153
don't choke on input which iconv can't convert


Revision 56 - (view) (annotate) - [select for diffs]
Modified Wed Jun 25 12:09:27 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 9727 byte(s)
Diff to previous 54 , to selected 153
use start_row from excel.xml


Revision 54 - (view) (annotate) - [select for diffs]
Modified Mon Jun 23 20:20:32 2003 UTC (16 years ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 9734 byte(s)
Diff to previous 50 , to selected 153
added Microsoft Excel file import


Revision 50 - (view) (annotate) - [select for diffs]
Modified Sun Jun 1 13:46:42 2003 UTC (16 years, 1 month ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 7470 byte(s)
Diff to previous 44 , to selected 153
move database arguments to .conf file


Revision 44 - (view) (annotate) - [select for diffs]
Modified Sat Mar 22 22:51:48 2003 UTC (16 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 7223 byte(s)
Diff to previous 43 , to selected 153
fix


Revision 43 - (view) (annotate) - [select for diffs]
Modified Sat Mar 22 22:43:05 2003 UTC (16 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 7232 byte(s)
Diff to previous 42 , to selected 153
fixed alphabet soup -- characters encoding should really work now!


Revision 42 - (view) (annotate) - [select for diffs]
Modified Sat Mar 15 21:48:48 2003 UTC (16 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 7188 byte(s)
Diff to previous 40 , to selected 153
filter fix && optimisation


Revision 40 - (view) (annotate) - [select for diffs]
Modified Sat Mar 15 21:33:36 2003 UTC (16 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 7153 byte(s)
Diff to previous 35 , to selected 153
major de-mungling of different codepages: use same codepage inside perl
(as opposed to UTF-8) and in files on disk


Revision 35 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 15:47:40 2003 UTC (16 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 8276 byte(s)
Diff to previous 34 , to selected 153
last changes; completly broken charsets


Revision 34 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 08:06:07 2003 UTC (16 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 8189 byte(s)
Diff to previous 32 , to selected 153
append="1" fix


Revision 32 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 07:53:01 2003 UTC (16 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 8151 byte(s)
Diff to previous 29 , to selected 153
display fields using order="" attribute


Revision 29 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 07:08:54 2003 UTC (16 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 7880 byte(s)
Diff to previous 21 , to selected 153
repeatable field support, filter functions added, broken charset (again!)


Revision 21 - (view) (annotate) - [select for diffs]
Modified Sun Feb 23 00:00:51 2003 UTC (16 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 7225 byte(s)
Diff to previous 20 , to selected 153
fix


Revision 20 - (view) (annotate) - [select for diffs]
Modified Sat Feb 22 23:49:22 2003 UTC (16 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 7245 byte(s)
Diff to previous 17 , to selected 153
add filter="name" for fields (to correct strane input data or make variations
for indexing)


Revision 17 - (view) (annotate) - [select for diffs]
Modified Sat Feb 22 21:33:04 2003 UTC (16 years, 4 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 6454 byte(s)
Diff to previous 13 , to selected 153
fix index insertion


Revision 13 - (view) (annotate) - [select for diffs]
Modified Sun Feb 16 22:41:37 2003 UTC (16 years, 5 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 6528 byte(s)
Diff to previous 10 , to selected 153
added configuration file with database descriptions,
moved isis.xml definition file in separate directory (in preparation for MARK),
support for different encodings in different files,
various fixes, improvements and badly written parts which will change ;-)


Revision 10 - (view) (annotate) - [select for diffs]
Modified Thu Jan 16 17:35:54 2003 UTC (16 years, 6 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 5683 byte(s)
Diff to previous 9 , to selected 153
bunch of changes: make design more modular, implement index (partial
implementation) and other small and big changes


Revision 9 - (view) (annotate) - [select for diffs]
Modified Sat Jan 11 19:55:30 2003 UTC (16 years, 6 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 6713 byte(s)
Diff to previous 7 , to selected 153
renamed "old" index to swish, and introduced index which is -- index;
implemented using PostgreSQL for now.


Revision 7 - (view) (annotate) - [select for diffs]
Modified Sat Jan 11 16:44:03 2003 UTC (16 years, 6 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 5543 byte(s)
Diff to previous 5 , to selected 153
major modifications to produce first (non-working) version of Web CGI
interface.


Revision 5 - (view) (annotate) - [select for diffs]
Modified Sat Jan 11 06:14:48 2003 UTC (16 years, 6 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 5107 byte(s)
Diff to previous 4 , to selected 153
require 1.02 version of Text::Unaccent (1.01 can't pass 'make test' here!)


Revision 4 - (view) (annotate) - [select for diffs]
Modified Sun Dec 1 22:51:29 2002 UTC (16 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 5065 byte(s)
Diff to previous 3 , to selected 153
remove subfield definition from values which are displayed and indexed


Revision 3 - (view) (annotate) - [select for diffs]
Modified Sat Nov 30 00:36:34 2002 UTC (16 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 4996 byte(s)
Diff to previous 1 , to selected 153
first really working version -- creates xml file for swish + swish config


Revision 1 - (view) (annotate) - [select for diffs]
Added Sun Nov 24 20:52:11 2002 UTC (16 years, 7 months ago) by dpavlin
Original Path: trunk/all2xml.pl
File length: 1483 byte(s)
Diff to selected 153
Initial revision


This form allows you to request diffs between any two revisions of this file. For each of the two "sides" of the diff, enter a numeric revision.

  Diffs between and
  Type of Diff should be a

  ViewVC Help
Powered by ViewVC 1.1.26