r1764@llin: dpavlin | 2009-04-21 18:56:13 +0200
dump error from $@
better report modify errors
migrate internal encoding to utf-8
Clean up encodings, moving webpac closer to having an internal utf-8 representation. This will break current code, but is a really necessary step toward checking input encoding for validity
tweaks to statistics: - support stats from simple field => 'value' structure - sort fields which are not numeric correctly
r1364@llin: dpavlin | 2007-10-30 02:51:21 +0100 generalize idea a bit, and sort every subfield which has more than one char (and is thus "special" or wrong :-) in front.
r1362@llin: dpavlin | 2007-10-30 02:46:05 +0100 Show indicators (available when using WebPAC::Input::MARC) as first two subfields in statistics instead of in alphabetical order
r1298@llin: dpavlin | 2007-06-22 02:03:23 +0200 input_config can be given to new or open now
r1289@llin: dpavlin | 2007-06-21 23:26:10 +0200 * transfer input configuration hash as input_config to input module
call low-level dump_ascii as it should
r1267@llin: dpavlin | 2007-05-27 16:44:54 +0200 sort fields in stats
work with fields which have number 0 (as opposed to 000) which has been noticed in the wild (invalid, but --stats shouldn't really die)
r1203@llin: dpavlin | 2007-04-11 14:22:28 +0200 special handling for empty subfields [0.18]
fix warning
decorate output from regexp modify with filename and line
minor tweaks to test modify_file
finish tweaking mock framework, test and fix problem with slashes in modify_record
small tweaks on seek
added regex: to modify_records
another sweeping API change: input->dump is gone, replaced with input->dump_ascii which is more understandable. If you want to override default behaviour (which is to use Data::Dump's dump in input->fetch_rec), define dump_ascii in low-level WebPAC::Input:: API
r1123@llin: dpavlin | 2006-11-03 21:38:14 +0100 implement fallback dump if low-level API isn't exposing dump_rec [0.15]
implemented load_row and save_row closures to serialize input databases (using WebPAC::Store probably). This will allow lookups to share on-disk storage with low_mem option of WebPAC::Input, which is now gone (under pressure of 600000+ record database which we are now testing on)
Turn on option low_mem (which needs a rewrite to use db/row) if there are more than 10000 rows (hardcoded, but should go away). This prevents webpac from running out of memory with databases of about 300000 records on 4 GB of (virtual) memory.
fix dump (ugly, needs re-visiting)
r1045@llin: dpavlin | 2006-09-29 21:38:42 +0200 change low-level API to be OO (and remove various ugly kludges).
r1008@llin: dpavlin | 2006-09-25 17:23:42 +0200 lookup creation somewhat works
r988@llin: dpavlin | 2006-09-25 11:47:07 +0200 fix die
refactored internal WebPAC::Input::* API a bit, added dump_rec, validate is now more clever and reports all errors from database at end
implement new modify_file format which is (hopefully) simpler than yaml and/or perl [2.27] (yes, I know... It's a sin...)
move logging to debug level
fix MARC encoding woes
r878@llin: dpavlin | 2006-08-26 14:00:08 +0200 removed some debugging output (or moved it to debug level), few tweaks [2.26]
r877@llin: dpavlin | 2006-08-25 21:55:05 +0200 removed traces of Text::Iconv and replaced them with Encode, code page 852 is now cp852 (instead of just 852) because Encode likes it that way, record encoding is now hard-coded to utf-8
r867@llin: dpavlin | 2006-08-25 14:32:05 +0200 statistics now show data before modify_records
r857@llin: dpavlin | 2006-08-23 13:04:58 +0200 modify_records is now applied only once for each field to prevent looping of regexpes
added --parallel option to utilize multiple CPUs in machine
r835@llin: dpavlin | 2006-07-13 15:56:53 +0200 test modify_record
r834@llin: dpavlin | 2006-07-13 14:49:23 +0200 fix pod
r831@llin: dpavlin | 2006-07-13 13:56:19 +0200 first cut in implementing modify_records using automatically generated regexpes
r823@llin: dpavlin | 2006-07-09 17:23:28 +0200 stats now report repeatable subfields
r810@llin: dpavlin | 2006-07-05 21:53:01 +0200 change of parameters to WebPAC::Input
transfer all input variables to open_db in input module
r689@llin: dpavlin | 2006-05-18 15:45:23 +0200 treat field names as strings, not numbers (Excel field names are chars, not numbers)
dump debug not info if skipping to mfn
r669@llin: dpavlin | 2006-05-15 15:18:36 +0200 added nicely formatted stats and --stats flag to run.pl
r663@llin: dpavlin | 2006-05-15 12:02:43 +0200 added stats gathering
r651@llin: dpavlin | 2006-05-14 21:47:08 +0200 allow 0 as valid db handle
r634@llin: dpavlin | 2006-05-14 13:12:43 +0200 don't use version which doesn't exist
r625@llin: dpavlin | 2006-05-14 11:37:22 +0200 added no_progress_bar for tests and cron usage
r494@llin: dpavlin | 2006-02-27 00:22:59 +0100 implemented recode option to input (for now, just for MARC)
r346@llin: dpavlin | 2005-12-31 17:53:29 +0100 rename $offset and $limit variables to $from_rec and $to_rec to avoid confusion with parameters which have same names
r345@llin: dpavlin | 2005-12-31 17:50:23 +0100 better output
r335@athlon: dpavlin | 2005-12-20 20:01:21 +0100 added debug output for record fetched from low-level API
moved clean into WebPAC::Output::Estraier, cleanup
r322@athlon: dpavlin | 2005-12-19 22:27:06 +0100 make run.pl moderately chatty (along with other modules), added command line options (try perldoc run.pl), new targets index (to reindex all) and run (to index first 100 records of each database)
r11790@llin: dpavlin | 2005-12-19 06:35:06 +0100 and small fix for codepage
r11787@llin: dpavlin | 2005-12-19 06:10:47 +0100 MARC indexing seems to work
r11784@llin: dpavlin | 2005-12-19 05:17:24 +0100 don't use Exporter after all
r11779@llin: dpavlin | 2005-12-19 04:07:22 +0100 and fixes to make it work
r11778@llin: dpavlin | 2005-12-19 03:59:54 +0100 move work on input
r11777@llin: dpavlin | 2005-12-19 00:02:47 +0100 refactor Input::ISIS::* [0.02]
fix
ISIS input is finished, low_mem option has code (and not only documentation :-)
a bit more work on WebPAC::Input::ISIS
added WebPAC::Input::ISIS
moved implementation of lookups from older code-base
initial import of some documentation and module structure