| Revision 1108 (by dpavlin, 2008/08/31 09:14:18) |
r1733@llin: dpavlin | 2008-08-31 11:14:02 +0200
beginning of marc_clone
|
+ add multiple databases
+ support multiple inputs to single database [2.00_1]
+ load multiple input/*.xml files for one input
+ lookups now works [2.00_3]
+ create links to other databases [2.00_4]
+ add regexp filter [2.00_5]
+ add YAML as normalize format
+ add more input formats (MARC::Fast and others) [2.00_6]
+ update to use Search::Estraier [2.09]
+ fix characters generated in [% search %] (move that to Webpacus?)
+ fix nasty bug with repeatable subfields [2.10]
+ write pure perl Search::HyperEstraier [Search::Estraier is on CPAN]
+ apply regex on records from input to fix-up character encodings [2.11]
+ add support for KinoSearch search library [2.12]
+ added new set-based normalizer which is pure perl code [2.13]
+ added --stats to report field and subfield usage [2.14]
+ add validator for input data [2.15]
+ add Excel input format [2.16]
+ remove WebPAC::Normalize::XML and promote WebPAC::Normalize::Set to WebPAC::Normalize [2.20]
+ support arrays for normalize/path [2.21]
+ add marc to normalize and create export MARC file [2.22]
+ implement indicators and repetable subfield in marc export [2.23]
+ add WebPAC::Output::MARC [2.24]
+ add config() and id() to WebPAC::Normalize
+ support local (by hostname) config files
+ implement marc_original_order to remap source records to marc [2.25]
+ fix statistics to use original data instead of data after modify_records
+ fix encoding and recoding issues (use UTF-8 as WebPAC native encoding) [2.26]
+ modify_file preserve order of translations in file [2.27]
+ modify_records regexpes now match just first occurence (repeat to get second...)
+ fix WebPAC::Output::MARC encoding troubles
+ generate reports (validation and stats) for each input
+ rewrite lookup support to use WebPAC::Normalize [2.28]
+ marc_leader shouldn't really be included in hash returned by data_structure
+ add checks for search directive in normalization to parser
+ lookup to another input file
+ support splitting of config yml to multiple files
+ add set and get to create in-memory lookup
+ support marc_remove('*')
+ fix-length fields (<100) support [2.29]
+ work without indexers installed or enabled [2.30]
+ simpliest possible conf/llin.yml as example
+ WebPAC::Output::JSON to produce data for Simile's exhibit
+ leader from WebPAC::Input::MARC is now available as rec('leader')
+ show i1 and i2 in statistcs specially (sorted first)
+ ISBN support in normalize
+ marc_template (WebPAC v1 in a command) [2.31]
+ marc_template must use marc_indicators data
+ indicator(field,nr) nr=1|2 for MARC input -> rec(field,'i1') or rec(field,'i2')
+ validator don't accept 0 as valid subfield
+ subroutines gets mungled in normalization files [2.33]
+ fix marc_template hadling of field value 0
+ csv/excel output conf/normalize/csv_ff-peri.pl [2.34]
+ fix input/excel encoding problems
+ use UTF-8 as internal encoding
- marc_clone to copy records/fields/indicators from input marc
- rewrite WebPAC::Input to be based on Module::Pluggable
- check usage of config in normalize file (database, input name?)
- load_ds/save_ds should use on-disk hash to preserve inodes
- validate should ignore modify_file
- add option to specify output marc path in config.yml
- add dBase input format
- remove delimiters characters from index and query entered
- delete unused files in database directories
- scoring for various fields in input/*.xml
- marclint - validate 035$9 as valid