/[BackupPC]/trunk/doc/Search.pm
This is repository of my old source code which isn't updated any more. Go to git.rot13.org for current projects!
ViewVC logotype

Contents of /trunk/doc/Search.pm

Parent Directory Parent Directory | Revision Log Revision Log


Revision 255 - (show annotations)
Mon Dec 12 16:31:07 2005 UTC (18 years, 4 months ago) by dpavlin
File size: 6505 byte(s)
md5sum fix

1 =head1 BackupPC search and archival extension
2
3 BackupPC search and archival extension is used to:
4
5 =over 4
6
7 =item *
8
9 find files in C<pool> by substring of filename
10
11 =item *
12
13 see state of one or all of your shares in some particular point in time
14 (file creation date on clients or backup date is supported as filter)
15
16 =item *
17
18 archive selected backups (per host, share and increment) to archival
19 medium (CD, DVD, tape etc.) with rich descriptions, archival locations,
20 multiple copies and so on.
21
22 =back
23
24 =head2 Requirements
25
26 This extension is based on PostgreSQL RDBMS and HyperEstraier full-text
27 search engine (and it's perl bindings).
28
29 If you are using Debian, you are in luck. All required packages are
30 allready part of C<unstable> distribution and can be installed with:
31
32 # apt-get install postgresql
33 # apt-get install hyperestraier libestraier-dev libqdbm-dev
34
35 Now you can skip to installation of L<HyperEstraier perl bindings> below.
36
37 If you don't have pre-packaged binaries for your installation, you will
38 need to install additional packages by hand.
39
40 =head3 PostgreSQL
41
42 You can use packages provided by your distribution or follow installation
43 instructions on PostgreSQL site.
44
45 =head3 QDBM
46
47 First, you need qdbm on which HyperEstraier depends. Installation is simple.
48
49 $ tar xvfz qdbm-1.8.31.tar.gz
50 $ cd qdbm-1.8.31
51 $ ./configure --enable-zlib
52 $ make
53 $ sudo make install
54
55 =head3 HyperEstraier
56
57 Also quite simple.
58
59 $ tar xvfz hyperestraier-0.5.4.tar.gz
60 $ cd hyperestraier-0.5.4
61 $ ./configure
62 $ make
63 $ sudo make install
64
65 Then you will have to install perl bindings for HyperEstraier.
66
67 =head3 HyperEstraier perl bindings
68
69 This might take a bit more work. You will need to have C<swig> and C<g++> installed
70 before you can install perl bindings.
71
72 $ tar xvfz hyper_estraier_wrappers-0.0.10.tar.gz
73 $ cd swig_hest/perl/
74 $ swig -c++ -perl5 -o HyperEstraier_wrap.cpp ../HyperEstraier.i
75 $ cp ../HyperEstraierWrapper.cpp ./
76 $ perl Makefile.PL
77
78 You might need to remove C<estconfig --mtlibs> from C<Makefile.PL> if it
79 fails to run with following message:
80
81 Unrecognized argument in LIBS ignored: 'configuration'
82 Unrecognized argument in LIBS ignored: 'helper'
83 Unrecognized argument in LIBS ignored: 'for'
84 Unrecognized argument in LIBS ignored: 'Hyper'
85 Unrecognized argument in LIBS ignored: 'Estraier'
86 Writing Makefile for HyperEstraier
87
88 Working C<Makefile.PL> for my particular installation is:
89
90 # File : Makefile.PL
91 use ExtUtils::MakeMaker;
92 WriteMakefile(
93 NAME => 'HyperEstraier',
94 INC => sprintf('%s', `pkg-config --cflags hyperestraier`),
95 LIBS => [sprintf('-lstdc++ %s %s' , `pkg-config --libs hyperestraier`)],
96 OBJECT => 'HyperEstraierWrapper.o HyperEstraier_wrap.o'
97 );
98
99 After succesfull C<perl Makefile.pl>, you can compile and install it.
100
101 $ make
102 $ sudo make install
103
104 =head3 CPAN modules
105
106 You will also need a few additional cpan modules
107
108 =over 4
109
110 =item File::Pid
111 =item File::Which
112 =item File::Path
113 =item File::Slurp
114
115 =item Spreadsheet::WriteExcel
116
117 =item Term::Menus
118 =item XML::Writer
119 =item Algorithm::Diff
120 =item Archive::Tar::Stream
121
122 =back
123
124 Last module, C<Spreadsheet::WriteExcel> is needed only if you want to use
125 C<BackupPC_xls_report> to generate Excel reports from your backup data.
126
127 Easiest way to install them is using C<cpan> shell.
128
129 $ sudo cpan File::Pid Spreadsheet::WriteExcel
130
131 =head1 Creation of initial database
132
133 Once you have all components installed, you should initially create data
134 about increments in RDBMS and full-text search engine index.
135
136 First, edit C<config.pl> and setup C<SearchDSN> to valid perl DSN (for example,
137 C<dbi:Pg:dbname=backuppc> and C<SearchUser> to database user which has priviledges
138 over that database. You might need to add additional directives in PostgreSQL's
139 C<pg_hba.conf> so that selected user can be connected without password (I know, it's not
140 perfect, but I trust local users on backuppc machine).
141
142 Then, create new database for backuppc.
143
144 $ createdb backuppc
145
146 Then invoke C<BackupPC_updatedb> for the first time with argument to create database
147 schema:
148
149 $ sudo -u backuppc /data/backuppc/bin/BackupPC_updatedb -c
150
151 You can also force full-text reindex by C<-i> flag. This will add all files which are in
152 database but for some reason are missing in full-text index.
153
154 Now, you can setup cron job which will daily check your backups and update database
155 and full-text index. This is as simple as invoking
156
157 /data/backuppc/bin/BackupPC_updatedb
158
159 As C<backuppc> user and redirecting output to log file.
160
161 B<Documentation is still incomplete>.
162
163 Now that you finished installation, you can select new options from
164 menu on the left and example search and archival.
165
166 =head1 Reporting in Excel
167
168 Often, it is useful to be able to present report about your BackupPC hosts, number
169 of snapshots, total size and other useful info. While all those information can
170 be accessed using web browser, for analysis it's useful to have them in spreadsheet
171 form. With this data, you can monitor changes on your backup cycle, effects of changes
172 on your server or network to your BackupPC installation and so on.
173
174 You can create Excel spreadsheet (which works perfectly with Gnumeric also) using
175 following command:
176
177 $ sudo -u backuppc /data/backuppc/bin/BackupPC_xls_report
178
179 =head1 Additional requirements
180
181 You will also need several other command-line utilities to make
182 C<BackupPC_burnArchiveCLI> run. Those include:
183
184 =over 4
185
186 =item mkisofs
187
188 =item cdrecord
189
190 Or equivavalent utility, on Debian you might want to use C<dvdrecord>.
191
192 =item eject
193
194 Please make a symlink from C</dev/cdrom> to you CD/DVD burner.
195
196 =item split
197
198 =back
199
200 =head1 Troubleshoting
201
202 =head2 md5sum problems on Debian
203
204 If you are running Debian stable and expiriencing error message like
205
206 -: No such file or directory
207
208 your C<md5sum> comes from C<coreutils> (which doesn't know how to handle C<->
209 as input) as opposed to C<md5sum> which comes with C<textutils> (which does).
210
211 Fix is simple, correct md5sum is allready installed, you just have to do:
212
213 sudo cp /usr/bin/md5sum.textutils /usr/bin/md5sum
214
215 =head1 Related projects
216
217 BackupPC allready has archival host which might suit your needs better (and
218 it's quite easier to install).
219
220 =over 4
221
222 =item PostgreSQL
223
224 L<http://www.postgresql.org/>
225
226 =item HyperEstraier
227
228 L<http://hyperestraier.sourceforge.net/>
229
230 =back
231
232 =head1 Authors
233
234 Ivan Klaric C<< <iklaric@gmail.com> >>
235
236 Dobrica Pavlinusic C<< <dpavlin@rot13.org> >>
237
238 =head1 Licence
239
240 This extension is released under GPL licence, same as BackupPC.
241

  ViewVC Help
Powered by ViewVC 1.1.26