/[Grep]/lib/Grep/Source/DokuWiki.pm
This is repository of my old source code which isn't updated any more. Go to git.rot13.org for current projects!
ViewVC logotype

Contents of /lib/Grep/Source/DokuWiki.pm

Parent Directory Parent Directory | Revision Log Revision Log


Revision 122 - (show annotations)
Sat Apr 28 17:54:53 2007 UTC (17 years ago) by dpavlin
File size: 627 byte(s)
from my limited sample, I would say that all DokuWiki results are inside
<div class="search_result">
1 #!/usr/bin/perl
2
3 use warnings;
4 use strict;
5
6 package Grep::Source::DokuWiki;
7
8 =head1 NAME
9
10 Grep::Source::DokuWiki - scraiper for DokuWiki search results
11
12 =head1 BUGS
13
14 Since DokuWiki html isn't structured enough (there is no block element
15 around content of page, sic!), search results contain form elements and
16 buttons.
17
18 =cut
19
20 sub content_have {
21 qr(generator.*DokuWiki);
22 }
23
24 sub fetch {
25 my $self = shift;
26 my $parent = shift;
27
28 $parent->scrape(
29 wrapper => [ qw/
30 div class page
31 div class dokuwiki
32 /
33 ],
34 results => [
35 qw/div class search_result/,
36 ],
37 scrape => [ qw/div class dokuwiki/ ],
38 );
39
40 }
41
42 1;

  ViewVC Help
Powered by ViewVC 1.1.26