![]() |
![]()
| ![]() |
![]()
NAMEHTML::Summary - generate a summary from a web page SYNOPSISuse HTML::Summary; use HTML::TreeBuilder; my $tree = HTML::TreeBuilder->new; $tree->parse( $document ); my $summarizer = HTML::Summary->new( LENGTH => 200, USE_META => 1, ); $summary = $summarizer->generate( $tree ); $summarizer->option( 'USE_META' => 1 ); $length = $summarizer->option( 'LENGTH' ); if ( $summarizer->meta_used() ) { # do something } DESCRIPTIONThe "HTML::Summary" module produces summaries from the textual content of web pages. It does so using the location heuristic, which determines the value of a given sentence based on its position and status within the document; for example, headings, section titles and opening paragraph sentences may be favoured over other textual content. A LENGTH option can be used to restrict the length of the summary produced. CONSTRUCTORnew( $attr1 => $value1 [, $attr2 => $value2 ] )Possible attributes are:
my $summarizer = HTML::Summary->new(LENGTH => 200); METHODSoption( )Get / set HTML::Summary configuration options. my $length = $summarizer->option( 'LENGTH' ); $summarizer->option( 'USE_META' => 1 ); generate( $tree )Takes an HTML::Element object, and generates a summary from it. my $tree = HTML::TreeBuilder->new; $tree->parse( $document ); my $summary = $summarizer->generate( $tree ); meta_used( )Returns 1 if the META tag description was used to generate the summary. if ( $summarizer->meta_used() ) { # do something ... } SEE ALSOHTML::TreeBuilder, Text::Sentence, Lingua::JA::Jcode, Lingua::JA::Jtruncate. REPOSITORY<https://github.com/neilb/HTML-Summary> AUTHORSThis module was originally whipped up by Neil Bowers and Tony Rose. It was then developed and maintained by Ave Wrigley and Tony Rose. Neil Bowers is currently maintaining the HTML-Summary distribution. Neil Bowers <neilb@cpan.org> COPYRIGHT AND LICENSECopyright (c) 1997 Canon Research Centre Europe (CRE). All rights reserved. This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.
|