Manual Reference Pages  -  HTML::RSSAUTODISCOVERY (3)


NAME

HTML::RSSAutodiscovery - methods for retrieving RSS-ish information from an HTML document.


SYNOPSIS



 use strict;
 use warnings;

 use HTML::RSSAutodiscovery;
 use Data::Dumper;

 my $url = "http://www.diveintomark.org/";

 my $html = HTML::RSSAutodiscovery->new();
 print Dumper($html->parse($url));

 # Mark's gone a bit nuts with this and
 # the list is too long to include here...

 # See the POD for the parse method for
 # details of what it returns.



DESCRIPTION

Methods for retrieving RSS-ish information from an HTML document.

PACKAGE METHODS

__PACKAGE__->new()

Object constructor. Returns an object. Woot!

OBJECT METHODS

$obj->parse($arg)

Parse an HTML document and return RSS-ish <link> information.

$arg may be either:
o An HTML string, passed as a scalar reference.
o A URI.

Returns an array reference of hash references whose keys are:
o title
o type
o rel
o href
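
As a minimal sketch of the scalar-reference form described above, the following passes an in-memory HTML string to parse and walks the returned array reference. The sample markup and the example.com address are hypothetical, and no particular output is claimed, since which <link> elements count as RSS-ish is decided by the module.

```perl
use strict;
use warnings;

use HTML::RSSAutodiscovery;

# Hypothetical sample document, used only for illustration.
my $html = <<'HTML';
<html>
 <head>
  <title>Example</title>
  <link rel="alternate" type="application/rss+xml"
        title="RSS" href="http://www.example.com/index.rdf">
 </head>
 <body></body>
</html>
HTML

my $finder = HTML::RSSAutodiscovery->new();

# An HTML string is passed as a scalar reference; a plain
# scalar would be treated as a URI instead.
my $links = $finder->parse(\$html);

# Each element is a hash reference with the keys
# title, type, rel and href.
for my $link (@{ $links || [] }) {
    printf "%s [%s] %s\n",
        $link->{title} // '(no title)',
        $link->{type}  // '(no type)',
        $link->{href}  // '(no href)';
}
```

Because the document is supplied as a string, this form should not need network access.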

$obj->locate($uri,\%args)

Like the parse method, but will perform additional lookups, if necessary or specified.

Valid arguments are:
o uri

String. A live, breathing URI to slurp and parse.

Required.

o A hash reference whose keys may be:

    o noparse

    Boolean. Don’t bother parsing the document. This also prevents you from checking for embedded links.

    I don’t know why you want to do this, but you can.

    False, by default.

    o embedded

    Boolean. Check all embedded links ending in ’.xml’, ’.rss’ or ’.rdf’ (and then ’xml’, ’rss’ or ’rdf’) for RSS-ness.

    False, by default, unless the initial parsing of the URI returns no RSS links.

    o embedded_and_remote

    Boolean. Check all embedded links whose root is not the same as $uri for RSS-ness.

    False, by default.

    o syndic8

    Boolean. Check the syndic8 servers for sites matching $uri.

    False, by default, unless the initial parsing of the URI and any embedded links returns no RSS links.

Returns an array reference of hash references whose keys are:
o title
o type
o rel
o href
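
A call to locate might look like the sketch below. It assumes only what is documented above: a URI as the first argument and a hash reference of boolean options as the second. The example.com address is hypothetical, and the call fetches the page over the network.

```perl
use strict;
use warnings;

use HTML::RSSAutodiscovery;

# Hypothetical URI, used only for illustration.
my $uri = "http://www.example.com/";

my $finder = HTML::RSSAutodiscovery->new();

# Ask locate to also check embedded links ending in
# .xml, .rss or .rdf. Options left out keep their defaults.
my $feeds = $finder->locate($uri, { embedded => 1 });

# The return value has the same shape as for parse:
# an array reference of hash references.
for my $feed (@{ $feeds || [] }) {
    printf "%s\t%s\n",
        $feed->{type} // '(no type)',
        $feed->{href} // '(no href)';
}
```

Setting embedded pulls in the EMBEDDED requirements listed under REQUIREMENTS, and syndic8 pulls in the SYNDIC8 ones.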

VERSION

1.21

DATE

$Date: 2004/10/17 04:13:06 $

AUTHOR

Aaron Straup Cope

SEE ALSO

Because you shouldn’t need all that white space to do cool stuff ;-)

http://diveintomark.org/archives/2002/05/30.html#rss_autodiscovery

http://diveintomark.org/archives/2002/08/15.html

http://diveintomark.org/projects/misc/rssfinder.py.txt

REQUIREMENTS

    BASIC

These packages are required to actually parse an HTML document or URI.
o HTML::Parser
o LWP::UserAgent
o HTTP::Request

    EMBEDDED

These packages are required to check the embedded links in a URI for RSS files. They are not loaded until run-time, so they are not required for basic parsing.
o XML::RSS

    SYNDIC8

These packages are required to query the syndic8 servers for RSS files associated with a URI. They are not loaded until run-time, so they are not required for basic parsing.
o XMLRPC::Lite
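
Since the optional packages are only needed at run-time, a caller may want to check for them before enabling the matching locate options. The sketch below is one way to do that with plain Perl. The module_available helper is hypothetical and is not part of HTML::RSSAutodiscovery, and it does not reproduce how the module itself loads these packages.

```perl
use strict;
use warnings;

# Hypothetical helper: true if the named module can be
# loaded at run time, false otherwise.
sub module_available {
    my ($module) = @_;

    # Turn Foo::Bar into Foo/Bar.pm so require accepts it
    # as a file name.
    (my $file = "$module.pm") =~ s{::}{/}g;

    return eval { require $file; 1 } ? 1 : 0;
}

# Only enable the options whose requirements are installed.
my %args = (
    embedded => module_available('XML::RSS'),
    syndic8  => module_available('XMLRPC::Lite'),
);

for my $option (sort keys %args) {
    printf "%-8s %s\n", $option, $args{$option} ? "enabled" : "disabled";
}
```

The resulting %args hash could then be passed by reference as the second argument to locate.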

LICENSE

Copyright (c) 2002-2004, Aaron Straup Cope. All Rights Reserved.

This is free software, you may use it and distribute it under the same terms as Perl itself.



perl v5.20.3 HTML::RSSAUTODISCOVERY (3) 2004-10-17
