Man Pages
HXEXTRACT(1) HTML-XML-utils HXEXTRACT(1)

hxextract - extract selected elements from an HTML or XML file

hxextract [ -h | -? ] [ -x ] [ -s text ] [ -e text ] [ -b base ] element-or-class [ -c configfile | file-or-URL ]

hxextract outputs all elements with a certain name and/or class.

Input must be well-formed, since no HTML heuristics are applied.

The following options are supported:
-x
Use XML format conventions.
-s text
Insert text at the start of the output.
-e text
Insert text at the end of the output.
-b base
URL base
-c configfile
Read @chapter lines from configfile (lines must be of the form "@chapter filename") and extract elements from each of those files.
-h, -?
Print command usage.
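
The "@chapter filename" line format read by -c can be illustrated with a small Python sketch. This is not hxextract's own code: the function name chapter_files and the sample file names are hypothetical, and skipping lines that are not @chapter lines is an assumption made for the illustration.

```python
def chapter_files(config_text):
    """Return the file names listed on "@chapter filename" lines."""
    files = []
    for line in config_text.splitlines():
        # Split into the directive and the rest of the line.
        parts = line.split(None, 1)
        # Assumption: lines that are not "@chapter filename" are skipped.
        if len(parts) == 2 and parts[0] == "@chapter":
            files.append(parts[1].strip())
    return files


# Hypothetical config file contents.
sample = "@chapter intro.html\n@chapter usage.html\n"
print(chapter_files(sample))  # ['intro.html', 'usage.html']
```

Each returned name corresponds to one file from which elements would be extracted.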

The following operands are supported:
element-or-class
The name of an element to extract (e.g., "H2"), or the name of a class preceded by "." (e.g., ".example") or a combination of both (e.g., "H2.example").
file-or-URL
A file name or a URL. To read from standard input, use "-".
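
The three forms of the element-or-class operand can be sketched in Python as follows. This is only an illustration of the syntax described above, not the tool's implementation: parse_selector and matches are hypothetical names, and comparing element names case-insensitively is an assumption.

```python
def parse_selector(operand):
    """Split "H2", ".example" or "H2.example" into (element, class)."""
    element, _dot, cls = operand.partition(".")
    if not element and not cls:
        raise ValueError("expected an element name, a .class, or both")
    return (element or None, cls or None)


def matches(operand, name, classes):
    """Check an element's name and class list against the operand."""
    element, cls = parse_selector(operand)
    # Assumption: element names are compared without regard to case.
    if element is not None and element.lower() != name.lower():
        return False
    if cls is not None and cls not in classes:
        return False
    return True


print(parse_selector("H2"))          # ('H2', None)
print(parse_selector(".example"))    # (None, 'example')
print(parse_selector("H2.example"))  # ('H2', 'example')
```

When both parts are given, as in "H2.example", an element has to satisfy both the name and the class to be selected.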

To use a proxy to retrieve remote files, set the environment variables http_proxy and ftp_proxy. For example: http_proxy="http://localhost:8080/"
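
When hxextract is started from a script, the proxy variable can be set for that one invocation only. The Python sketch below builds the argument list and environment under stated assumptions: the helper names hxextract_command and with_proxy are hypothetical, the URL http://example.org/ is a placeholder, and the operand order follows the synopsis above.

```python
import os


def hxextract_command(selector, source="-", xml=False):
    """Build an argument list following the synopsis: options, selector, source."""
    cmd = ["hxextract"]
    if xml:
        cmd.append("-x")
    cmd += [selector, source]
    return cmd


def with_proxy(proxy_url, base_env=None):
    """Return a copy of the environment with http_proxy set."""
    env = dict(os.environ if base_env is None else base_env)
    env["http_proxy"] = proxy_url
    return env


cmd = hxextract_command("H2", "http://example.org/")
env = with_proxy("http://localhost:8080/", base_env={})
print(cmd)                # ['hxextract', 'H2', 'http://example.org/']
print(env["http_proxy"])  # http://localhost:8080/
```

The pair can then be passed to subprocess.run(cmd, env=env) on a system where hxextract is installed. In real use, omit base_env so that the current environment (including PATH) is copied. Copying the environment keeps the proxy setting from leaking into the calling process.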

Remote files (specified with a URL) are currently only supported for HTTP. Password-protected files or files that depend on HTTP "cookies" are not handled. (You can use tools such as curl(1) or wget(1) to retrieve such files.)

hxselect(1)
10 Jul 2011 7.x
