Manual Reference Pages  -  SEARCH::OPENFTS::PARSER (3)

NAME

Search::OpenFTS::Parser - Provides functions for parsing

SYNOPSIS

my $parser = Search::OpenFTS::Parser->new();

$types = $parser->alltypes;

$parser->start_parser( \$text );
$parser->start_parser( \*STDIN );
$parser->start_parser( \*STDIN, $limit );

while ( ( ($type, $word) = $parser->get_word ) && $type ) {
        print $parser->type_description( $type ), "\t$word\n";
}

$parser->end_parser;

The parser converts the text in the variable $text into a stream of lexemes, each paired with its type ID. To list all the lexeme types that the parser supports, use:

my @types = $parser->alltypes;

print "$_ => $types[$_]\n" for 1..$#types;

DESCRIPTION

The package Search::OpenFTS::Parser is a wrapper around the parser functions. The particular functions that a parser must provide in order to work with OpenFTS are discussed in section VII (Customizations) of the OpenFTS primer.

The parser recognizes the following types of lexemes:



 1 => Latin word
 2 => Cyrillic word
 3 => Word
 4 => Email
 5 => URL
 6 => Host
 7 => Scientific notation
 8 => VERSION
 9 => Part of hyphenated word
 10 => Cyrillic part of hyphenated word
 11 => Latin part of hyphenated word
 12 => Space symbols
 13 => Char in tag
 14 => HTTP head
 15 => Hyphenated word
 16 => Latin hyphenated word
 17 => Cyrillic hyphenated word
 18 => URI
 19 => File or path name
 20 => Decimal notation
 21 => Signed integer
 22 => Unsigned integer
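
The type IDs above are typically what calling code uses to decide which lexemes to keep. The following is a minimal, self-contained sketch of that idea and does not call OpenFTS at all: the @type_name array is copied from the list above, while the sample (type, word) pairs, the choice of IDs to keep, and the helper filter_lexemes are hypothetical and only stand in for the pairs that get_word would return.

```perl
use strict;
use warnings;

# Descriptions indexed by type ID, copied from the list above.
# Index 0 is unused because type IDs start at 1.
my @type_name = (
    undef,
    'Latin word', 'Cyrillic word', 'Word', 'Email', 'URL', 'Host',
    'Scientific notation', 'VERSION', 'Part of hyphenated word',
    'Cyrillic part of hyphenated word', 'Latin part of hyphenated word',
    'Space symbols', 'Char in tag', 'HTTP head', 'Hyphenated word',
    'Latin hyphenated word', 'Cyrillic hyphenated word', 'URI',
    'File or path name', 'Decimal notation', 'Signed integer',
    'Unsigned integer',
);

# Assumption: we only want plain words (IDs 1, 2 and 3).
my %keep = map { $_ => 1 } ( 1, 2, 3 );

# Hypothetical sample data: [type, word] pairs as a stand-in
# for what a real parser run would produce.
my @lexemes = (
    [ 1,  'search' ],
    [ 12, ' ' ],
    [ 4,  'user@example.com' ],
    [ 12, ' ' ],
    [ 22, '42' ],
);

# Return only the pairs whose type ID is present in the keep set.
sub filter_lexemes {
    my ( $keep, @pairs ) = @_;
    return grep { $keep->{ $_->[0] } } @pairs;
}

for my $pair ( filter_lexemes( \%keep, @lexemes ) ) {
    my ( $type, $word ) = @$pair;
    print "$type_name[$type]\t$word\n";
}
```

Keeping the wanted IDs in a hash makes the filter a single lookup per lexeme, and the set can be widened (for example to include the hyphenated-word types) without touching the loop.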



SEE ALSO



    The OpenFTS Primer          (see the doc/ subdirectory)

    The Crash-course to OpenFTS (in the examples/ subdirectory)

    perldoc Search::OpenFTS::Search

    perldoc Search::OpenFTS::Index

    perldoc Search::OpenFTS::Dict::PorterEng

    perldoc Search::OpenFTS::Dict::Snowball

    perldoc Search::OpenFTS::Dict::UnknownDict

    perldoc Search::OpenFTS::Morph::ISpell




perl v5.20.3 PARSER (3) 2009-12-04
