Common implementation of the Zend_Search_Lucene_Analysis_Analyzer interface.

There are several standard standard subclasses provided by Zend_Search_Lucene/Analysis subpackage: Zend_Search_Lucene_Analysis_Analyzer_Common_Text, ZSearchHTMLAnalyzer, ZSearchXMLAnalyzer.

category Zend
package Zend_Search_Lucene
subpackage Analysis
copyright Copyright (c) 2005-2015 Zend Technologies USA Inc. (http://www.zend.com)
license New BSD License
inherited_from \Zend_Search_Lucene_Analysis_Analyzer_Common_TextNum

 Methods

__construct()

__construct() 

Add Token filter to the Analyzer

addFilter(\Zend_Search_Lucene_Analysis_TokenFilter $filter) 
Inherited

Parameters

$filter

\Zend_Search_Lucene_Analysis_TokenFilter

Return the default Analyzer implementation used by indexing code.

getDefault() : \Zend_Search_Lucene_Analysis_Analyzer
InheritedStatic

Returns

\Zend_Search_Lucene_Analysis_Analyzer

Tokenization stream API Get next token Returns null at the end of stream

nextToken() : \Zend_Search_Lucene_Analysis_Token | null
Inherited

Tokens are returned in UTF-8 (internal Zend_Search_Lucene encoding)

Returns

\Zend_Search_Lucene_Analysis_Tokennull

Apply filters to the token. Can return null when the token was removed.

normalize(\Zend_Search_Lucene_Analysis_Token $token) : \Zend_Search_Lucene_Analysis_Token
Inherited

Parameters

$token

\Zend_Search_Lucene_Analysis_Token

Returns

\Zend_Search_Lucene_Analysis_Token

Reset token stream

reset() 
Inherited

Set the default Analyzer implementation used by indexing code.

setDefault(\Zend_Search_Lucene_Analysis_Analyzer $analyzer) 
InheritedStatic

Parameters

$analyzer

Tokenization stream API Set input

setInput(string $data, $encoding = ''
Inherited

Parameters

$data

string

$encoding

Tokenize text to a terms Returns array of Zend_Search_Lucene_Analysis_Token objects

tokenize(string $data, $encoding = '') : array
Inherited

Tokens are returned in UTF-8 (internal Zend_Search_Lucene encoding)

Parameters

$data

string

$encoding

Returns

array

 Properties

 

Input string encoding

$_encoding : string

Default

''
 

Input string

$_input : string

Default

null
 

The Analyzer implementation used by default.

$_defaultImpl : \Zend_Search_Lucene_Analysis_Analyzer

Default

Static
 

The set of Token filters applied to the Token stream.

$_filters : array

Default

array()

Array of Zend_Search_Lucene_Analysis_TokenFilter objects.

 

Current position in a stream

$_position : integer

Default