jcolibri.extensions.textual.IE.common
Class TextStemmer

java.lang.Object
  extended by jcolibri.extensions.textual.IE.common.TextStemmer

public class TextStemmer
extends java.lang.Object

Stems the tokens of the text using the Snowball package (http://snowball.tartarus.org).
It stores the stem in each token using the flag with the same name.

The first version was developed at Robert Gordon University, Aberdeen, and Facultad de Informática, Universidad Complutense de Madrid (GAIA).

Version:
2.0
Author:
Juan A. Recio-Garcia

Constructor Summary
TextStemmer()
           
 
Method Summary
static void stem(CBRQuery query)
          Stems all the attributes of a query. These attributes must be IEText objects.
static void stem(CBRQuery query, java.util.Collection<Attribute> attributes)
          Stems the given attributes of a query.
static void stem(java.util.Collection<CBRCase> cases)
          Stems all the attributes of a collection of cases. These attributes must be IEText objects.
static void stem(java.util.Collection<CBRCase> cases, java.util.Collection<Attribute> attributes)
          Stems the given attributes of a collection of cases.
static void stem(IEText text)
          Stems the tokens of the text.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

TextStemmer

public TextStemmer()

Method Detail

stem

public static void stem(java.util.Collection<CBRCase> cases,
                        java.util.Collection<Attribute> attributes)
Stems the given attributes of a collection of cases. These attributes must be IEText objects.


stem

public static void stem(CBRQuery query,
                        java.util.Collection<Attribute> attributes)
Stems the given attributes of a query. These attributes must be IEText objects.


stem

public static void stem(java.util.Collection<CBRCase> cases)
Stems all the attributes of a collection of cases. These attributes must be IEText objects.


stem

public static void stem(CBRQuery query)
Stems all the attributes of a query. These attributes must be IEText objects.


stem

public static void stem(IEText text)
Stems the tokens of the text. If no stem is found, it stores the original word as the stem.

Parameters:
text - the text whose tokens are stemmed
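
The fallback rule described for stem(IEText) (store the original word when no stem is found) can be illustrated with a small stand-alone sketch. This is not the jCOLIBRI implementation: the Token class, the toyStem lookup table, and the stemAll method are hypothetical stand-ins for IEText tokens and the Snowball stemmer, chosen so that the example runs without any external library.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

class StemFallbackSketch {

    /** Hypothetical token: the raw word plus the stem stored alongside it. */
    static final class Token {
        final String raw;
        String stem; // filled in by stemAll

        Token(String raw) {
            this.raw = raw;
        }
    }

    // Toy lookup table standing in for a real stemmer such as Snowball (assumption).
    static final Map<String, String> TOY_STEMS = new HashMap<>();
    static {
        TOY_STEMS.put("running", "run");
        TOY_STEMS.put("cases", "case");
    }

    /** Returns the toy stem of a word, or null when the table has no entry for it. */
    static String toyStem(String word) {
        return TOY_STEMS.get(word.toLowerCase(Locale.ROOT));
    }

    /** Stores a stem in every token, falling back to the raw word when none is found. */
    static void stemAll(List<Token> tokens) {
        for (Token t : tokens) {
            String s = toyStem(t.raw);
            t.stem = (s != null) ? s : t.raw;
        }
    }

    public static void main(String[] args) {
        List<Token> tokens = new ArrayList<>();
        for (String w : new String[] {"running", "cases", "xyzzy"}) {
            tokens.add(new Token(w));
        }
        stemAll(tokens);
        for (Token t : tokens) {
            System.out.println(t.raw + " -> " + t.stem);
        }
    }
}
```

In this sketch a word missing from the table, such as "xyzzy", keeps its raw form as its stem, so every token carries a usable stem after the call.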

GAIA - Group for Artificial Intelligence Applications
http://gaia.fdi.ucm.es