jcolibri.extensions.textual.IE.common
Class StopWordsDetector

java.lang.Object
  extended by jcolibri.extensions.textual.IE.common.StopWordsDetector

public class StopWordsDetector
extends java.lang.Object

Removes stop words (workds without relevant meaning) and punctuation symbols. It uses a built-in list and modifies the "isStopWord" flag of the tokens.

The first version was developed at: Robert Gordon University - Aberdeen & Facultad Informática, Universidad Complutense de Madrid (GAIA)

Version:
2.0
Author:
Juan A. Recio-Garcia

Constructor Summary
StopWordsDetector()
           
 
Method Summary
static void detectStopWords(CBRQuery query)
          Performs the algorithm in all the attributes of a query These attributes must be IEText objects.
static void detectStopWords(CBRQuery query, java.util.Collection<Attribute> attributes)
          Performs the algorithm in the given attributes of a query.
static void detectStopWords(java.util.Collection<CBRCase> cases)
          Performs the algorithm in all the attributes of a collection of cases These attributes must be IEText objects.
static void detectStopWords(java.util.Collection<CBRCase> cases, java.util.Collection<Attribute> attributes)
          Performs the algorithm in the given attributes of a collection of cases.
static void detectStopWords(IEText text)
          Performs the algorithm in a given IEText object
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

StopWordsDetector

public StopWordsDetector()
Method Detail

detectStopWords

public static void detectStopWords(java.util.Collection<CBRCase> cases,
                                   java.util.Collection<Attribute> attributes)
Performs the algorithm in the given attributes of a collection of cases. These attributes must be IEText objects.


detectStopWords

public static void detectStopWords(CBRQuery query,
                                   java.util.Collection<Attribute> attributes)
Performs the algorithm in the given attributes of a query. These attributes must be IEText objects.


detectStopWords

public static void detectStopWords(java.util.Collection<CBRCase> cases)
Performs the algorithm in all the attributes of a collection of cases These attributes must be IEText objects.


detectStopWords

public static void detectStopWords(CBRQuery query)
Performs the algorithm in all the attributes of a query These attributes must be IEText objects.


detectStopWords

public static void detectStopWords(IEText text)
Performs the algorithm in a given IEText object


GAIA - Group for Artificial Intelligence Applications
http://gaia.fdi.ucm.es