Package dataloader

Class Summary
DocLoader Class implementing the graphical user interface of the document loader.
FileHandler A class that reads and loads the data set containing all the news.
HTMLStripper The class loads an HTML file, or the HTML page at a given URL, and strips the HTML tags from it.
KMP This is a straightforward implementation of the famous Knuth-Morris-Pratt algorithm for string-matching.
XMLParseren The class reads config.xml and gets the URL or directory containing the HTML files one wishes to tokenize.
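
The summary above names the Knuth-Morris-Pratt algorithm but does not show the KMP class's source or its method names. As a rough illustration of the technique only, a minimal sketch might look like the following; the class name KmpSketch and the methods buildFailureTable and indexOf are hypothetical and are not taken from the dataloader package.

```java
// Hypothetical sketch of Knuth-Morris-Pratt string matching.
// Class and method names are illustrative, not the package's actual API.
final class KmpSketch {
    private KmpSketch() {}

    // failure[i] is the length of the longest proper prefix of
    // pattern[0..i] that is also a suffix of pattern[0..i].
    static int[] buildFailureTable(String pattern) {
        int[] failure = new int[pattern.length()];
        int k = 0; // length of the currently matched prefix
        for (int i = 1; i < pattern.length(); i++) {
            // On a mismatch, fall back to the next shorter border.
            while (k > 0 && pattern.charAt(i) != pattern.charAt(k)) {
                k = failure[k - 1];
            }
            if (pattern.charAt(i) == pattern.charAt(k)) {
                k++;
            }
            failure[i] = k;
        }
        return failure;
    }

    // Returns the index of the first occurrence of pattern in text,
    // or -1 if there is none. An empty pattern matches at index 0.
    static int indexOf(String text, String pattern) {
        if (pattern.isEmpty()) {
            return 0;
        }
        int[] failure = buildFailureTable(pattern);
        int k = 0; // number of pattern characters matched so far
        for (int i = 0; i < text.length(); i++) {
            // Reuse the failure table instead of re-scanning the text.
            while (k > 0 && text.charAt(i) != pattern.charAt(k)) {
                k = failure[k - 1];
            }
            if (text.charAt(i) == pattern.charAt(k)) {
                k++;
            }
            if (k == pattern.length()) {
                return i - k + 1; // start index of the match
            }
        }
        return -1;
    }
}
```

Under these assumptions, a call such as KmpSketch.indexOf(page, "</p>") would locate a closing tag in time linear in the combined length of the text and the pattern, since the text index never moves backwards.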