agents
Class HTMLFetcherAgent

java.lang.Object
  extended by agents.AgentFormat
      extended by agents.HTMLFetcherAgent
All Implemented Interfaces:
java.lang.Runnable

public class HTMLFetcherAgent
extends AgentFormat

This agent uses an external library to download the pure text-contents from web-pages. This agent will receive the link to a page, and it will download the contents and it will remove all that is not pure text. This agent takes one input and one output. The input is where HTMLFetcherAgent receives the URLs that HTMLFetcherAgent has to fetch and extract the text from. The output variable contains the downloaded text.

Author:
Håvard Rykkelid

Field Summary
 
Fields inherited from class agents.AgentFormat
input, output
 
Constructor Summary
HTMLFetcherAgent()
           
 
Method Summary
 void run()
          This method will be run when the Thread containing this agentdescription is started
 
Methods inherited from class agents.AgentFormat
init, outputAll
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

HTMLFetcherAgent

public HTMLFetcherAgent()
Method Detail

run

public void run()
This method will be run when the Thread containing this agentdescription is started