
Growing a Forest: Genetic Decision Tree Induction

Fuglseth, Anders Nikolai
Master thesis
View/Open
14785_FULLTEXT.pdf (2.235Mb)
14785_ATTACHMENT.zip (3.410Mb)
14785_COVER.pdf (1.622Mb)
URI
http://hdl.handle.net/11250/2445849
Date
2016
Collections
  • Institutt for datateknologi og informatikk [3955]
Abstract
In decision tree learning, the traditional top-down divide-and-conquer approach searches only a limited part of the hypothesis space, often leading to sub-optimal solutions. Inducing decision trees with an evolutionary algorithm allows the hypothesis space to be searched globally, leading to stronger solutions while maintaining the inherent comprehensibility that decision trees offer. We have developed EMTI, the Evolutionary Multi-class Tree Inductor, a genetic programming method for inducing axis-parallel, poly-ary decision trees for multi-class classification problems. It focuses on creating accurate decision trees with a high degree of human readability. EMTI uses a genetic programming encoding scheme that represents individuals directly as decision trees, and implements tree-specific crossover and mutation operators. The initial population is generated as minimal trees with a single decision node, which grow rapidly in size as the number of evolution cycles increases. The multi-objective fitness function rewards classification accuracy while favoring smaller trees over larger ones. Traditional decision tree pruning methods and early stopping methods are shown to be viable ways of avoiding overfitting in the algorithm. EMTI scores favorably in terms of classification accuracy compared to C4.5 and shows a strong ability to ignore data noise and irrelevant attributes.
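
The abstract states only that the fitness function rewards accuracy and favors smaller trees; it does not give a formula. The sketch below is therefore an illustration under stated assumptions, not EMTI's actual implementation: it assumes categorical attributes, a weighted-sum combination of the two objectives, and a hypothetical `size_weight` parameter. The names `Node`, `predict`, `size`, and `fitness` are likewise invented for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Sequence, Tuple

Row = Dict[str, str]  # one example: attribute name -> categorical value


@dataclass
class Node:
    """A decision node tests one attribute; a leaf has no attribute set."""
    attribute: Optional[str] = None
    children: Dict[str, "Node"] = field(default_factory=dict)  # value -> subtree
    label: Optional[str] = None  # class predicted at a leaf; fallback elsewhere


def predict(node: Node, row: Row) -> Optional[str]:
    """Follow the branch matching each tested attribute until a leaf is reached."""
    while node.attribute is not None:
        child = node.children.get(row.get(node.attribute))
        if child is None:  # value with no branch: fall back to this node's label
            return node.label
        node = child
    return node.label


def size(node: Node) -> int:
    """Count every node in the tree, decision nodes and leaves alike."""
    return 1 + sum(size(child) for child in node.children.values())


def fitness(tree: Node, data: Sequence[Tuple[Row, str]],
            size_weight: float = 0.01) -> float:
    """Assumed weighted sum: accuracy minus a penalty proportional to tree size."""
    correct = sum(predict(tree, row) == cls for row, cls in data)
    return correct / len(data) - size_weight * size(tree)


# Toy data (made up for illustration): one attribute, two classes.
data = [
    ({"outlook": "sunny"}, "no"),
    ({"outlook": "rain"}, "yes"),
    ({"outlook": "overcast"}, "yes"),
    ({"outlook": "sunny"}, "yes"),
]

# A minimal tree with a single decision node, as in the initial population.
stump = Node(attribute="outlook", label="yes", children={
    "sunny": Node(label="no"),
    "rain": Node(label="yes"),
    "overcast": Node(label="yes"),
})
majority_leaf = Node(label="yes")  # a tree that always predicts "yes"

print(fitness(stump, data), fitness(majority_leaf, data))
```

On this toy data both trees classify three of the four examples correctly, so the size penalty alone decides the ranking and the single leaf scores higher. A real multi-objective scheme might instead use Pareto ranking or a different penalty; the weighted sum is simply the easiest variant to show.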
Publisher
NTNU
