Show simple item record

dc.contributor.advisorGulla, Jon Atle
dc.contributor.authorSelvig, Ole Christer Andre Asikainen
dc.date.accessioned2019-09-11T10:55:40Z
dc.date.created2018-07-09
dc.date.issued2018
dc.identifierntnudaim:17793
dc.identifier.urihttp://hdl.handle.net/11250/2615793
dc.description.abstractSeveral problematic data characteristics were revealed, such as multilingual reports, and significant class imbalances. While no consistent scheme for conduct-ing data preparation was found, several techniques were frequently reiterated in the most promising experiments. For the three classifiers tested (Naive Bayes, Support Vector Machines, and Random Forest), Support Vector Machines was the overall best choice, being the only classifier to generalize well beyond observed data. The various re-sampling techniques decreased the overall performance, which seems to indicate that more noise was generated instead.en
dc.languageeng
dc.publisherNTNU
dc.subjectInformatikk, Kunstig intelligensen
dc.titleClassification of Maintenance Reports - Statistical NLP meets the Oil & Gas Industryen
dc.typeMaster thesisen
dc.source.pagenumber107
dc.contributor.departmentNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi og elektroteknikk,Institutt for datateknologi og informatikknb_NO
dc.date.embargoenddate10000-01-01


Files in this item

Thumbnail
Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record