Vis enkel innførsel

dc.contributor.advisorRue, Håvard
dc.contributor.advisorHesse, Dirk
dc.contributor.authorKvamme, Håvard
dc.date.accessioned2016-03-30T14:01:01Z
dc.date.available2016-03-30T14:01:01Z
dc.date.created2015-12-21
dc.date.issued2015
dc.identifierntnudaim:14409
dc.identifier.urihttp://hdl.handle.net/11250/2383180
dc.description.abstractIn this thesis, methods for predicting the gender of Norwegian Twitter accounts were investigated. Through Twitterâ s public APIs, various account information is available. Tweets (text), personal descriptions, friends networks, and profile images were the main fields investigated. First separate classifiers were fitted to features from the different fields, and later the individual classifiersâ posterior probability estimates were combined to achieve increased accuracy. The datasets were labeled though comparison of the accountsâ names and names in the Norwegian population. Subsets of accounts with very gender specific names were used for training and testing. The highest balanced accuracy obtained was around 0.89. This, however, required access to the accountsâ profile images (85% of the data). Without images, the accuracy dropped to around 0.85.
dc.languageeng
dc.publisherNTNU
dc.subjectFysikk og matematikk, Industriell matematikk
dc.titleGender prediction on Norwegian Twitter accounts
dc.typeMaster thesis
dc.source.pagenumber119


Tilhørende fil(er)

Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel