Vis enkel innførsel

dc.contributor.advisorGambäck, Björn
dc.contributor.authorGiæver, Ingrid Nelson
dc.date.accessioned2018-11-19T15:01:04Z
dc.date.available2018-11-19T15:01:04Z
dc.date.created2018-06-11
dc.date.issued2018
dc.identifierntnudaim:19396
dc.identifier.urihttp://hdl.handle.net/11250/2573608
dc.description.abstractFor the purpose of this study, 7096 users and 10.7M tweets were collected from Twitter and manually annotated. The data set included users taking part in pro-ED communities and users whose tweets were either recovery-oriented or unrelated to eating disorders. Analysis of the data set revealed differentiating characteristics in the users tweets and profile information, with respect to emoji use, presence of URLs and user mentions, and references to eating disorders and related topics. Based on the established differences, groups of features, such as tweet n-grams and emojis, were extracted and used to train a series of supervised classifiers. Four machine learning models were explored; a Support Vector Machine, a Naïve Bayes model, a Logistic Regression model and a Random Forest. The highest F1-score (0.98) was achieved both when using an SVM and when using an ensemble approach trained on weighted feature groups with emphasis on unigrams from tweets.
dc.languageeng
dc.publisherNTNU
dc.subjectDatateknologi, Databaser og søk
dc.titleClassification of Pro-Eating Disorder Users on Twitter
dc.typeMaster thesis


Tilhørende fil(er)

Thumbnail
Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel