Tag Prediction in Social Media - Predicting Image Tags with Computer Vision and Word Embedding
Master thesis
Permanent lenke
http://hdl.handle.net/11250/2571692Utgivelsesdato
2018Metadata
Vis full innførselSamlinger
Sammendrag
Social Media produces vast amounts of user-generated content (UGC) every second, andimages are increasingly part of enriching this content. The need for effective ways to organizeand categorize content is bigger than ever. The proliferation of Big Data also offernew opportunities in regards to utilizing UGC in recommender systems. Considering thenoisy and unstructured nature of user-generated text however, extracting valuable knowledgefrom it is not an easy task. Therefore, this thesis looks in the direction of images.With the goal to extract some usable knowledge from these Social Media images, thisthesis proposes a novel approach to predicting the tags and content of an image fromSocial Media with the help of deep convolutional neural networks (deep CNNs) and wordembedding models.A pre-trained model for computer vision is used to classify an image and extract predictionsof its most likely content, and then evaluated against the image s tags to discoverthe model s tag prediction ability. Each of the predictions are used to produce similar syntacticand semantic information from a word embedding model. Using this aggregated information,the model s prediction ability is re-evaluated and performances are compared.In addition, the predictions are studied qualitatively to understand their degree of relevance.The model is evaluated on a subset of the MIRFLICKR25000 data set, which consistsof 25000 images under the Creative Commons licence gathered from the Social Mediaplatform Flickr. Although image auto-tagging is thoroughly researched, the task of tagprediction from images using computer vision and word embedding in this way is notdone previously. The evaluation of this model on the data subset shows that comparableaccuracy to state-of-the-art is achieved. Although they are not groundbreaking in terms ofaccuracy, results show a significant increase when expanding queries using a word embeddingmodel.