Vis enkel innførsel

dc.contributor.authorÖzdikis, Özer
dc.contributor.authorRamampiaro, Heri
dc.contributor.authorNørvåg, Kjetil
dc.date.accessioned2019-11-04T08:44:12Z
dc.date.available2019-11-04T08:44:12Z
dc.date.created2019-06-06T13:44:36Z
dc.date.issued2019
dc.identifier.citationInformation Processing & Management. 2019, 56 (4), 1280-1299.nb_NO
dc.identifier.issn0306-4573
dc.identifier.urihttp://hdl.handle.net/11250/2626278
dc.description.abstractWhile geographical metadata referring to the originating locations of tweets provides valuable information to perform effective spatial analysis in social networks, scarcity of such geotagged tweets imposes limitations on their usability. In this work, we propose a content-based location prediction method for tweets by analyzing the geographical distribution of tweet texts using Kernel Density Estimation (KDE). The primary novelty of our work is to determine different settings of kernel functions for every term in tweets based on the location indicativeness of these terms. Our proposed method, which we call locality-adapted KDE, uses information-theoretic metrics and does not require any parameter tuning for these settings. As a further enhancement on the term-level distribution model, we describe an analysis of spatial point patterns in tweet texts in order to identify bigrams that exhibit significant deviation from the underlying unigram patterns. We present an expansion of feature space using the selected bigrams and show that it eventually yields further improvement in prediction accuracy of our locality-adapted KDE. We demonstrate that our expansion results in a limited increase in the size of feature space and it does not hinder online localization of tweets. The methods we propose rely purely on statistical approaches without requiring any language-specific setting. Experiments conducted on three tweet sets from different countries show that our proposed solution outperforms existing state-of-the-art techniques, yielding significantly more accurate predictions.nb_NO
dc.language.isoengnb_NO
dc.publisherElseviernb_NO
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/deed.no*
dc.titleLocality-adapted kernel densities of term co-occurrences for location prediction of tweetsnb_NO
dc.typeJournal articlenb_NO
dc.typePeer reviewednb_NO
dc.description.versionacceptedVersionnb_NO
dc.source.pagenumber1280-1299nb_NO
dc.source.volume56nb_NO
dc.source.journalInformation Processing & Managementnb_NO
dc.source.issue4nb_NO
dc.identifier.doi10.1016/j.ipm.2019.02.013
dc.identifier.cristin1703185
dc.description.localcode© 2019. This is the authors’ accepted and refereed manuscript to the article. Locked until 22.3.2021 due to copyright restrictions. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/nb_NO
cristin.unitcode194,63,10,0
cristin.unitnameInstitutt for datateknologi og informatikk
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode2


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal