Vis enkel innførsel

dc.contributor.authorØystein, Repp
dc.contributor.authorRamampiaro, Heri
dc.date.accessioned2019-02-13T13:54:20Z
dc.date.available2019-02-13T13:54:20Z
dc.date.created2018-06-19T09:10:12Z
dc.date.issued2018
dc.identifier.citationJournal of Statistics & Management Systems. 2018, 21 (4), 695-723.nb_NO
dc.identifier.issn2169-0014
dc.identifier.urihttp://hdl.handle.net/11250/2585271
dc.description.abstractTwitter stream has become a large source of information, but the magnitude of tweets posted and the noisy nature of its content makes harvesting of knowledge from Twitter has challenged researchers for long time. Aiming at overcoming some of the main challenges of extracting hidden information from tweet streams, this work proposes a new approach for real-time detection of news events from the Twitter stream. We divide our approach into three steps. The first step is to use a neural network or deep learning to detect news-relevant tweets from the stream. The second step is to apply a novel streaming data clustering algorithm to the detected news tweets to form news events. The third and final step is to rank the detected events based on the size of the event clusters and growth speed of the tweet frequencies. We evaluate the proposed system on a large, publicly available corpus of annotated news events from Twitter. As part of the evaluation, we compare our approach with a related state-of-theart solution. Overall, our experiments and user-based evaluation show that our approach on detecting current (real) news events delivers a state-of-the-art performance.nb_NO
dc.language.isoengnb_NO
dc.publisherTaylor & Francisnb_NO
dc.relation.urihttp://www.idi.ntnu.no/~heri/papers/ReppRamampiaro2018.pdf
dc.subjectDatagruvedriftnb_NO
dc.subjectDataminingnb_NO
dc.subjectMaskinlæringnb_NO
dc.subjectMachine learningnb_NO
dc.titleExtracting news events from microblogsnb_NO
dc.typeJournal articlenb_NO
dc.typePeer reviewednb_NO
dc.description.versionacceptedVersionnb_NO
dc.subject.nsiVDP::Datateknologi: 551nb_NO
dc.subject.nsiVDP::Computer technology: 551nb_NO
dc.source.pagenumber695-723nb_NO
dc.source.volume21nb_NO
dc.source.journalJournal of Statistics & Management Systemsnb_NO
dc.source.issue4nb_NO
dc.identifier.doi10.1080/09720510.2018.1486273
dc.identifier.cristin1592102
dc.relation.projectNorges teknisk-naturvitenskapelige universitet: 548172nb_NO
dc.description.localcodeThis is an Accepted of an article published by Taylor & Francis in Journal of Statistics & Management Systems on 19.06.2018 available at https://doi.org/10.1080/09720510.2018.1486273 Locked until 19.06.2019 due to copyright restrictions.nb_NO
cristin.unitcode194,63,10,0
cristin.unitnameInstitutt for datateknologi og informatikk
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel