Show simple item record

dc.contributor.authorDam, Thu-Lan
dc.contributor.authorChester, Sean
dc.contributor.authorNørvåg, Kjetil
dc.contributor.authorDuong, Quang-Huy
dc.date.accessioned2020-12-14T12:14:27Z
dc.date.available2020-12-14T12:14:27Z
dc.date.created2020-12-10T16:52:27Z
dc.date.issued2021
dc.identifier.citationInformation Systems. 2021, 97 .en_US
dc.identifier.issn0306-4379
dc.identifier.urihttps://hdl.handle.net/11250/2719165
dc.description.abstractMassive amounts of data with spatio-temporal-textual information are being generated due to the proliferation of GPS-equipped mobile devices. Much of this data are social media posts, often used to share and spread personal updates and news. Exploring valuable information from a dynamic collection of social posts is of great interest and has attracted many studies. However, because the size of data is huge, the existing methods mostly work with the time window model where the old data is discarded. In this work, we introduce the task of efficiently discovering the top-k most popular terms within a user specified bounded region over a stream of social posts, where the recent posts are more important than the old ones. To make this feasible, we propose a hybrid index structure and algorithms to efficiently answer such top-k queries. Our index employs a spatial index augmented by top-k time-weighted term lists and a bulk updating technique to support fast digestion of social post streams. Further, these top-k term lists are employed in the aggregation step to produce the final results so that incoming queries can be efficiently processed. An extensive experimental study with a large collection of social posts shows that the proposed methods are capable of both online aggregation and accurate query processing.en_US
dc.language.isoengen_US
dc.publisherElsevieren_US
dc.titleEfficient top-k recently-frequent term querying over spatio-temporal textual streamsen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionacceptedVersionen_US
dc.source.pagenumber14en_US
dc.source.volume97en_US
dc.source.journalInformation Systemsen_US
dc.identifier.doihttps://doi.org/10.1016/j.is.2020.101687
dc.identifier.cristin1858472
dc.description.localcode© 2020. This is the authors’ accepted and refereed manuscript to the article. Locked until 5.12.2022 due to copyright restrictions.en_US
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode2


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record