dc.contributor.author | Duong, Quang-Huy | |
dc.contributor.author | Ramampiaro, Heri | |
dc.contributor.author | Nørvåg, Kjetil | |
dc.date.accessioned | 2019-11-18T09:11:00Z | |
dc.date.available | 2019-11-18T09:11:00Z | |
dc.date.created | 2019-11-14T10:15:38Z | |
dc.date.issued | 2019 | |
dc.identifier.isbn | 978-1-4503-6976-3 | |
dc.identifier.uri | http://hdl.handle.net/11250/2628898 | |
dc.description.abstract | We propose a novel sketching approach for streaming data that, even with limited computing resources, enables efficient processing of high-volume, high-velocity data. Our approach accounts for the fact that a stream of data is generally dynamic, with the underlying distribution possibly changing continuously. Specifically, we propose a hashing (sketching) technique that is able to automatically estimate a histogram from a stream of data by using a model with adaptive coefficients. Such a model is necessary to preserve histogram similarities, following the varying weight/importance of the generated histograms. To address the dynamic properties of data streams, we develop a novel algorithm that can sketch the histograms from a data stream using multiple weighted factors. The results from our extensive experiments on both synthetic and real-world datasets show the effectiveness and efficiency of the proposed method. | nb_NO |
dc.language.iso | eng | nb_NO |
dc.publisher | ACM Publications | nb_NO |
dc.relation.ispartof | CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management | |
dc.relation.uri | https://www.ntnu.edu/idi/mused | |
dc.subject | Data mining | nb_NO |
dc.title | Sketching Streaming Histogram Elements using Multiple Weighted Factors | nb_NO |
dc.type | Chapter | nb_NO |
dc.description.version | acceptedVersion | nb_NO |
dc.subject.nsi | VDP::Informasjons- og kommunikasjonsvitenskap: 420 | nb_NO |
dc.subject.nsi | VDP::Information and communication science: 420 | nb_NO |
dc.source.pagenumber | 19-28 | nb_NO |
dc.identifier.cristin | 1747420 | |
dc.description.localcode | This chapter will not be available due to copyright restrictions (c) 2019 by ACM Publications | nb_NO |
cristin.unitcode | 194,63,10,0 | |
cristin.unitname | Institutt for datateknologi og informatikk | |
cristin.ispublished | true | |
cristin.fulltext | postprint | |
cristin.qualitycode | 1 | |