Vis enkel innførsel

dc.contributor.authorDam, Thu-Lan
dc.contributor.authorRamampiaro, Heri
dc.contributor.authorNørvåg, Kjetil
dc.contributor.authorDuong, Quang-Huy
dc.date.accessioned2019-05-21T05:13:52Z
dc.date.available2019-05-21T05:13:52Z
dc.date.created2018-12-07T15:11:49Z
dc.date.issued2018
dc.identifier.citationKnowledge-Based Systems. 2019, 165 13-29.nb_NO
dc.identifier.issn0950-7051
dc.identifier.urihttp://hdl.handle.net/11250/2598125
dc.description.abstractThe set of closed high-utility itemsets (CHUIs) concisely represents the exact utility of all itemsets. Yet, it can be several orders of magnitude smaller than the set of all high-utility itemsets. Existing CHUI mining algorithms assume that databases are static, making them very expensive in the case of incremental data, since the whole dataset has to be processed for each batch of new transactions. To address this challenge, this paper presents the first approach, called IncCHUI, that mines CHUIs efficiently from incremental databases. In order to achieve this, we propose an incremental utility-list structure, which is built and updated with only one database scan. Further, we apply effective pruning strategies to fast construct incremental utility-lists and eliminate candidates that are not updated. Finally, we suggest an efficient hash-based approach to update or insert new closed sets that are found. Our extensive experimental evaluation on both real-life and synthetic databases shows the efficiency, as well as the feasibility of our approach. It significantly outperforms previously proposed methods that are mainly run in batch mode in terms of speed, and it is scalable with respect to the number of transactions.nb_NO
dc.language.isoengnb_NO
dc.publisherElseviernb_NO
dc.relation.urihttp://hdl.handle.net/11250/2576916
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/deed.no*
dc.titleTowards efficiently mining closed high utility itemsets from incremental databasesnb_NO
dc.typeJournal articlenb_NO
dc.typePeer reviewednb_NO
dc.description.versionacceptedVersionnb_NO
dc.source.pagenumber13-29nb_NO
dc.source.volume165nb_NO
dc.source.journalKnowledge-Based Systemsnb_NO
dc.identifier.doi10.1016/j.knosys.2018.11.019
dc.identifier.cristin1640457
dc.relation.projectNorges forskningsråd: 90002403nb_NO
dc.relation.projectNorges teknisk-naturvitenskapelige universitet: 70440079nb_NO
dc.description.localcode© 2018. This is the authors’ accepted and refereed manuscript to the article. Locked until 16.11.2020 due to copyright restrictions. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/nb_NO
cristin.unitcode194,63,10,0
cristin.unitnameInstitutt for datateknologi og informatikk
cristin.ispublishedtrue
cristin.fulltextpreprint
cristin.fulltextpostprint
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal