Vis enkel innførsel

dc.contributor.authorDam, Thu-Lan
dc.contributor.authorLi, Kenli
dc.contributor.authorFournier-Viger, Philippe
dc.contributor.authorDuong, Quang-Huy
dc.date.accessioned2019-04-12T06:40:11Z
dc.date.available2019-04-12T06:40:11Z
dc.date.created2018-11-06T21:14:04Z
dc.date.issued2018
dc.identifier.citationFrontiers of Computer Science. 2018, 1-25.nb_NO
dc.identifier.issn2095-2228
dc.identifier.urihttp://hdl.handle.net/11250/2594338
dc.description.abstractHigh-utility itemset mining (HUIM) is a popular data mining task with applications in numerous domains. However, traditional HUIM algorithms often produce a very large set of high-utility itemsets (HUIs). As a result, analyzing HUIs can be very time consuming for users. Moreover, a large set of HUIs also makes HUIM algorithms less efficient in terms of execution time and memory consumption. To address this problem, closed high-utility itemsets (CHUIs), concise and lossless representations of all HUIs, were proposed recently. Although mining CHUIs is useful and desirable, it remains a computationally expensive task. This is because current algorithms often generate a huge number of candidate itemsets and are unable to prune the search space effectively. In this paper, we address these issues by proposing a novel algorithm called CLS-Miner. The proposed algorithm utilizes the utility-list structure to directly compute the utilities of itemsets without producing candidates. It also introduces three novel strategies to reduce the search space, namely chain-estimated utility co-occurrence pruning, lower branch pruning, and pruning by coverage. Moreover, an effective method for checking whether an itemset is a subset of another itemset is introduced to further reduce the time required for discovering CHUIs. To evaluate the performance of the proposed algorithm and its novel strategies, extensive experiments have been conducted on six benchmark datasets having various characteristics. Results show that the proposed strategies are highly efficient and effective, that the proposed CLS-Miner algorithm outperforms the current state-of-the-art CHUD and CHUI-Miner algorithms, and that CLS-Miner scales linearly.nb_NO
dc.language.isoengnb_NO
dc.publisherSpringernb_NO
dc.relation.urihttps://www.researchgate.net/publication/324075037_CLS-Miner_efficient_and_effective_closed_high-utility_itemset_mining
dc.titleCLS-Miner: efficient and effective closed high-utility itemset miningnb_NO
dc.typeJournal articlenb_NO
dc.typePeer reviewednb_NO
dc.description.versionacceptedVersionnb_NO
dc.source.pagenumber1-25nb_NO
dc.source.journalFrontiers of Computer Sciencenb_NO
dc.identifier.doi10.1007/s11704-016-6245-4
dc.identifier.cristin1627743
dc.description.localcodeThis is a post-peer-review, pre-copyedit version of an article published in Frontiers of Computer Science. Locked until 11 April 2020 due to copyright restrictions. The final authenticated version is available online at: https://doi.org/10.1007/s11704-016-6245-4nb_NO
cristin.unitcode194,63,10,0
cristin.unitnameInstitutt for datateknologi og informatikk
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel