dc.contributor.author | Long, Zhong-Zhen | |
dc.contributor.author | Xu, Guoxia | |
dc.contributor.author | Du, Jiao | |
dc.contributor.author | Zhu, Hu | |
dc.contributor.author | Yan, Taiyu | |
dc.contributor.author | Yu, Yu-Feng | |
dc.date.accessioned | 2021-03-26T12:46:53Z | |
dc.date.available | 2021-03-26T12:46:53Z | |
dc.date.created | 2021-03-23T14:34:09Z | |
dc.date.issued | 2021 | |
dc.identifier.citation | Big Data Research. 2021, 23 100170-?. | en_US |
dc.identifier.issn | 2214-5796 | |
dc.identifier.uri | https://hdl.handle.net/11250/2735753 | |
dc.description.abstract | Regarding as an important computing paradigm, cloud computing is to address big and distributed databases and rather simple computation. In this paradigm, data mining is one of the most important and fundamental problems. A large amount of data is generated by sensors and other intelligent devices. Data mining for these big data is crucial in various applications. K-means clustering is a typical technique to group the similar data into the same clustering, and has been commonly used in data mining. However, it is still a challenge to the data containing a large amount of noise, outliers and redundant features. In this paper, we propose a robust K-means clustering algorithm, namely, flexible subspace clustering. The proposed method incorporates feature selection and K-means clustering into a unified framework, which can select the refined features and improve the clustering performance. Moreover, for the purpose of enhancing the robustness, the -norm is embedded into the objective function. We can flexibly choose appropriate p according to the different data and thus obtain more robust performance. Experimental results verify the presented method has more robust and better performance on benchmark databases compared to the existing approaches. | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Elsevier | en_US |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internasjonal | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/deed.no | * |
dc.title | Flexible Subspace Clustering: A Joint Feature Selection and K-Means Clustering Framework | en_US |
dc.type | Peer reviewed | en_US |
dc.type | Journal article | en_US |
dc.description.version | acceptedVersion | en_US |
dc.source.pagenumber | 100170-? | en_US |
dc.source.volume | 23 | en_US |
dc.source.journal | Big Data Research | en_US |
dc.identifier.doi | 10.1016/j.bdr.2020.100170 | |
dc.identifier.cristin | 1900284 | |
dc.description.localcode | "© 2020. This is the authors’ accepted and refereed manuscript to the article. Locked until 12.11.2022 due to copyright restrictions. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/ " | en_US |
cristin.ispublished | true | |
cristin.fulltext | postprint | |
cristin.qualitycode | 1 | |