Vis enkel innførsel

dc.contributor.authorYang, Zhirong
dc.contributor.authorChen, Yuwei
dc.contributor.authorSedov, Denis
dc.contributor.authorKaski, Samuel
dc.contributor.authorCorander, Jukka
dc.date.accessioned2023-02-14T12:23:34Z
dc.date.available2023-02-14T12:23:34Z
dc.date.created2022-12-11T10:02:17Z
dc.date.issued2023
dc.identifier.citationStatistics and computing. 2023, 33 (12), .en_US
dc.identifier.issn0960-3174
dc.identifier.urihttps://hdl.handle.net/11250/3050708
dc.description.abstractNeighbor embedding (NE) aims to preserve pairwise similarities between data items and has been shown to yield an effective principle for data visualization. However, even the best existing NE methods such as stochastic neighbor embedding (SNE) may leave large-scale patterns hidden, for example clusters, despite strong signals being present in the data. To address this, we propose a new cluster visualization method based on the Neighbor Embedding principle. We first present a family of Neighbor Embedding methods that generalizes SNE by using non-normalized Kullback–Leibler divergence with a scale parameter. In this family, much better cluster visualizations often appear with a parameter value different from the one corresponding to SNE. We also develop an efficient software that employs asynchronous stochastic block coordinate descent to optimize the new family of objective functions. Our experimental results demonstrate that the method consistently and substantially improves the visualization of data clusters compared with the state-of-the-art NE approaches. The code of our method is publicly available at https://github.com/rozyangno/sce.en_US
dc.language.isoengen_US
dc.publisherSpringeren_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleStochastic Cluster Embeddingen_US
dc.title.alternativeStochastic Cluster Embeddingen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.source.pagenumber14en_US
dc.source.volume33en_US
dc.source.journalStatistics and computingen_US
dc.source.issue12en_US
dc.identifier.doi10.1007/s11222-022-10186-z
dc.identifier.cristin2091532
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode2


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal