Unsupervised Clustering of Hyperspectral Paper Data Using t-SNE

Melit Devassy, Binu; George, Sony; Nussbaum, Peter

Peer reviewed, Journal article

Published version

Open

Devassy.pdf (3.031Mb)

Permanent link

https://hdl.handle.net/11250/2654822

Publication date

2020

Abstract

For a suspected forgery that involves the falsification of a document or its contents, the investigator will primarily analyze the document’s paper and ink in order to establish the authenticity of the subject under investigation. As a non-destructive and contactless technique, Hyperspectral Imaging (HSI) is gaining popularity in the field of forensic document analysis. HSI captures more information than conventional three-channel imaging systems because it records a large number of narrowband images across the electromagnetic spectrum. As a result, HSI can provide better classification results. In this publication, we present results of an approach based on the t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm, which we have applied to HSI paper data analysis. Even though t-SNE has been widely accepted as a method for dimensionality reduction and visualization of high-dimensional data, its usefulness has not yet been evaluated for the classification of paper data. In this research, we present a hyperspectral dataset of paper samples and evaluate the clustering quality of the proposed method both visually and quantitatively. The t-SNE algorithm shows exceptional discrimination power when compared to traditional PCA with k-means clustering, in both visual and quantitative evaluations.
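
The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the spectra are synthetic, and the 400 to 1000 nm range, the number of bands, the perplexity, and the choice of the adjusted Rand index as the quantitative score are all assumptions made for this example. The sketch also assumes scikit-learn is available and, so that both embeddings share the same scoring step, applies k-means to the t-SNE output as well as to the PCA output.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE
from sklearn.metrics import adjusted_rand_score


def make_synthetic_spectra(n_classes=3, n_per_class=40, n_bands=60, seed=0):
    """Build toy reflectance spectra; a stand-in for real HSI paper data."""
    rng = np.random.default_rng(seed)
    # Assumed spectral range in nanometres (hypothetical, for illustration).
    wavelengths = np.linspace(400.0, 1000.0, n_bands)
    spectra, labels = [], []
    for c in range(n_classes):
        # Each hypothetical paper type gets a bump at a different wavelength.
        center = 550.0 + 150.0 * c
        base = 0.6 + 0.2 * np.exp(-((wavelengths - center) / 80.0) ** 2)
        noise = rng.normal(0.0, 0.01, size=(n_per_class, n_bands))
        spectra.append(base + noise)
        labels.append(np.full(n_per_class, c))
    return np.vstack(spectra), np.concatenate(labels)


def cluster_embedding(embedding, n_clusters, seed=0):
    """Run k-means on a 2-D embedding and return the cluster labels."""
    model = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    return model.fit_predict(embedding)


def compare_pca_and_tsne(X, y, n_clusters=3, seed=0):
    """Embed X with PCA and with t-SNE, cluster each, and score against y."""
    pca_2d = PCA(n_components=2, random_state=seed).fit_transform(X)
    tsne_2d = TSNE(
        n_components=2, perplexity=20.0, init="pca", random_state=seed
    ).fit_transform(X)
    results = {}
    for name, embedding in (("pca", pca_2d), ("tsne", tsne_2d)):
        predicted = cluster_embedding(embedding, n_clusters, seed)
        results[name] = (embedding, adjusted_rand_score(y, predicted))
    return results


X, y = make_synthetic_spectra()
results = compare_pca_and_tsne(X, y)
for name, (embedding, ari) in results.items():
    print(f"{name}: embedding shape {embedding.shape}, ARI {ari:.3f}")
```

On real data, each row of `X` would be the spectrum of one pixel or one averaged region of a paper sample, and the 2-D embeddings could be plotted for the visual evaluation. Scores from this toy data say nothing about the results reported in the article.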

Publisher

MDPI

Journal

Journal of Imaging

Unless otherwise stated, this entry is licensed under Attribution 4.0 International