Comparison of Principal Component Analysis and Spectral Angle Mapping for Identification of Materials in Terahertz Transmission Measurements
MetadataVis full innførsel
The terahertz range of the electromagnetic spectrum ranges from 0.1 to 10 THz,and has some unique properties which make it interesting for security applications.The identification of a range of dangerous substances is possible using THz radiation,because many of these materials feature characteristic absorption lines in thisregime. Another property is the ability to penetrate common sealing materials,such as paper, plastic and cloth, enabling the possibility for identification of concealedsubstances. This thesis compares two methods, namely principal component analysis (PCA)and spectral angle mapping (SAM), for identification of different materials actingas simulants for dangerous substances. PCA is a method which transforms a numberof correlated variables into a smaller number of uncorrelated variables, calledprincipal components. The original data is projected on to these, forming a newcoordinate system where the original data is expressed in an optimal way, usingmuch fewer dimensions. SAM is a spectral recognition technique, which calculatesthe dot product between an unknown spectrum, and a reference spectrum, bothtreated as vectors. Measurements on samples containing Tartaric acid, Lactose and RDX (an explosive)were carried out using Terahertz time-domain spectroscopy, and the spectralfingerprints were obtained, and used for training each algorithm. Two spectralcharacteristics were considered: The absorption spectrum itself, and its derivative,both investigated for two different window widths. Four terahertz images fortesting the algorithms were acquired, one using no barrier, and three using eitherpaper, plastic or a piece of cloth for covering the samples. Also tested was theability to recognize a material when its sample properties differ from those usedfor training the algorithms, by looking at four different Tartaric acid samples. Thealgorithms were implemented using MATLAB, and compared using ROC curves.The performance of PCA showed that careful consideration must be taken whenchoosing the number of principal components, and that the optimal number differsdepending on spectral characteristic. In general, very good results were obtained when appropriate windowing was applied,and the best overall performance resulted from applying the narrower window,both for PCA and SAM. A true positive rate above 0.9 with a false positive rate of less than 0.2 couldbe obtained, regardless of barrier, also in the case of Tartaric acid. For PCA, theseresults were obtained using the absorption spectrum, while for SAM, this was thecase regardless of spectral characteristic. The paper and plastic barriers were not challenging for either algorithm, and usingthese yielded essentially the same results as using no barrier in most cases. Therewere some differences in the performance of PCA and SAM, but these were small.The most challenging barrier was the cloth, for which classification using SAMwith the absorption spectrum was slightly better than PCA, but the advantagewas small.