Clustering Document Fragments using Background Color and Texture Information
Journal article, Peer reviewed
Permanent lenke
http://hdl.handle.net/11250/142560Utgivelsesdato
2012Metadata
Vis full innførselSamlinger
Originalversjon
Chanda, S., Franke, K. & Pal, U. (2012). Clustering Document Fragments using Background Color and Texture Information. Proceedings of SPIE, the International Society for Optical Engineering, 8297. http://dx.doi.org/10.1117/12.910567Sammendrag
Forensic analysis of questioned documents sometimes can be extensively data intensive. A forensic expert might
need to analyze a heap of document fragments and in such cases to ensure reliability he/she should focus only
on relevant evidences hidden in those document fragments. Relevant document retrieval needs finding of similar
document fragments. One notion of obtaining such similar documents could be by using document fragment’s
physical characteristics like color, texture, etc. In this article we propose an automatic scheme to retrieve
similar document fragments based on visual appearance of document paper and texture. Multispectral color
characteristics using biologically inspired color differentiation techniques are implemented here. This is done
by projecting document color characteristics to Lab color space. Gabor filter-based texture analysis is used to
identify document texture. It is desired that document fragments from same source will have similar color and
texture. For clustering similar document fragments of our test dataset we use a Self Organizing Map (SOM)
of dimension 5×5, where the document color and texture information are used as features. We obtained an
encouraging accuracy of 97.17% from 1063 test images.
Beskrivelse
This is the copy of journal's version originally published in Proc. SPIE 8297: http://dx.doi.org/10.1117/12.910567. Reprinted with permission of SPIE.