Classifying OCR Errors for Use in Retrieval Methods for Norwegian Text
Abstract
Analysing Norwegian documents processed with Optical Character Recognition. Using information gathered about errors, a retrieval system is tested to improve information retrieval despite OCR errors in the text. Corpus gathered from the National Library of Norway.