• norsk
    • English
  • English 
    • norsk
    • English
  • Login
View Item 
  •   Home
  • Fakultet for informasjonsteknologi og elektroteknikk (IE)
  • Institutt for datateknologi og informatikk
  • View Item
  •   Home
  • Fakultet for informasjonsteknologi og elektroteknikk (IE)
  • Institutt for datateknologi og informatikk
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

De-identification of Norwegian Health Record Notes: An Experimental Approach

Bjurstrøm, Roar; Singh, Jaspreet
Master thesis
Thumbnail
View/Open
658755_COVER01.pdf (184.4Kb)
658755_ATTACHMENT01.zip (50.65Mb)
658755_FULLTEXT01.pdf (3.859Mb)
URI
http://hdl.handle.net/11250/253488
Date
2013
Metadata
Show full item record
Collections
  • Institutt for datateknologi og informatikk [3779]
Abstract
The conversion of paper-based health records to electronic health records creates new opportunities within medical research, medical education and patient treatment. However, electronic health records have to be de-identified or anonymized before disclosure, in order to conduct ethically sound research. Manual de-identification is time-consuming and costly, and thus limits the amount of health records that can be disclosed for research purposes. The aim of this project was to develop an application for automatic or semi-automatic de-identification for Norwegian free text clinical notes. As no directly related studies have been performed on Norwegian clinical notes, our approach was highly experimental. We have employed different methods and techniques. These have been evaluated in different combinations to find the best match. The method combination which obtained the best evaluation results constitutes our final de-identification application. The application we have developed is based on pattern matching techniques and a simple statistical method. It produces de-identified output which is evaluated against a manually annotated reference standard consisting of 225 clinical notes. Our best system configurations recognized 77% of the total 3320 sensitive identifiers, with a precision of 68%. Most of the insensitive contents remained intact with a fallout of 5%.
Publisher
Institutt for datateknikk og informasjonsvitenskap

Contact Us | Send Feedback

Privacy policy
DSpace software copyright © 2002-2019  DuraSpace

Service from  Unit
 

 

Browse

ArchiveCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsDocument TypesJournalsThis CollectionBy Issue DateAuthorsTitlesSubjectsDocument TypesJournals

My Account

Login

Statistics

View Usage Statistics

Contact Us | Send Feedback

Privacy policy
DSpace software copyright © 2002-2019  DuraSpace

Service from  Unit