Vis enkel innførsel

dc.contributor.advisorRamampiaro, Herindrasananb_NO
dc.contributor.authorPaulsen, Jon Runenb_NO
dc.date.accessioned2014-12-19T13:31:40Z
dc.date.available2014-12-19T13:31:40Z
dc.date.created2010-09-03nb_NO
dc.date.issued2007nb_NO
dc.identifier347455nb_NO
dc.identifierntnudaim:1289nb_NO
dc.identifier.urihttp://hdl.handle.net/11250/250436
dc.description.abstractSearch engines for biological information are not a new technology. Since the 1960s computers have emerged as an important tool for biologists. Online Mendelian Inheritance in Man (OMIM) is a comprehensive catalogue containing approximately 14 000 records with information about human genes and genetic disorders. An approach called Latent Semantic Indexing (LSI) was introduced in 1990 that is based on Singular Value Decomposition (SVD). This approach improved the information retrieval and reduced the storage requirements. This thesis applies LSI on the collection of OMIM records. To further improve the retrieval effectiveness and efficiency, the author propose a clustering method based on the standard k-means algorithm, called Two step k-means. Both the standard k-means and the Two step k-means algorithms are tested and compared with each other.nb_NO
dc.languageengnb_NO
dc.publisherInstitutt for datateknikk og informasjonsvitenskapnb_NO
dc.subjectntnudaimno_NO
dc.subjectMIT informatikkno_NO
dc.subjectInformasjonsforvaltningno_NO
dc.titleOptimal Information Retrieval Model for Molecular Biology Informationnb_NO
dc.typeMaster thesisnb_NO
dc.source.pagenumber91nb_NO
dc.contributor.departmentNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskapnb_NO


Tilhørende fil(er)

Thumbnail
Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel