Show simple item record

dc.contributor.advisorRamampiaro, Herindrasananb_NO
dc.contributor.authorBlixhavn, Øystein Hoelnb_NO
dc.date.accessioned2014-12-19T13:42:19Z
dc.date.available2014-12-19T13:42:19Z
dc.date.created2014-12-07nb_NO
dc.date.issued2014nb_NO
dc.identifier769314nb_NO
dc.identifierntnudaim:12121nb_NO
dc.identifier.urihttp://hdl.handle.net/11250/254014
dc.description.abstractThis master thesis looks at how clustering techniques can be appliedto a collection of scientific documents. Approximately one year of serverlogs from the CERN Document Server (CDS) are analyzed and preprocessed.Based on the findings of this analysis, and a review of thecurrent state of the art, three different clustering methods are selectedfor further work: Simple k-Means, Hierarchical Agglomerative Clustering(HAC) and Graph Partitioning. In addition, a custom, agglomerativeclustering algorithm is made in an attempt to tackle some of the problemsencountered during the experiments with k-Means and HAC. The resultsfrom k-Means and HAC are poor, but the graph partitioning methodyields some promising results.The main conclusion of this thesis is that the inherent clusters withinthe user-record relationship of a scientific collection are nebulous, butexisting. Furthermore, the most common clustering algorithms are notsuitable for this type of clustering.nb_NO
dc.languageengnb_NO
dc.publisherInstitutt for datateknikk og informasjonsvitenskapnb_NO
dc.subjectntnudaim:12121no_NO
dc.subjectMTDT Datateknologino_NO
dc.subjectData- og informasjonsforvaltningno_NO
dc.titleClustering User Behavior in Scientific Collectionsnb_NO
dc.typeMaster thesisnb_NO
dc.source.pagenumber114nb_NO
dc.contributor.departmentNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskapnb_NO


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record