Vis enkel innførsel

dc.contributor.authorTomassen, Stein L.nb_NO
dc.date.accessioned2014-12-19T13:37:58Z
dc.date.available2014-12-19T13:37:58Z
dc.date.created2011-10-20nb_NO
dc.date.issued2011nb_NO
dc.identifier450422nb_NO
dc.identifier.isbn978-82-471-2625-7 (printed ver.)nb_NO
dc.identifier.isbn978-82-471-2626-4 (electronic ver.)nb_NO
dc.identifier.urihttp://hdl.handle.net/11250/252683
dc.description.abstractSearching for information on the Web can be frustrating. One of the reasons is the ambiguity of words. The work presented in this thesis concentrates on how the effectiveness of standard information retrieval systems can be enhanced with semantic technologies like ontologies. Ontologies are knowledge models that can represent knowledge of any universe of discourse by describing how concepts of a domain are related. Creating and maintaining ontologies can be tedious and costly. However, we focus on reusing ontologies, rather than engineering, and on their applicability to improve the retrieval effectiveness of existing search systems. The aim of this work is to find an effective approach for applying ontologies to existing search systems. The basic idea is that these ontologies can be used to tackle the problem of ambiguous words and hence improve the retrieval effectiveness. Our approach to semantic search builds on feature vectors (FV). The basic idea is to connect the (standardised) domain terminology encoded in an ontology to the actual terminology used in a text corpus. Therefore, we propose to associate every ontology entity (classes and individuals are called entities in this work) with a FV that is tailored to the actual terminology used in a text corpus like the Web. These FVs are created off-line and later used on-line to filter (i.e. to disambiguate search) and re-rank the search results from an underlying search system. This pragmatic approach is applicable to existing search systems since it only depends on extending the query and presentation components, in other words there is no need to alter either the indexing or the ranking components of the existing systems. A set of experiments have been carried out and the results report on improvement by more than 10%. Furthermore, we have shown that the approach is neither dependent on highly specific queries nor on a collection comprised only of relevant documents. In addition, we have shown that the FVs are relatively persistent, i.e. little maintenance of the FVs is required. In this work, we focus on the creation and evaluation of these feature vectors. As a result, a part of the contribution of this work is a framework for the construction of FVs. Furthermore, we have proposed a set of metrics to measure the quality of the created FVs. We have also provided a set of guidelines for optimal construction of feature vectors for different categories of ontologies.nb_NO
dc.languageengnb_NO
dc.publisherNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskapnb_NO
dc.relation.ispartofseriesDoktoravhandlinger ved NTNU, 1503-8181; 2011:51nb_NO
dc.relation.haspartTomassen, Stein L.; Strasunskas, Darijus. Construction of Ontology based Semantic-Linguistic Feature Vectors for Searching. PROCEEDINGS OF THE 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3: 133-138, 2009. 10.1109/WI-IAT.2009.248.nb_NO
dc.relation.haspartTomassen, S.L.; Strasunskas, D.. Semantic-Linguistic Feature Vectors for Search. The Semantic Web, LNCS 5926: 199-215, 2009. 10.1007/978-3-642-10871-6_14.nb_NO
dc.relation.haspartTomassen, S.L; Strasunskas, D.. Relating ontology and Web terminologies by feature vectors: unsupervised construction and experimental validation. Proceedings of the 11th Int. Conf. on Information Integration and Web-based Applications & Services: 86-93, 2009. 10.1145/1806338.1806362.nb_NO
dc.relation.haspartTomassen, S. L; Strasunskas, D.. An ontology-driven approach to Web search:analysis of its sensitivity to ontology quality and search tasks. Proceedings of the 11th International Conference onInformation Integration and Web-based Applications & Services, ACM., 2009. 10.1145/1806338.1806368.nb_NO
dc.relation.haspartLilleng, J.; Tomassen, S.L.. Cross-lingual information retrieval by feature vectors. Natural Language Processing and Information Systems, LNCS 4592: 229-239, 2007. 10.1007/978-3-540-73351-5_20.nb_NO
dc.titleConceptual Ontology Enrichment for Web Information Retrievalnb_NO
dc.typeDoctoral thesisnb_NO
dc.contributor.departmentNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskapnb_NO
dc.description.degreePhD i Informasjonsteknologinb_NO
dc.description.degreePhD in Information Technologyen_GB


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel