Semantic and Distributed Entity Search in the Web of Data

Neumayer, Robert

dc.contributor.author	Neumayer, Robert	nb_NO
dc.date.accessioned	2014-12-19T13:39:28Z
dc.date.available	2014-12-19T13:39:28Z
dc.date.created	2013-03-07	nb_NO
dc.date.issued	2013	nb_NO
dc.identifier	609768	nb_NO
dc.identifier.isbn	978-82-471-4208-0 (printed ver.)	nb_NO
dc.identifier.isbn	978-82-471-4209-7 (electronic ver.)	nb_NO
dc.identifier.uri	http://hdl.handle.net/11250/253111
dc.description.abstract	Both the growth and ubiquitious character of the Internet have had a profound effect on how we access and consume ata and information. More recently, the Semantic Web, an extension of the current Web has come increasingly relevant due to its widespread adoption. The Web of Data (WoD) is an extension of the current web, where not only documents are interlinked by means of hyperlinks but also data in terms of predicates. Specifically, it describes objects, entities or “things” in terms of their attributes and their relationships, using RDF data (and often is used equivalently to Linked Data). Given its growth, there is a strong need for making this wealth of knowledge accessible by keyword search (the de-facto standard paradigm for accessing information online). The overall goal of this thesis is to provide new techniques for accessing this data, i.e., to leverage its full potential to end users. We therefore address the following four main issues: a) how can the Web of Data be searched by means of keyword search?, b) what sets apart search in the WoD from traditional web search?, c) how can these elements be used in a theoretically sound and effective way?, and d) How can the techniques be adapted to a distributed environment? To this end, we develop techniques for effectively searching WoD sources. We build upon and formalise existing entity modelling approaches within a generative language modelling framework, and compare them experimentally using standard test collections. We show that these models outperform the current state-of-the-art in terms of retrieval effectiveness, however, this is done at the cost of abandoning a large part of the semantics behind the data. We propose a novel entity model capable of preserving the semantics associated with entities, without sacrificing retrieval effectiveness. We further show how these approaches can be applied in the distributed context, both with low (federated search) and high numbers (Peerto- peer or P2P) of independent repositories, collections, or nodes. The main contributions are as follows: • We develop a hybrid approach to search in the Web of Data, using elements from traditional information retrieval and structured retrieval alike. • We formalise our approaches in a language model setting. • Our extensions are successfully evaluated with respect to their applicability in different distributed environments such as federated search and P2P. • We discuss and analyse based on our empirical evaluation and provide insights into the entity search problem.	nb_NO
dc.language	eng	nb_NO
dc.publisher	Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskap	nb_NO
dc.relation.ispartofseries	Doktoravhandlinger ved NTNU, 1503-8181; 2013:56	nb_NO
dc.title	Semantic and Distributed Entity Search in the Web of Data	nb_NO
dc.type	Doctoral thesis	nb_NO
dc.contributor.department	Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskap	nb_NO
dc.description.degree	PhD i informasjonsteknologi	nb_NO
dc.description.degree	PhD in Information Technology	en_GB

Tilhørende fil(er)

Filnavn:: 609768_FULLTEXT01.pdf
Størrelse:: 1.396Mb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Institutt for datateknologi og informatikk [6544]

Vis enkel innførsel