Semantic Search with Knowledge Bases

Hasibi, Faegheh

dc.contributor.advisor	Bratsberg, Svein Erik
dc.contributor.advisor	Balog, Krisztian
dc.contributor.advisor	Torbjørnsen, Øystein
dc.contributor.author	Hasibi, Faegheh
dc.date.accessioned	2018-03-28T10:11:41Z
dc.date.available	2018-03-28T10:11:41Z
dc.date.issued	2018
dc.identifier.isbn	978-82-326-2967-1
dc.identifier.issn	1503-8181
dc.identifier.uri	http://hdl.handle.net/11250/2492307
dc.description.abstract	Over the past decade, modern search engines have made significant progress towards better understanding searchers' intents and providing them with more focused answers, a paradigm that is called \semantic search." Semantic search is a broad area that encompasses a variety of tasks and has a core enabling data component, called the knowledge base. In this thesis, we utilize knowledge bases to address three tasks involved in semantic search: (i) query understanding, (ii) entity retrieval, and (iii) entity summarization. Query understanding is the first step in virtually every semantic search system. We study the problem of identifying entity mentions in queries and linking them to the corresponding entries in a knowledge base. We formulate this as the task of entity linking in queries, propose refinements to evaluation measures, and publish a test collection for training and evaluation purposes. We further establish a baseline method for this task through a reproducibility study, and introduce different methods with the aim to strike a balance between efficiency and effectiveness. Next, we turn to using the obtained annotations for answering the queries. Here, our focus is on the entity retrieval task: answering search queries by returning a ranked list of entities. We introduce a general feature-based model based on Markov Random Fields, and show improvements over existing baseline methods. We find that the largest gains are achieved for complex natural language queries. Having generated an answer to the query (from the entity retrieval step), we move on to presentation aspects of the results. We introduce and address the novel problem of dynamic entity summarization for entity cards, by breaking it into two subtasks, fact ranking and summary generation. We perform an extensive evaluation of our method using crowdsourcing, and show that our supervised fact ranking method brings substantial improvements over the most comparable baselines. In this thesis, we take the reproducibility of our research very seriously. Therefore, all resources developed within the course of this work are made publicly available. We further make two major software and resource contributions: (i) the Nordlys toolkit, which implements a range of methods for semantic search, and (ii) the extended DBpedia-Entity test collection.	nb_NO
dc.language.iso	eng	nb_NO
dc.publisher	NTNU	nb_NO
dc.relation.ispartofseries	Doctoral theses at NTNU;2018:88
dc.title	Semantic Search with Knowledge Bases	nb_NO
dc.type	Doctoral thesis	nb_NO
dc.subject.nsi	VDP::Technology: 500::Information and communication technology: 550::Computer technology: 551	nb_NO

Tilhørende fil(er)

Filnavn:: Faegheh Hasibi_PhD.pdf
Størrelse:: 3.136Mb
Format:: PDF
Beskrivelse:: Fulltext (PDF) available

Åpne

Denne innførselen finnes i følgende samling(er)

Institutt for datateknologi og informatikk [6552]

Vis enkel innførsel