Vis enkel innførsel

dc.contributor.advisorNørvåg, Kjetilnb_NO
dc.contributor.advisorGulla, Jon Atlenb_NO
dc.contributor.advisorTomassen, Stein L.nb_NO
dc.contributor.authorBøhn, Christiannb_NO
dc.date.accessioned2014-12-19T13:32:23Z
dc.date.available2014-12-19T13:32:23Z
dc.date.created2010-09-03nb_NO
dc.date.issued2008nb_NO
dc.identifier347646nb_NO
dc.identifierntnudaim:4290nb_NO
dc.identifier.urihttp://hdl.handle.net/11250/250696
dc.description.abstractIn news articles the focus on named entities is quite common and usually a news case is tied around a person, a company, or similar. One challenge from an information retrieval point of view is that one entity often have more than one way of referring to it. This means that when users use news search engines they have to use the exact same name for the entity as the articles they are interested in use. Therefore the usage of synonyms to refer to the same entity forms the basis of this thesis. We explore the idea of using Wikipedia as a data source for building a large dictionary of named entities and their synonyms. An entity dictionary like that would be very interesting because it make it possible to link synonyms to the same entity. The evaluation shows that Wikipedia is well suited as a source of named entities and synonyms as the semi-structure aids in recognizing the entities and related synonyms. The use of the dictionary in a modified search solution shows on the other hand mixed results. On problem with evaluating a solution like this is that the precision of the different synonyms is usually very high for popular entities, and when we combine different synonyms in the same query we end up giving more weight to the results that use multiple synonyms.nb_NO
dc.languageengnb_NO
dc.publisherInstitutt for datateknikk og informasjonsvitenskapnb_NO
dc.subjectntnudaimno_NO
dc.subjectSIF2 datateknikkno_NO
dc.subjectProgram- og informasjonssystemerno_NO
dc.titleExtracting Named Entities and Synonyms from Wikipedia for use in News Searchnb_NO
dc.typeMaster thesisnb_NO
dc.source.pagenumber91nb_NO
dc.contributor.departmentNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskapnb_NO


Tilhørende fil(er)

Thumbnail
Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel