Show simple item record

dc.contributor.authorHu, Xuke
dc.contributor.authorSun, Yeran
dc.contributor.authorKersten, Jens
dc.contributor.authorZhou, Zhiyong
dc.contributor.authorKlan, Friederike
dc.contributor.authorFan, Hongchao
dc.date.accessioned2023-10-23T10:58:14Z
dc.date.available2023-10-23T10:58:14Z
dc.date.created2023-02-18T18:21:02Z
dc.date.issued2023
dc.identifier.citationInternational Journal of Applied Earth Observation and Geoinformation. 2023, 117 .en_US
dc.identifier.issn1569-8432
dc.identifier.urihttps://hdl.handle.net/11250/3098036
dc.description.abstractNatural language texts, such as tweets and news, contain a vast amount of geospatial information, which can be extracted by first recognizing toponyms in texts (toponym recognition) and then identifying their geospatial representations (toponym disambiguation). This paper focuses on toponym disambiguation, which can be approached by toponym resolution and entity linking. Recently, many novel approaches, especially deep learning-based, have been proposed, such as CamCoder, GENRE, and BLINK. However, these approaches were not compared on the same and large datasets. Moreover, there is still a need and space to improve their robustness and generalizability further. To mitigate the two research gaps, in this paper, we propose a spatial clustering-based voting approach combining several individual approaches and compare a voting ensemble with 20 latest and commonly-used approaches based on 12 public datasets, including several highly challenging datasets (e.g., WikToR). They are in six types: tweets, historical documents, news, web pages, scientific articles, and Wikipedia articles, containing 98,300 toponyms. Experimental results show that the voting ensemble performs the best on all the datasets, achieving an average Accuracy@161km of 0.86, proving its generalizability and robustness. It also drastically improves the performance of resolving fine-grained places, i.e., POIs, natural features, and traffic ways. The detailed evaluation results can inform future methodological developments and guide the selection of proper approaches based on application needs.en_US
dc.language.isoengen_US
dc.publisherElsevier B. V.en_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleHow can voting mechanisms improve the robustness and generalizability of toponym disambiguation?en_US
dc.title.alternativeHow can voting mechanisms improve the robustness and generalizability of toponym disambiguation?en_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.source.volume117en_US
dc.source.journalInternational Journal of Applied Earth Observation and Geoinformationen_US
dc.identifier.doi10.1016/j.jag.2023.103191
dc.identifier.cristin2127217
dc.source.articlenumber103191en_US
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Navngivelse 4.0 Internasjonal
Except where otherwise noted, this item's license is described as Navngivelse 4.0 Internasjonal