dc.contributor.author | Swamy, Steve Durairaj | |
dc.contributor.author | Jamatia, Anupam | |
dc.contributor.author | Gambäck, Björn | |
dc.date.accessioned | 2019-11-13T12:00:26Z | |
dc.date.available | 2019-11-13T12:00:26Z | |
dc.date.created | 2019-11-06T16:42:11Z | |
dc.date.issued | 2019 | |
dc.identifier.isbn | 978-1-950737-72-7 | |
dc.identifier.uri | http://hdl.handle.net/11250/2628214 | |
dc.description.abstract | Work on Abusive Language Detection has tackled a wide range of subtasks and domains. As a result of this, there exists a great deal of redundancy and non-generalisability between datasets. Through experiments on cross-dataset training and testing, the paper reveals that the preconceived notion of including more non-abusive samples in a dataset (to emulate reality) may have a detrimental effect on the generalisability of a model trained on that data. Hence a hierarchical annotation model is utilised here to reveal redundancies in existing datasets and to help reduce redundancy in future efforts. | nb_NO |
dc.language.iso | eng | nb_NO |
dc.publisher | Association for Computational Linguistics | nb_NO |
dc.relation.ispartof | CoNLL 2019 The 23rd Conference on Computational Natural Language Learning Proceedings of the Conference | |
dc.relation.uri | https://www.aclweb.org/anthology/K19-1088.pdf | |
dc.rights | Attribution 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/deed.no | * |
dc.title | Studying Generalisability across Abusive Language Detection Datasets | nb_NO |
dc.type | Chapter | nb_NO |
dc.description.version | publishedVersion | nb_NO |
dc.source.pagenumber | 940-950 | nb_NO |
dc.identifier.doi | 10.18653/v1/K19-1088 | |
dc.identifier.cristin | 1744694 | |
dc.description.localcode | Materials published in or after 2016 are licensed under a Creative Commons Attribution 4.0 International License. | nb_NO |
cristin.unitcode | 194,63,10,0 | |
cristin.unitname | Institutt for datateknologi og informatikk | |
cristin.ispublished | true | |
cristin.fulltext | postprint | |
cristin.qualitycode | 1 | |