NARC – Norwegian Anaphora Resolution Corpus
Mæhlum, Petter; Haug, Dag Trygve Truslew; Jørgensen, Tollef Emil; Kåsen, Andre; Nøklestad, Anders; Rønningstad, Egil; Solberg, Per Erik; Velldal, Erik; Øvrelid, Lilja
Peer reviewed, Journal article
Accepted version
View/ Open
Date
2022Metadata
Show full item recordCollections
Original version
International Conference on Computational Linguistics (ICCL) (COLING). 2022, 29 (7), 48-60.Abstract
Published in: Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC): https://aclanthology.org/venues/coling/. We present the Norwegian Anaphora Resolution Corpus (NARC), the first publicly available corpus annotated with anaphoric relations between noun phrases for Norwegian. The paper describes the annotated data for 326 documents in Norwegian Bokmål, together with inter-annotator agreement and discussions of relevant statistics. We also present preliminary modelling results which are comparable to existing corpora for other languages, and discuss relevant problems in relation to both modelling and the annotations themselves. NARC – Norwegian Anaphora Resolution Corpus