SP-BERT: A Language Model for Political Text in Scandinavian Languages
Journal article
Submitted version
Permanent lenke
https://hdl.handle.net/11250/3110537Utgivelsesdato
2023Metadata
Vis full innførselSamlinger
Originalversjon
Lecture Notes in Computer Science (LNCS). 2023, 13913 467-477. https://doi.org/10.1007/978-3-031-35320-8_34Sammendrag
Language models are at the core of modern Natural Language Processing. We present a new BERT-style language model dedicated to political texts in Scandinavian languages. Concretely, we introduce SP-BERT, a model trained with parliamentary speeches in Norwegian, Swedish, Danish, and Icelandic. To show its utility, we evaluate its ability to predict the speakers’ party affiliation and explore language shifts of politicians transitioning between Cabinet and Opposition.