SP-BERT: A Language Model for Political Text in Scandinavian Languages
Original version
Lecture Notes in Computer Science (LNCS). 2023, 13913 467-477. https://doi.org/10.1007/978-3-031-35320-8_34Abstract
Language models are at the core of modern Natural Language Processing. We present a new BERT-style language model dedicated to political texts in Scandinavian languages. Concretely, we introduce SP-BERT, a model trained with parliamentary speeches in Norwegian, Swedish, Danish, and Icelandic. To show its utility, we evaluate its ability to predict the speakers’ party affiliation and explore language shifts of politicians transitioning between Cabinet and Opposition.