Semantically Meaningful Metrics for Norwegian ASR Systems

Rugayan, Janine Lizbeth Cabrera; Svendsen, Torbjørn Karl; Salvi, Giampiero

dc.contributor.author	Rugayan, Janine Lizbeth Cabrera
dc.contributor.author	Svendsen, Torbjørn Karl
dc.contributor.author	Salvi, Giampiero
dc.date.accessioned	2023-02-17T09:09:16Z
dc.date.available	2023-02-17T09:09:16Z
dc.date.created	2022-09-28T08:06:57Z
dc.date.issued	2022
dc.identifier.issn	2308-457X
dc.identifier.uri	https://hdl.handle.net/11250/3051822
dc.description.abstract	Evaluation metrics are important for quanitfying the performance of Automatic Speech Recognition (ASR) systems. However, the widely used word error rate (WER) captures errors at the word-level only and weighs each error equally, which makes it insufficient to discern ASR system performance for downstream tasks such as Natural Language Understanding (NLU) or information retrieval. We explore in this paper a more robust and discriminative evaluation metric for Norwegian ASR systems through the use of semantic information modeled by a transformer-based language model. We propose Aligned Semantic Distance (ASD) which employs dynamic programming to quantify the similarity between the reference and hypothesis text. First, embedding vectors are generated using the NorBERT model. Afterwards, the minimum global distance of the optimal alignment between these vectors is obtained and normalized by the sequence length of the reference embedding vector. In addition, we present results using Semantic Distance (SemDist), and compare them with ASD. Results show that for the same WER, ASD and SemDist values can vary significantly, thus, exemplifying that not all recognition errors can be considered equally important. We investigate the resulting data, and present examples which demonstrate the nuances of both metrics in evaluating various transcription errors.	en_US
dc.language.iso	eng	en_US
dc.publisher	International Speech Communication Association	en_US
dc.title	Semantically Meaningful Metrics for Norwegian ASR Systems	en_US
dc.title.alternative	Semantically Meaningful Metrics for Norwegian ASR Systems	en_US
dc.type	Peer reviewed	en_US
dc.type	Journal article	en_US
dc.description.version	acceptedVersion	en_US
dc.source.journal	Interspeech (USB)	en_US
dc.identifier.doi	10.21437/Interspeech.2022-817
dc.identifier.cristin	2056124
dc.relation.project	Norges forskningsråd: 322964	en_US
dc.relation.project	The EEA and Norway Grants Fund for Regional Cooperation: CZ-RESEARCH-0022	en_US
cristin.ispublished	true
cristin.fulltext	original
cristin.fulltext	postprint
cristin.qualitycode	1

Tilhørende fil(er)

Filnavn:: SemEvalASR_INTERSPEECH_2022_fi ...
Størrelse:: 342.6Kb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Institutt for elektroniske systemer [2297]
Publikasjoner fra CRIStin - NTNU [37509]

Vis enkel innførsel