Cross-lingual Speaker Verification: Evaluation On X-Vector Method

Mandalapu, Hareesh; Møller Elbo, Thomas; Ramachandra, Raghavendra; Busch, Christoph

Chapter

Accepted version

Permanent link

https://hdl.handle.net/11250/2736064

Publication date

2021

Collections

Institutt for informasjonssikkerhet og kommunikasjonsteknologi [2517]
Publikasjoner fra CRIStin - NTNU [37175]

Abstract

The accuracy of Automatic Speaker Verification (ASV) systems depends on the spoken language used in training and enrolling speakers. This language dependency makes voice-based security systems less robust and harder to generalize to a wide range of applications. In this work, we study the language dependency of a speaker verification system and perform experiments to benchmark the robustness of x-vector based techniques to it. Experiments are carried out on a smartphone multi-lingual dataset of 50 subjects, containing utterances in four different languages captured in five sessions. We use two world training datasets, one containing a single language and one containing multiple languages. Results show that performance degrades when the enrollment and test languages do not match. Further, our experimental results indicate that the degree of degradation depends on the languages present in the world training data.
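
The matched versus mismatched comparison described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not state the scoring backend or the error metric, so cosine similarity and an approximate equal error rate (EER) stand in for them, and the function names, the trial tuple layout, and the language labels are hypothetical.

```python
import math


def cosine_score(a, b):
    """Cosine similarity between two speaker embeddings (lists of floats).

    Assumption: a stand-in scorer, not necessarily the one used in the paper.
    """
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def equal_error_rate(genuine, impostor):
    """Approximate the EER by trying every observed score as a threshold."""
    best_gap, eer = float("inf"), 1.0
    for t in sorted(set(genuine) | set(impostor)):
        far = sum(s >= t for s in impostor) / len(impostor)  # false accept rate
        frr = sum(s < t for s in genuine) / len(genuine)     # false reject rate
        if abs(far - frr) < best_gap:
            best_gap, eer = abs(far - frr), (far + frr) / 2
    return eer


def eer_by_language_condition(trials):
    """Split trials by enrollment/test language match and report EER for each.

    trials: iterable of (enroll_lang, test_lang, is_same_speaker, score).
    This tuple layout is hypothetical.
    """
    buckets = {"matched": ([], []), "mismatched": ([], [])}
    for enroll_lang, test_lang, same_speaker, score in trials:
        key = "matched" if enroll_lang == test_lang else "mismatched"
        buckets[key][0 if same_speaker else 1].append(score)
    return {k: equal_error_rate(g, i) for k, (g, i) in buckets.items()}
```

Under these assumptions, a higher EER in the "mismatched" bucket than in the "matched" bucket would correspond to the kind of degradation the abstract reports. Each bucket needs at least one genuine and one impostor trial, otherwise the rates are undefined.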

Publisher

Springer