Lexical Robustness for Automatic Speech Recognition

Mertens, Timo Pascal

dc.contributor.author	Mertens, Timo Pascal	nb_NO
dc.date.accessioned	2014-12-19T13:48:17Z
dc.date.accessioned	2015-12-22T11:47:46Z
dc.date.available	2014-12-19T13:48:17Z
dc.date.available	2015-12-22T11:47:46Z
dc.date.created	2013-01-15	nb_NO
dc.date.issued	2012	nb_NO
dc.identifier	588242	nb_NO
dc.identifier.isbn	978-82-471-3728-4 (electronic ver.)	nb_NO
dc.identifier.isbn	978-82-471-3727-7 (printed ver.)
dc.identifier.uri	http://hdl.handle.net/11250/2370673
dc.description.abstract	The lexicon plays a crucial role in a speech recognition system. It defines the mapping between the words that the system can recognize and the different ways these words can be pronounced. In this thesis we address various shortcomings of the lexicon with the aim to increase lexical robustness of the speech recognizer. We focus on three aspects of lexical robustness: first we address how words that are not in the lexicon can be recognized, which is also known as the out-ofvocabulary problem. We then investigate how pronunciation variation, especially of non-native speakers, can be handled in the lexicon. Finally, we develop approaches that learn lexical entries from data in a semi-supervised fashion. Like most machine learning techniques, many of our proposed approaches depend on training data to work well. Due to data sparsity we exploit appealing properties inherent to subword modeling to adapt the lexicon in various setups, or use subwords directly as the recognition unit when decoding the speech signal. We evaluate our novel methods in the context of transcription as well as Spoken Term Detection, since both tasks rely significantly on the robustness of the lexicon.	nb_NO
dc.language	eng	nb_NO
dc.publisher	Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for elektronikk og telekommunikasjon	nb_NO
dc.relation.ispartofseries	Doktoravhandlinger ved NTNU, 1503-8181; 2012:214	nb_NO
dc.title	Lexical Robustness for Automatic Speech Recognition	nb_NO
dc.type	Doctoral thesis	nb_NO
dc.contributor.department	Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for elektronikk og telekommunikasjon	nb_NO
dc.description.degree	PhD i elektronikk og telekommunikasjon	nb_NO
dc.description.degree	PhD in Electronics and Telecommunication

Tilhørende fil(er)

Filnavn:: 588242_FULLTEXT01.pdf
Størrelse:: 8.674Mb
Format:: PDF

Låst

Denne innførselen finnes i følgende samling(er)

Institutt for elektroniske systemer [2286]

Vis enkel innførsel