• norsk
    • English
  • English 
    • norsk
    • English
  • Login
View Item 
  •   Home
  • Fakultet for informasjonsteknologi og elektroteknikk (IE)
  • Institutt for elektroniske systemer
  • View Item
  •   Home
  • Fakultet for informasjonsteknologi og elektroteknikk (IE)
  • Institutt for elektroniske systemer
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Lexical Robustness for Automatic Speech Recognition

Mertens, Timo Pascal
Doctoral thesis
View/Open
588242_FULLTEXT01.pdf (Locked)
URI
http://hdl.handle.net/11250/2370673
Date
2012
Metadata
Show full item record
Collections
  • Institutt for elektroniske systemer [1533]
Abstract
The lexicon plays a crucial role in a speech recognition system. It defines the mapping between the words that the system can recognize and the different ways these words can be pronounced. In this thesis we address various shortcomings of the lexicon with the aim to increase lexical robustness of the speech recognizer. We focus on three aspects of lexical robustness: first we address how words that are not in the lexicon can be recognized, which is also known as the out-ofvocabulary problem. We then investigate how pronunciation variation, especially of non-native speakers, can be handled in the lexicon. Finally, we develop approaches that learn lexical entries from data in a semi-supervised fashion. Like most machine learning techniques, many of our proposed approaches depend on training data to work well. Due to data sparsity we exploit appealing properties inherent to subword modeling to adapt the lexicon in various setups, or use subwords directly as the recognition unit when decoding the speech signal. We evaluate our novel methods in the context of transcription as well as Spoken Term Detection, since both tasks rely significantly on the robustness of the lexicon.
Publisher
Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for elektronikk og telekommunikasjon
Series
Doktoravhandlinger ved NTNU, 1503-8181; 2012:214

Contact Us | Send Feedback

Privacy policy
DSpace software copyright © 2002-2019  DuraSpace

Service from  Unit
 

 

Browse

ArchiveCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsDocument TypesJournalsThis CollectionBy Issue DateAuthorsTitlesSubjectsDocument TypesJournals

My Account

Login

Statistics

View Usage Statistics

Contact Us | Send Feedback

Privacy policy
DSpace software copyright © 2002-2019  DuraSpace

Service from  Unit