Vis enkel innførsel

dc.contributor.advisorSvendsen, Torbjørn
dc.contributor.authorOlfati, Negar
dc.date.accessioned2015-12-28T10:04:52Z
dc.date.available2015-12-28T10:04:52Z
dc.date.created2015-02-22
dc.date.issued2015
dc.identifierntnudaim:6646
dc.identifier.urihttp://hdl.handle.net/11250/2371401
dc.description.abstractThis work is intended to explore the performance of a new set of acoustic model units in speech recognition. The acoustic models were built and evaluated from scratch in several steps: Feature extraction, acoustic detection and merging, acoustic segmentation of TIMIT corpus, clustering the segment representatives, assigning labels to each cluster and labelling the segments by cluster labels, and finally acoustic modeling. At the acoustic modeling phase, two experiments were investigated, using standard HMM structures and HTK toolkit; In the first experiment, the models were trained and evaluated by the annotated version of training data from TIMIT database in terms of cluster labels. In the second experiment, the time-aligned version of transcriptions was utilized to train acoustic models. Both experiments were carried out on four systems with 128, 256, 512 and 1024 units. Both single and mixture probability estimators were testified. In both experiments, the best results were achieved using GMMs with three-components for the 128 units system.
dc.languageeng
dc.publisherNTNU
dc.subjectElektronikk (2årig), Akustikk
dc.titleMachine Learning of Sub-Phonemic Units for Speech Recognition
dc.typeMaster thesis
dc.source.pagenumber120


Tilhørende fil(er)

Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel