Vis enkel innførsel

dc.contributor.advisorSvendsen, Torbjørnnb_NO
dc.contributor.advisorPettersen, Svein Gunnar
dc.contributor.authorJohansen, Martin Etnestadnb_NO
dc.date.accessioned2014-12-19T13:49:39Z
dc.date.accessioned2015-12-22T11:50:16Z
dc.date.available2014-12-19T13:49:39Z
dc.date.available2015-12-22T11:50:16Z
dc.date.created2014-06-27nb_NO
dc.date.issued2009nb_NO
dc.identifier730486nb_NO
dc.identifierntnudaim:4707
dc.identifier.urihttp://hdl.handle.net/11250/2370992
dc.description.abstractThe public switched telephone network (PSTN) restricts the acoustic bandwidth of telephonyspeech to less than 4 kHz. For compatibility with analog telephone networks, a 0.3 − 3.4 kHz passband is common. This bandwidth reduction has a significant impact on perceived quality, andis especially noticeable and even distracting when PSTN users call into, e.g., video conferencingsystems in which the other participants may use wideband (50 − 7k Hz) speech codecs. To reducethe gap in quality, one may attempt to resynthesize the missing spectrum. Techniques for thisare referred to as bandwidth extension (BWE).For this thesis, two systems for BWE of speech into the high band (f ≥ 3.4 kHz) were imple-mented in Matlab, based on systems proposed in literature. The extension was done accordingto the linear source-filter model for speech, meaning estimation of the excitation and spectralenvelope from the narrowband (0.3 − 3.4 kHz) signal were done separately.BWE System 1 made use of linear prediction (LP) analysis in combination with modulation forextension of the excitation. Its wideband spectral envelope estimation was primarily based onlinear prediction cepstral coefficients (LPCC) and artificial neural networks (ANN).BWE System 2 made use of bandpass-modulation of Gaussian noise (BP-MGN) for extension ofthe excitation. Its wideband spectral envelope estimation was based on Mel-frequency cepstralcoefficients (MFCC) and Gaussian mixture modelling (GMM), which was the most complexestimation method of the two systems.Objective analysis of the two systems? spectral envelope estimation and informal listening testswere carried out. These analyses showed that BWE System 1 performed best, though bothsystems improved the perceived quality. BWE systems based on LP analysis therefore seem tobe preferrable due to the superior excitation, and efficient computation of the cepstrum.nb_NO
dc.languageengnb_NO
dc.publisherInstitutt for elektronikk og telekommunikasjonnb_NO
dc.titleBandwidth Extension of Telephony Speechnb_NO
dc.typeMaster thesisnb_NO
dc.source.pagenumber79nb_NO
dc.contributor.departmentNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for elektronikk og telekommunikasjonnb_NO


Tilhørende fil(er)

Thumbnail
Thumbnail
Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel