Show simple item record

dc.creatorHennings Yeomans, Pablo
dc.date.accessioned2015-08-17T09:50:07Zen
dc.date.available2015-08-17T09:50:07Zen
dc.date.issued2002-05-01
dc.identifier.urihttp://hdl.handle.net/11285/568018en
dc.description.abstractIn this work we present results attained by noisy speech recognition experiments using wavelet analysis schemes. It is shown that under white noisy signals the wavelet parameters outperform the Mel-Frequency Cepstral Coefficients (MFCC). The main difference between the wavelet derived coefficients and the traditional MFCC consists in the computation of the spectrum, since the proposed parameters apply a wavelet packet transform instead a discrete Fourier transform. The filters in the wavelet packet transforms used in this work are Daubechies 20, Beylkin 18 and Vaidyanathan 24. The Daubechies filters maximize the smoothness of the associated scaling function by maximizing the rate of decay of its Fourier transform [Daubechies]. The Beylkin's filter was designed by placing roots for the frequency response polynomial close to the Nyquist frequency on the real axis, thus concentrating power spectrum energy in the desired band. Vaidyanathan's filter was optimized for its length to satisfy standard requirements for effective speech coding [Wikerhauser]. The experimental work show that under a noisy continuous spoken digits task the word accuracy improves up to a 32% compared with the MFCC results.
dc.languageeng
dc.publisherInstituto Tecnológico y de Estudios Superiores de Monterrey
dc.rightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0*
dc.subject.classificationArea::INGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA ELECTRÓNICA::RADARes_MX
dc.titleA Wavelet-Based Front end for Robust ASR-Edición Únicaen
dc.typeTesis de maestría
dc.contributor.departmentITESM-Campus Monterreyen
dc.contributor.committeememberNolazco Flores, Juan Arturoes
dc.contributor.committeememberGalván Rodríguez, Arturoes
dc.contributor.committeememberRodríguez Cruz, Ramónes
refterms.dateFOA2018-03-20T07:13:20Z
refterms.dateFOA2018-03-20T07:13:20Z
html.description.abstractIn this work we present results attained by noisy speech recognition experiments using wavelet analysis schemes. It is shown that under white noisy signals the wavelet parameters outperform the Mel-Frequency Cepstral Coefficients (MFCC). The main difference between the wavelet derived coefficients and the traditional MFCC consists in the computation of the spectrum, since the proposed parameters apply a wavelet packet transform instead a discrete Fourier transform. The filters in the wavelet packet transforms used in this work are Daubechies 20, Beylkin 18 and Vaidyanathan 24. The Daubechies filters maximize the smoothness of the associated scaling function by maximizing the rate of decay of its Fourier transform [Daubechies]. The Beylkin's filter was designed by placing roots for the frequency response polynomial close to the Nyquist frequency on the real axis, thus concentrating power spectrum energy in the desired band. Vaidyanathan's filter was optimized for its length to satisfy standard requirements for effective speech coding [Wikerhauser]. The experimental work show that under a noisy continuous spoken digits task the word accuracy improves up to a 32% compared with the MFCC results.
dc.identificatorCampo||7||33||3307||330710


Files in this item

Thumbnail
Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

info:eu-repo/semantics/openAccess
Except where otherwise noted, this item's license is described as info:eu-repo/semantics/openAccess