Non-uniform speaker normalization using frequency-dependent scaling function

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

In this paper, we present improvements in the estimation of frequency-dependent scaling function for non-uniform speaker normalization when compared to the method in [1]. Further, unlike [1], using the estimated frequency-dependent scaling function, γ(f), we estimated the universal warping function that is necessary to separate out the speaker-dependent term as a translation factor and show that it is similar to mel-scale. Since, the proposed warping function is similar to mel-scale, we argue that our study "justifies" the usage of mel-scale in speech recognition, not only from the point of view of psychoacoustics but also from the view point of speaker normalization. Finally, in [2], we have assumed the commonly used formula for mel-scale, mel = 2595 log10 1 + f/700 for the universal warping function. In this paper, we have tried to fit a mel-like formula to the estimated universal warping function and use it to do non-uniform speaker normalization. We present the recognition results using these different universal warping functions with word error rate as the performance measure.

Original languageEnglish (US)
Title of host publication2004 International Conference on Signal Processing and Communications, SPCOM
Pages305-309
Number of pages5
StatePublished - 2004
Event2004 International Conference on Signal Processing and Communications, SPCOM - Banalore, India
Duration: Dec 11 2004Dec 14 2004

Publication series

Name2004 International Conference on Signal Processing and Communications, SPCOM

Other

Other2004 International Conference on Signal Processing and Communications, SPCOM
Country/TerritoryIndia
CityBanalore
Period12/11/0412/14/04

All Science Journal Classification (ASJC) codes

  • General Engineering

Fingerprint

Dive into the research topics of 'Non-uniform speaker normalization using frequency-dependent scaling function'. Together they form a unique fingerprint.

Cite this