Analysis of compressed speech signals in an Automatic Speaker Recognition system

Richard A. Metzger, John F. Doherty, David M. Jenkins

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

This paper analyzes the effects popular audio compression algorithms have on the performance of a speaker recognition system. Popular audio compression algorithms were used to compress both clean and noisy speech before being passed to a speaker recognition system. The features extracted from each speaker were 19-dimensional Mel-Frequency Cepstrum Coefficients (MFCC) and the corresponding features were modeled using a 16 mixture Gaussian Mixture Model (GMM). Our experiments show that compression will have a negative effect on recognition rates if the compressed speech is clean. However, if small amounts of white Gaussian noise are added before the speech is compressed, recognition rates can be increased by as much as 7% with certain compression algorithms.

Original languageEnglish (US)
Title of host publication2015 49th Annual Conference on Information Sciences and Systems, CISS 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781479984282
DOIs
StatePublished - Apr 15 2015
Event2015 49th Annual Conference on Information Sciences and Systems, CISS 2015 - Baltimore, United States
Duration: Mar 18 2015Mar 20 2015

Publication series

Name2015 49th Annual Conference on Information Sciences and Systems, CISS 2015

Other

Other2015 49th Annual Conference on Information Sciences and Systems, CISS 2015
Country/TerritoryUnited States
CityBaltimore
Period3/18/153/20/15

All Science Journal Classification (ASJC) codes

  • Information Systems

Fingerprint

Dive into the research topics of 'Analysis of compressed speech signals in an Automatic Speaker Recognition system'. Together they form a unique fingerprint.

Cite this