Channel-robust speaker identification using Modified-Mean Cepstral Mean Normalization with Frequency Warping

Alvin A. Garcia, Richard J. Mammone

Research output: Contribution to journalConference article

20 Scopus citations

Abstract

The performance of automatic speaker recognition systems is significantly degraded by acoustic mismatches between training and testing conditions. Such acoustic mismatches are commonly encountered in systems that operate on speech collected over telephone networks, where different handsets and different network routes impose varying convolutional distortions on the speech signal. A new algorithm, the Modified-Mean Cepstral Mean Normalization with Frequency Warping (MMCMNFW) method, which improves upon the commonly-employed Cepstral Mean Subtraction method, has been developed. Experimental results on closed-set speaker identification tasks on a channel-corrupted subset of the TIMIT database and on a subset of the NTIMIT database are presented. The new algorithm is shown to offer improved recognition rates over other existing channel normalization methods on these databases.

Original languageEnglish (US)
Pages (from-to)325-328
Number of pages4
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume1
StatePublished - Jan 1 1999
Externally publishedYes
EventProceedings of the 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-99) - Phoenix, AZ, USA
Duration: Mar 15 1999Mar 19 1999

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Channel-robust speaker identification using Modified-Mean Cepstral Mean Normalization with Frequency Warping'. Together they form a unique fingerprint.

  • Cite this