Share Email Print
cover

Proceedings Paper

Improvements in scale-transform-based features for speech analysis
Author(s): Srinivasan Umesh; Leon Cohen; Douglas J. Nelson
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

In this paper, we present improvements over the original scale-cepstrum proposed. The scale-cepstrum was proposed as an acoustic feature for speech analysis and was motivated by a desire to normalize the first-order effects of differences in vocal-tract lengths for a given vowel. Our subsequent work has shown that a more appropriate frequency-warping than the log-warping used is necessary to account for the frequency dependency of the scale-factor. Using this more appropriate frequency-warping and a modified method of computing the scale-cepstrum we have obtained improved features that provide better separability between vowels than before, and are also robust to noise. We have used the generalized F-ratio test as a measure of separability and have compared the proposed improved features with the melcepstral features. The data used in the comparison consist of ten vowels extracted from sentences spoken by different speakers in the TIMIT database.

Paper Details

Date Published: 30 October 1997
PDF: 14 pages
Proc. SPIE 3169, Wavelet Applications in Signal and Image Processing V, (30 October 1997); doi: 10.1117/12.292805
Show Author Affiliations
Srinivasan Umesh, Indian Institute of Technology (United States)
Leon Cohen, CUNY/Hunter College (United States)
Douglas J. Nelson, Dept. of Defense (United States)


Published in SPIE Proceedings Vol. 3169:
Wavelet Applications in Signal and Image Processing V
Akram Aldroubi; Andrew F. Laine; Michael A. Unser, Editor(s)

© SPIE. Terms of Use
Back to Top