Share Email Print

Proceedings Paper

Scale-invariant speech analysis via joint time-frequency-scale processing
Author(s): Leon Cohen; Nenad M. Marinovic; Srinivasan Umesh; Douglas J. Nelson
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

We argue that an important aspect of the human speech signal is scaling in the frequency domain. We discuss the two physical mechanisms responsible for the scaling. The first mechanism is that when we have a harmonic signal whose fundamental is frequency modulated then the spectrum is the sum of scaled functions. The second comes about from the consideration that while different speakers have very different size vocal tracts (for example an adult and a child), we none the less produce speech which is similar in some sense. We will argue and present evidence to show that the speaker differences result in scaling in the frequency domain. We further discuss how one can handle scale processing.

Paper Details

Date Published: 1 September 1995
PDF: 16 pages
Proc. SPIE 2569, Wavelet Applications in Signal and Image Processing III, (1 September 1995); doi: 10.1117/12.217607
Show Author Affiliations
Leon Cohen, CUNY/Hunter College (United States)
Nenad M. Marinovic, CUNY/City College (United States)
Srinivasan Umesh, CUNY/Hunter College (United States)
Douglas J. Nelson, Dept. of Defense (United States)

Published in SPIE Proceedings Vol. 2569:
Wavelet Applications in Signal and Image Processing III
Andrew F. Laine; Michael A. Unser, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?