Author: W. Hess
Publisher: Springer Science & Business Media
ISBN: 3642819265
Category : Science
Languages : en
Pages : 713
Book Description
Pitch (i.e., fundamental frequency FO and fundamental period TO) occupies a key position in the acoustic speech signal. The prosodic information of an utterance is predominantly determined by this parameter. The ear is more sensitive to changes of fundamental frequency than to changes of other speech signal parameters by an order of magnitude. The quality of vocoded speech is essentially influenced by the quality and faultlessness of the pitch measure ment. Hence the importance of this parameter necessitates using good and reliable measurement methods. At first glance the task looks simple: one just has to detect the funda mental frequency or period of a quasi-periodic signal. For a number of reasons, however, the task of pitch determination has to be counted among the most difficult problems in speech analysis. 1) In principle, speech is a nonstationary process; the momentary position of the vocal tract may change abruptly at any time. This leads to drastic variations in the temporal structure of the signal, even between subsequent pitch periods, and assuming a quasi-periodic signal is often far from realistic. 2) Due to the flexibility of the human vocal tract and the wide variety of voices, there exist a multitude of possible temporal structures. Narrow-band formants at low harmonics (especially at the second or third harmonic) are an additional source of difficulty. 3) For an arbitrary speech signal uttered by an unknown speaker, the fundamental frequency can vary over a range of almost four octaves (50 to 800 Hz).
Pitch Determination of Speech Signals
Author: W. Hess
Publisher: Springer Science & Business Media
ISBN: 3642819265
Category : Science
Languages : en
Pages : 713
Book Description
Pitch (i.e., fundamental frequency FO and fundamental period TO) occupies a key position in the acoustic speech signal. The prosodic information of an utterance is predominantly determined by this parameter. The ear is more sensitive to changes of fundamental frequency than to changes of other speech signal parameters by an order of magnitude. The quality of vocoded speech is essentially influenced by the quality and faultlessness of the pitch measure ment. Hence the importance of this parameter necessitates using good and reliable measurement methods. At first glance the task looks simple: one just has to detect the funda mental frequency or period of a quasi-periodic signal. For a number of reasons, however, the task of pitch determination has to be counted among the most difficult problems in speech analysis. 1) In principle, speech is a nonstationary process; the momentary position of the vocal tract may change abruptly at any time. This leads to drastic variations in the temporal structure of the signal, even between subsequent pitch periods, and assuming a quasi-periodic signal is often far from realistic. 2) Due to the flexibility of the human vocal tract and the wide variety of voices, there exist a multitude of possible temporal structures. Narrow-band formants at low harmonics (especially at the second or third harmonic) are an additional source of difficulty. 3) For an arbitrary speech signal uttered by an unknown speaker, the fundamental frequency can vary over a range of almost four octaves (50 to 800 Hz).
Publisher: Springer Science & Business Media
ISBN: 3642819265
Category : Science
Languages : en
Pages : 713
Book Description
Pitch (i.e., fundamental frequency FO and fundamental period TO) occupies a key position in the acoustic speech signal. The prosodic information of an utterance is predominantly determined by this parameter. The ear is more sensitive to changes of fundamental frequency than to changes of other speech signal parameters by an order of magnitude. The quality of vocoded speech is essentially influenced by the quality and faultlessness of the pitch measure ment. Hence the importance of this parameter necessitates using good and reliable measurement methods. At first glance the task looks simple: one just has to detect the funda mental frequency or period of a quasi-periodic signal. For a number of reasons, however, the task of pitch determination has to be counted among the most difficult problems in speech analysis. 1) In principle, speech is a nonstationary process; the momentary position of the vocal tract may change abruptly at any time. This leads to drastic variations in the temporal structure of the signal, even between subsequent pitch periods, and assuming a quasi-periodic signal is often far from realistic. 2) Due to the flexibility of the human vocal tract and the wide variety of voices, there exist a multitude of possible temporal structures. Narrow-band formants at low harmonics (especially at the second or third harmonic) are an additional source of difficulty. 3) For an arbitrary speech signal uttered by an unknown speaker, the fundamental frequency can vary over a range of almost four octaves (50 to 800 Hz).
Index to IEEE Publications
Author: Institute of Electrical and Electronics Engineers
Publisher:
ISBN:
Category : Electric engineering
Languages : en
Pages : 668
Book Description
Issues for 1973- cover the entire IEEE technical literature.
Publisher:
ISBN:
Category : Electric engineering
Languages : en
Pages : 668
Book Description
Issues for 1973- cover the entire IEEE technical literature.
ICASSP 80
Author:
Publisher:
ISBN:
Category : Acoustic filters
Languages : en
Pages : 1188
Book Description
Publisher:
ISBN:
Category : Acoustic filters
Languages : en
Pages : 1188
Book Description
ICASSP 81
Author:
Publisher:
ISBN:
Category : Acoustic filters
Languages : en
Pages : 1344
Book Description
Publisher:
ISBN:
Category : Acoustic filters
Languages : en
Pages : 1344
Book Description
Proceedings of the Fourth International Joint Conference on Pattern Recognition, November 7-10, 1978, Kyoto, Japan
Author:
Publisher:
ISBN:
Category : Optical pattern recognition
Languages : en
Pages : 1202
Book Description
Publisher:
ISBN:
Category : Optical pattern recognition
Languages : en
Pages : 1202
Book Description
ICASSP 82
Author:
Publisher:
ISBN:
Category : Acoustical engineering
Languages : en
Pages : 846
Book Description
Publisher:
ISBN:
Category : Acoustical engineering
Languages : en
Pages : 846
Book Description
Artificial Intelligence
Author: M. J. Elphick
Publisher:
ISBN:
Category : Artificial intelligence
Languages : en
Pages : 206
Book Description
Publisher:
ISBN:
Category : Artificial intelligence
Languages : en
Pages : 206
Book Description
1981 Office Automation Conference Digest
Author:
Publisher:
ISBN:
Category : Electronic data processing
Languages : en
Pages : 488
Book Description
Publisher:
ISBN:
Category : Electronic data processing
Languages : en
Pages : 488
Book Description
Sydney R. Parker
Author:
Publisher:
ISBN:
Category :
Languages : en
Pages : 472
Book Description
Publisher:
ISBN:
Category :
Languages : en
Pages : 472
Book Description
Spoken Language Generation and Understanding
Author: J.C. Simon
Publisher: Springer Science & Business Media
ISBN: 940099091X
Category : Mathematics
Languages : en
Pages : 577
Book Description
Proceedings of the NATO Advanced Study Institute, Bonas, France, June 26-July 7, 1979
Publisher: Springer Science & Business Media
ISBN: 940099091X
Category : Mathematics
Languages : en
Pages : 577
Book Description
Proceedings of the NATO Advanced Study Institute, Bonas, France, June 26-July 7, 1979