Pitch Determination of Speech Signals in the Presence of Noise

Pitch Determination of Speech Signals in the Presence of Noise PDF Author: Martin Roy Varley
Publisher:
ISBN:
Category : Image processing
Languages : en
Pages :

Get Book Here

Book Description

Pitch Determination of Speech Signals in the Presence of Noise

Pitch Determination of Speech Signals in the Presence of Noise PDF Author: Martin Roy Varley
Publisher:
ISBN:
Category : Image processing
Languages : en
Pages :

Get Book Here

Book Description


Pitch Determination of Speech Signals

Pitch Determination of Speech Signals PDF Author: W. Hess
Publisher: Springer Science & Business Media
ISBN: 3642819265
Category : Science
Languages : en
Pages : 713

Get Book Here

Book Description
Pitch (i.e., fundamental frequency FO and fundamental period TO) occupies a key position in the acoustic speech signal. The prosodic information of an utterance is predominantly determined by this parameter. The ear is more sensitive to changes of fundamental frequency than to changes of other speech signal parameters by an order of magnitude. The quality of vocoded speech is essentially influenced by the quality and faultlessness of the pitch measure ment. Hence the importance of this parameter necessitates using good and reliable measurement methods. At first glance the task looks simple: one just has to detect the funda mental frequency or period of a quasi-periodic signal. For a number of reasons, however, the task of pitch determination has to be counted among the most difficult problems in speech analysis. 1) In principle, speech is a nonstationary process; the momentary position of the vocal tract may change abruptly at any time. This leads to drastic variations in the temporal structure of the signal, even between subsequent pitch periods, and assuming a quasi-periodic signal is often far from realistic. 2) Due to the flexibility of the human vocal tract and the wide variety of voices, there exist a multitude of possible temporal structures. Narrow-band formants at low harmonics (especially at the second or third harmonic) are an additional source of difficulty. 3) For an arbitrary speech signal uttered by an unknown speaker, the fundamental frequency can vary over a range of almost four octaves (50 to 800 Hz).

Pith Determination of Speech Signals in the Presence of Noise

Pith Determination of Speech Signals in the Presence of Noise PDF Author: M.R Varley
Publisher:
ISBN:
Category :
Languages : en
Pages : 270

Get Book Here

Book Description


Visual Representations of Speech Signals

Visual Representations of Speech Signals PDF Author: Martin Cooke
Publisher:
ISBN:
Category : Computers
Languages : en
Pages : 406

Get Book Here

Book Description
Presents a wide range of graphical representations of some speech signals and allows current speech analysis techniques to be assessed and directly compared. Describes time-frequency representations, auditory modeling, neural networks, pitch and multi-channel analysis. The study of over 40 different analyses of speech is represented in myriad images found throughout.

New Time-frequency Domain Pitch Estimation Methods for Speed Signals Under Low Levels of SNR

New Time-frequency Domain Pitch Estimation Methods for Speed Signals Under Low Levels of SNR PDF Author: Celia Shahnaz
Publisher:
ISBN:
Category :
Languages : en
Pages : 0

Get Book Here

Book Description
The major objective of this research is to develop novel pitch estimation methods capable of handling speech signals in practical situations where only noise-corrupted speech observations are available. With this objective in mind, the estimation task is carried out in two different approaches. In the first approach, the noisy speech observations are directly employed to develop two new time-frequency domain pitch estimation methods. These methods are based on extracting a pitch-harmonic and finding the corresponding harmonic number required for pitch estimation. Considering that voiced speech is the output of a vocal tract system driven by a sequence of pulses separated by the pitch period, in the second approach, instead of using the noisy speech directly for pitch estimation, an excitation-like signal (ELS) is first generated from the noisy speech or its noise- reduced version. In the first approach, at first, a harmonic cosine autocorrelation (HCAC) model of clean speech in terms of its pitch-harmonics is introduced. In order to extract a pitch-harmonic, we propose an optimization technique based on least-squares fitting of the autocorrelation function (ACF) of the noisy speech to the HCAC model. By exploiting the extracted pitch-harmonic along with the fast Fourier transform (FFT) based power spectrum of noisy speech, we then deduce a harmonic measure and a harmonic-to-noise-power ratio (HNPR) to determine the desired harmonic number of the extracted pitch-harmonic. In the proposed optimization, an initial estimate of the pitch-harmonic is obtained from the maximum peak of the smoothed FFT power spectrum. In addition to the HCAC model, where the cross-product terms of different harmonics are neglected, we derive a compact yet accurate harmonic sinusoidal autocorrelation (HSAC) model for clean speech signal. The new HSAC model is then used in the least-squares model-fitting optimization technique to extract a pitch-harmonic. In the second approach, first, we develop a pitch estimation method by using an excitation-like signal (ELS) generated from the noisy speech. To this end, a technique is based on the principle of homomorphic deconvolution is proposed for extracting the vocal-tract system (VTS) parameters from the noisy speech, which are utilized to perform an inverse-filtering of the noisy speech to produce a residual signal (RS). In order to reduce the effect of noise on the RS, a noise-compensation scheme is introduced in the autocorrelation domain. The noise-compensated ACF of the RS is then employed to generate a squared Hilbert envelope (SHE) as the ELS of the voiced speech. With a view to further overcome the adverse effect of noise on the ELS, a new symmetric normalized magnitude difference function of the ELS is proposed for eventual pitch estimation. Cepstrum has been widely used in speech signal processing but has limited capability of handling noise. One potential solution could be the introduction of a noise reduction block prior to pitch estimation based on the conventional cepstrum, a framework already available in many practical applications, such as mobile communication and hearing aids. Motivated by the advantages of the existing framework and considering the superiority of our ELS to the speech itself in providing clues for pitch information, we develop a cepstrum-based pitch estimation method by using the ELS obtained from the noise-reduced speech. For this purpose, we propose a noise subtraction scheme in frequency domain, which takes into account the possible cross-correlation between speech and noise and has advantages of noise being updated with time and adjusted at each frame. The enhanced speech thus obtained is utilized to extract the vocal-tract system (VTS) parameters via the homomorphic deconvolution technique. A residual signal (RS) is then produced by inverse-filtering the enhanced speech with the extracted VTS parameters. It is found that, unlike the previous ELS-based method, the squared Hilbert envelope (SHE) computed from the RS of the enhanced speech without noise compensation, is sufficient to represent an ELS. Finally, in order to tackle the undesirable effect of noise of the ELS at a very low SNR and overcome the limitation of the conventional cepstrum in handling different types of noises, a time-frequency domain pseudo cepstrum of the ELS of the enhanced speech, incorporating information of both magnitude and phase spectra of the ELS, is proposed for pitch estimation. (Abstract shortened by UMI.).

Pitch estimation of aperiodic and noisy speech signals

Pitch estimation of aperiodic and noisy speech signals PDF Author: T. V. Sreenivas
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description


SPEECH AND AUDIO SIGNAL PROCESSING: PROCESSING AND PERCEPTION OF SPEECH AND MUSIC

SPEECH AND AUDIO SIGNAL PROCESSING: PROCESSING AND PERCEPTION OF SPEECH AND MUSIC PDF Author: Ben Gold & Nelson Morgan
Publisher: John Wiley & Sons
ISBN: 9788126508228
Category :
Languages : en
Pages : 708

Get Book Here

Book Description
Market_Desc: Professionals in the fields of ASR and speaker recognition, speech bandwidth compression, speech analysis and synthesis, and music analysis and synthesis Special Features: · Provides a top-level summary of speech and music processing from a historical perspective.· Introduce brief and selected introduction, when necessary, to mathematical concepts such as difference equation or probability dense functions. About The Book: Speech and music are the most basic means of adult human communication. As technology advances and increasingly sophisticated tools become available to use with speech and music signals, scientists can study these sounds more effectively, and invent new ways of applying them for the benefit of humankind. This text includes coverage of the physiology and psychoacoustics of hearing as well as the results from research on pitch and speech perception, vocoding methods and information on many aspects of automatic speech recognition (ASR) systems. The authors have made use of their own research in these fields, as well as the methods and results of many other contributors.

Fundamentals of Speech Signal Processing

Fundamentals of Speech Signal Processing PDF Author: Shūzō Saitō
Publisher:
ISBN:
Category : Language Arts & Disciplines
Languages : en
Pages : 290

Get Book Here

Book Description


Pitch Determination of Speech Signals

Pitch Determination of Speech Signals PDF Author: Mark David Anderson
Publisher:
ISBN:
Category :
Languages : en
Pages : 294

Get Book Here

Book Description


Basic Techniques in Pitch Determination of Speech Signals

Basic Techniques in Pitch Determination of Speech Signals PDF Author: Nghia Van Le
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description