Pitch Estimation of Speech Signals Using Cyclic Statistics

Pitch Estimation of Speech Signals Using Cyclic Statistics PDF Author: Osman Burak Onal
Publisher:
ISBN:
Category :
Languages : en
Pages : 138

Get Book Here

Book Description

Pitch Estimation of Speech Signals Using Cyclic Statistics

Pitch Estimation of Speech Signals Using Cyclic Statistics PDF Author: Osman Burak Onal
Publisher:
ISBN:
Category :
Languages : en
Pages : 138

Get Book Here

Book Description


Pitch Determination of Speech Signals

Pitch Determination of Speech Signals PDF Author: W. Hess
Publisher: Springer Science & Business Media
ISBN: 3642819265
Category : Science
Languages : en
Pages : 713

Get Book Here

Book Description
Pitch (i.e., fundamental frequency FO and fundamental period TO) occupies a key position in the acoustic speech signal. The prosodic information of an utterance is predominantly determined by this parameter. The ear is more sensitive to changes of fundamental frequency than to changes of other speech signal parameters by an order of magnitude. The quality of vocoded speech is essentially influenced by the quality and faultlessness of the pitch measure ment. Hence the importance of this parameter necessitates using good and reliable measurement methods. At first glance the task looks simple: one just has to detect the funda mental frequency or period of a quasi-periodic signal. For a number of reasons, however, the task of pitch determination has to be counted among the most difficult problems in speech analysis. 1) In principle, speech is a nonstationary process; the momentary position of the vocal tract may change abruptly at any time. This leads to drastic variations in the temporal structure of the signal, even between subsequent pitch periods, and assuming a quasi-periodic signal is often far from realistic. 2) Due to the flexibility of the human vocal tract and the wide variety of voices, there exist a multitude of possible temporal structures. Narrow-band formants at low harmonics (especially at the second or third harmonic) are an additional source of difficulty. 3) For an arbitrary speech signal uttered by an unknown speaker, the fundamental frequency can vary over a range of almost four octaves (50 to 800 Hz).

Multi-Pitch Estimation

Multi-Pitch Estimation PDF Author: Mads Christensen
Publisher: Morgan & Claypool Publishers
ISBN: 1598298399
Category : Technology & Engineering
Languages : en
Pages : 159

Get Book Here

Book Description
Periodic signals can be decomposed into sets of sinusoids having frequencies that are integer multiples of a fundamental frequency. The problem of finding such fundamental frequencies from noisy observations is important in many speech and audio applications, where it is commonly referred to as pitch estimation. These applications include analysis, compression, separation, enhancement, automatic transcription and many more. In this book, an introduction to pitch estimation is given and a number of statistical methods for pitch estimation are presented. The basic signal models and associated estimation theoretical bounds are introduced, and the properties of speech and audio signals are discussed and illustrated. The presented methods include both single- and multi-pitch estimators based on statistical approaches, like maximum likelihood and maximum a posteriori methods, filtering methods based on both static and optimal adaptive designs, and subspace methods based on the principles of subspace orthogonality and shift-invariance. The application of these methods to analysis of speech and audio signals is demonstrated using both real and synthetic signals, and their performance is assessed under various conditions and their properties discussed. Finally, the estimators are compared in terms of computational and statistical efficiency, generalizability and robustness. Table of Contents: Fundamentals / Statistical Methods / Filtering Methods / Subspace Methods / Amplitude Estimation

Discrete-Time Speech Signal Processing

Discrete-Time Speech Signal Processing PDF Author: Thomas F. Quatieri
Publisher: Pearson Education
ISBN: 0132441233
Category : Technology & Engineering
Languages : en
Pages : 1226

Get Book Here

Book Description
Essential principles, practical examples, current applications, and leading-edge research. In this book, Thomas F. Quatieri presents the field's most intensive, up-to-date tutorial and reference on discrete-time speech signal processing. Building on his MIT graduate course, he introduces key principles, essential applications, and state-of-the-art research, and he identifies limitations that point the way to new research opportunities. Quatieri provides an excellent balance of theory and application, beginning with a complete framework for understanding discrete-time speech signal processing. Along the way, he presents important advances never before covered in a speech signal processing text book, including sinusoidal speech processing, advanced time-frequency analysis, and nonlinear aeroacoustic speech production modeling. Coverage includes: Speech production and speech perception: a dual view Crucial distinctions between stochastic and deterministic problems Pole-zero speech models Homomorphic signal processing Short-time Fourier transform analysis/synthesis Filter-bank and wavelet analysis/synthesis Nonlinear measurement and modeling techniques The book's in-depth applications coverage includes speech coding, enhancement, and modification; speaker recognition; noise reduction; signal restoration; dynamic range compression, and more. Principles of Discrete-Time Speech Processing also contains an exceptionally complete series of examples and Matlab exercises, all carefully integrated into the book's coverage of theory and applications.

New Time-frequency Domain Pitch Estimation Methods for Speed Signals Under Low Levels of SNR

New Time-frequency Domain Pitch Estimation Methods for Speed Signals Under Low Levels of SNR PDF Author: Celia Shahnaz
Publisher:
ISBN:
Category :
Languages : en
Pages : 0

Get Book Here

Book Description
The major objective of this research is to develop novel pitch estimation methods capable of handling speech signals in practical situations where only noise-corrupted speech observations are available. With this objective in mind, the estimation task is carried out in two different approaches. In the first approach, the noisy speech observations are directly employed to develop two new time-frequency domain pitch estimation methods. These methods are based on extracting a pitch-harmonic and finding the corresponding harmonic number required for pitch estimation. Considering that voiced speech is the output of a vocal tract system driven by a sequence of pulses separated by the pitch period, in the second approach, instead of using the noisy speech directly for pitch estimation, an excitation-like signal (ELS) is first generated from the noisy speech or its noise- reduced version. In the first approach, at first, a harmonic cosine autocorrelation (HCAC) model of clean speech in terms of its pitch-harmonics is introduced. In order to extract a pitch-harmonic, we propose an optimization technique based on least-squares fitting of the autocorrelation function (ACF) of the noisy speech to the HCAC model. By exploiting the extracted pitch-harmonic along with the fast Fourier transform (FFT) based power spectrum of noisy speech, we then deduce a harmonic measure and a harmonic-to-noise-power ratio (HNPR) to determine the desired harmonic number of the extracted pitch-harmonic. In the proposed optimization, an initial estimate of the pitch-harmonic is obtained from the maximum peak of the smoothed FFT power spectrum. In addition to the HCAC model, where the cross-product terms of different harmonics are neglected, we derive a compact yet accurate harmonic sinusoidal autocorrelation (HSAC) model for clean speech signal. The new HSAC model is then used in the least-squares model-fitting optimization technique to extract a pitch-harmonic. In the second approach, first, we develop a pitch estimation method by using an excitation-like signal (ELS) generated from the noisy speech. To this end, a technique is based on the principle of homomorphic deconvolution is proposed for extracting the vocal-tract system (VTS) parameters from the noisy speech, which are utilized to perform an inverse-filtering of the noisy speech to produce a residual signal (RS). In order to reduce the effect of noise on the RS, a noise-compensation scheme is introduced in the autocorrelation domain. The noise-compensated ACF of the RS is then employed to generate a squared Hilbert envelope (SHE) as the ELS of the voiced speech. With a view to further overcome the adverse effect of noise on the ELS, a new symmetric normalized magnitude difference function of the ELS is proposed for eventual pitch estimation. Cepstrum has been widely used in speech signal processing but has limited capability of handling noise. One potential solution could be the introduction of a noise reduction block prior to pitch estimation based on the conventional cepstrum, a framework already available in many practical applications, such as mobile communication and hearing aids. Motivated by the advantages of the existing framework and considering the superiority of our ELS to the speech itself in providing clues for pitch information, we develop a cepstrum-based pitch estimation method by using the ELS obtained from the noise-reduced speech. For this purpose, we propose a noise subtraction scheme in frequency domain, which takes into account the possible cross-correlation between speech and noise and has advantages of noise being updated with time and adjusted at each frame. The enhanced speech thus obtained is utilized to extract the vocal-tract system (VTS) parameters via the homomorphic deconvolution technique. A residual signal (RS) is then produced by inverse-filtering the enhanced speech with the extracted VTS parameters. It is found that, unlike the previous ELS-based method, the squared Hilbert envelope (SHE) computed from the RS of the enhanced speech without noise compensation, is sufficient to represent an ELS. Finally, in order to tackle the undesirable effect of noise of the ELS at a very low SNR and overcome the limitation of the conventional cepstrum in handling different types of noises, a time-frequency domain pseudo cepstrum of the ELS of the enhanced speech, incorporating information of both magnitude and phase spectra of the ELS, is proposed for pitch estimation. (Abstract shortened by UMI.).

Linear Prediction of Speech

Linear Prediction of Speech PDF Author: J.D. Markel
Publisher: Springer Science & Business Media
ISBN: 3642662862
Category : Science
Languages : en
Pages : 276

Get Book Here

Book Description
During the past ten years a new area in speech processing, generally referred to as linear prediction, has evolved. As with all scientific research, results did not always get published in a logical order and terminology was not always con sistent. In mid-1974, we decided to begin an extra hours and weekends project of organizing the literature in linear prediction of speech and developing it into a unified presentation in terms of content and terminology. This effort was completed in November, 1975, with the contents presented herein. If there are two words which describe our goals in this book, they are unifica tion and depth. Considerable effort has been spent on showing the interrelation ships among various linear prediction formulations and solutions, and in develop ing extensions such as acoustic tube models and synthesis filter structures in a unified manner with consistent terminology. Topics are presented in such a manner that derivations and theoretical details are covered, along with Fortran sub routines and practical considerations. Using this approach we hope to have made the material useful for a wide range of backgrounds and interests.

Pitch estimation of aperiodic and noisy speech signals

Pitch estimation of aperiodic and noisy speech signals PDF Author: T. V. Sreenivas
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description


Cyclostationary Processes and Time Series

Cyclostationary Processes and Time Series PDF Author: Antonio Napolitano
Publisher: Academic Press
ISBN: 0081027370
Category : Technology & Engineering
Languages : en
Pages : 626

Get Book Here

Book Description
Many processes in nature arise from the interaction of periodic phenomena with random phenomena. The results are processes that are not periodic, but whose statistical functions are periodic functions of time. These processes are called cyclostationary and are an appropriate mathematical model for signals encountered in many fields including communications, radar, sonar, telemetry, acoustics, mechanics, econometrics, astronomy, and biology. Cyclostationary Processes and Time Series: Theory, Applications, and Generalizations addresses these issues and includes the following key features. Presents the foundations and developments of the second- and higher-order theory of cyclostationary signals Performs signal analysis using both the classical stochastic process approach and the functional approach for time series Provides applications in signal detection and estimation, filtering, parameter estimation, source location, modulation format classification, and biological signal characterization Includes algorithms for cyclic spectral analysis along with Matlab/Octave code Provides generalizations of the classical cyclostationary model in order to account for relative motion between transmitter and receiver and describe irregular statistical cyclicity in the data

Clinical Assessment of Voice, Second Edition

Clinical Assessment of Voice, Second Edition PDF Author: Robert Thayer Sataloff
Publisher: Plural Publishing
ISBN: 1944883738
Category : Medical
Languages : en
Pages : 777

Get Book Here

Book Description
In Clinical Assessment of Voice, Second Edition, Dr. Sataloff brings together a dynamic group of professionals who share his interdisciplinary philosophy of voice care. They provide an introduction to medical diagnostics and special problems with professional performers and voice users and offer a rare look at the assessment procedures used by the top voice care teams in the world. Clinical Assessment of Voice, Second Edition, includes chapters written by individuals with specialties in laryngology, teaching of singing and acting, voice science, and speech-language pathology, nursing, and acoustics. Starting with an extensive case history and following with the physical examination, the objective documentation in the voice laboratory, and the latest diagnostic imaging with laryngeal computed tomography and strobovideolaryngoscopy, the chapters delineate the possible diagnoses and treatment approaches that currently represent the state of the art in assessment of voice disorders. Added is current information on the medical-legal evaluation, now ever more important for the professional performer. New to this edition: New chapters on high-speed digital imaging, evolution of technology, magnetic resonance imaging, pediatric voice disorders, and thyroid disorders.Many chapters have been rewritten extensively to include the most recent practices and techniques, as well as updated references.Discussion of a large number of studies that were not addressed previously and a review of the latest literature, while also retaining classic literature.New information on topics such as measuring voice treatment outcomes, World Trade Center syndrome, and laryngeal effects of asbestos exposure.A selection of new authors who provide an interdisciplinary approach and valuable insights into the care of vocal performers. Clinical Assessment of Voice, Second Edition is ideal for speech-language pathology students and clinicians and is suitable for classroom use as well as for reference. For practicing otolaryngologists and speech-language pathologists, it is an invaluable guide for understanding the techniques for proper diagnosis and for organizing a plan of treatment. For singers and performers, knowledge of the assessment process is presented in a manner that allows them to determine what level of assessment they should pursue for the most current treatment.

Pitch Determination of Speech Signals Using the Generalized Spectrum

Pitch Determination of Speech Signals Using the Generalized Spectrum PDF Author: Tim Black
Publisher:
ISBN:
Category :
Languages : en
Pages : 128

Get Book Here

Book Description