Assessment and Prediction of Speech Quality in Telecommunications

Assessment and Prediction of Speech Quality in Telecommunications PDF Author: Sebastian Möller
Publisher: Springer Science & Business Media
ISBN: 1475731175
Category : Science
Languages : en
Pages : 253

Get Book Here

Book Description
The quality of a telecommunication voice service is largely inftuenced by the quality of the transmission system. Nevertheless, the analysis, synthesis and prediction of quality should take into account its multidimensional aspects. Quality can be regarded as a point where the perceived characteristics and the desired or expected ones meet. A schematic is presented which classifies different entities which contribute to the quality of a service, taking into account conversational, user as weIl as service related contributions. Starting from this concept, perceptively relevant constituents of speech communication quality are identified. The perceptive factors result from ele ments of the transmission configuration. A simulation model is developed and implemented which allows the most relevant parameters of traditional trans mission configurations to be manipulated, in real time and for the conversation situation. Inputs into the simulation are instrumentally measurable quality elements commonly used in transmission planning of telephone networks. A reduced set of these quality elements forms a basis for models which aim at predicting mouth-to-ear quality as it would be perceived by a user of the sys tem. These models are an important tool for the planner of telecommunication networks, as they allow the expected quality to be estimated in advance, even before the network has been set up. Two well-known models (the SUBMOD and the E-model) are analyzed in more detail, with an emphasis on the psy choacoustic and psychophysical backgrounds.

Voice and Speech Quality Perception

Voice and Speech Quality Perception PDF Author: Ute Jekosch
Publisher: Springer Science & Business Media
ISBN: 3540288600
Category : Science
Languages : en
Pages : 208

Get Book Here

Book Description
Foundations of Voice and Speech Quality Perception starts out with the fundamental question of: "How do listeners perceive voice and speech quality and how can these processes be modeled?" Any quantitative answers require measurements. This is natural for physical quantities but harder to imagine for perceptual measurands. This book approaches the problem by actually identifying major perceptual dimensions of voice and speech quality perception, defining units wherever possible and offering paradigms to position these dimensions into a structural skeleton of perceptual speech and voice quality. The emphasis is placed on voice and speech quality assessment of systems in artificial scenarios. Many scientific fields are involved. This book bridges the gap between two quite diverse fields, engineering and humanities, and establishes the new research area of Voice and Speech Quality Perception.

Multimedia Analysis, Processing and Communications

Multimedia Analysis, Processing and Communications PDF Author: Lin Weisi
Publisher: Springer Science & Business Media
ISBN: 3642195504
Category : Mathematics
Languages : en
Pages : 753

Get Book Here

Book Description
This book has brought 24 groups of experts and active researchers around the world together in image processing and analysis, video processing and analysis, and communications related processing, to present their newest research results, exchange latest experiences and insights, and explore future directions in these important and rapidly evolving areas. It aims at increasing the synergy between academic and industry professionals working in the related field. It focuses on the state-of-the-art research in various essential areas related to emerging technologies, standards and applications on analysis, processing, computing, and communication of multimedia information. The target audience of this book is researchers and engineers as well as graduate students working in various disciplines linked to multimedia analysis, processing and communications, e.g., computer vision, pattern recognition, information technology, image processing, and artificial intelligence. The book is also meant to a broader audience including practicing professionals working in image/video applications such as image processing, video surveillance, multimedia indexing and retrieval, and so on. We hope that the researchers, engineers, students and other professionals who read this book would find it informative, useful and inspirational toward their own work in one way or another.

Human Information Processing in Speech Quality Assessment

Human Information Processing in Speech Quality Assessment PDF Author: Stefan Uhrig
Publisher:
ISBN: 9783030713904
Category :
Languages : en
Pages : 0

Get Book Here

Book Description
This book provides a new multi-method, process-oriented approach towards speech quality assessment, which allows readers to examine the influence of speech transmission quality on a variety of perceptual and cognitive processes in human listeners. Fundamental concepts and methodologies surrounding the topic of process-oriented quality assessment are introduced and discussed. The book further describes a functional process model of human quality perception, which theoretically integrates results obtained in three experimental studies. This book's conceptual ideas, empirical findings, and theoretical interpretations should be of particular interest to researchers working in the fields of Quality and Usability Engineering, Audio Engineering, Psychoacoustics, Audiology, and Psychophysiology. Presents a new process-oriented approach towards speech quality assessment to uncover influences of speech transmission quality on human information processing; Proposes a multi-method assessment approach including subjective, behavioral, and neurophysiological levels of analysis; Reports findings from three experimental studies, that demonstrate interactions between perceived speech quality, contextual, and content-related influencing factors.

The Assessment of Speech Quality

The Assessment of Speech Quality PDF Author: R. S. Nickerson
Publisher:
ISBN:
Category :
Languages : en
Pages : 71

Get Book Here

Book Description
Various methods that have been used to assess the quality of speech are reviewed. These methods are organized under three general topics: unidimensional quality assessment, judging of individual speech qualities, and multidimensional scaling. The importance of effects attributable to speech material, talkers and listeners is emphasized. The desirability of objective measures of speech quality is noted and some candidate measures are discussed. Finally, the relationship between quality assessment and intelligibility testing is briefly considered.

Integral and Diagnostic Intrusive Prediction of Speech Quality

Integral and Diagnostic Intrusive Prediction of Speech Quality PDF Author: Nicolas Côté
Publisher: Springer Science & Business Media
ISBN: 3642184634
Category : Technology & Engineering
Languages : en
Pages : 255

Get Book Here

Book Description
This work deals with the instrumental measurement methods for the perceived quality of transmitted speech. These measures simulate the speech perception process employed by human subjects during auditory experiments. The measure standardized by the International Telecommunication Union (ITU), called “Wideband-Perceptual Speech Quality Evaluation (WB-PESQ)”, is not able to quantify all these perceived characteristics on a unidimensional quality scale, the Mean Opinion Score (MOS) scale. Recent experimental studies showed that subjects make use of several perceptual dimensions to judge about the quality of speech signals. In order to represent the signal at a higher stage of perception, a new model, called “Diagnostic Instrumental Assessment of Listening quality (DIAL)”, has been developed. It includes a perceptual and a cognitive model which simulate the whole quality judgment process. Except for strong discontinuities, DIAL predicts very well speech quality of different speech processing and transmission systems, and it outperforms the WB-PESQ.

Data-driven Non-intrusive Speech Quality and Intelligibility Assessment

Data-driven Non-intrusive Speech Quality and Intelligibility Assessment PDF Author: Xuan Dong (Data scientist)
Publisher:
ISBN:
Category : Automatic speech recognition
Languages : en
Pages : 173

Get Book Here

Book Description
Speech quality and intelligibility are vital factors when assessing a listening environment, communication channel, or speech enhancement algorithm. Subjective listening studies are the most accurate forms of assessing speech quality and intelligibility. However, this assessment form is generally costly and time-consuming to perform when a large-scale assessment is needed. Thus, objective methods are often used since they provide low cost and efficient speech assessment quickly.Objective methods can be divided into two categories. Intrusive measures assess a distorted speech signal's quality and intelligibility by comparing it to its clean, undistorted version. Hence, a clean reference signal is required. A fundamental limitation of intrusive measures is that the reference signal is usually not available in real-world environments or challenging to obtain. Non-intrusive measures, on the other hand, assess speech based on the distorted signal only, which means that real-world assessment is possible. Although existing non-intrusive approaches enable real-world evaluation, their real-world capabilities are limited since (1) current measures are developed from simulated data that does not adequately model real environments; and (2) they predict objective scores that are not always strongly correlated with subjective ratings as compared to their intrusive counterparts. Hence, an active research area involves developing non-intrusive algorithms that can assess speech in real-life scenarios and are strongly correlated with human assessment. This dissertation developed several data-driven non-intrusive algorithms to better assess speech quality and intelligibility in real environments. The contribution of this dissertation is threefold. First, a two-stage framework that estimates the objective score of these intrusive measures at the frame-level is proposed. Second, three neural network-based models are developed to provide utterance-level speech evaluation. Third, since a large dataset of real-world signals with listener quality ratings did not previously exist, to help facilitate real-world assessment, crowdsourcing listening studies are conducted to obtain perceptual quality ratings from human participants. An encoder-decoder model with an attention mechanism is proposed to predict the human-level perceived speech quality on real-world signals.

Springer Handbook of Speech Processing

Springer Handbook of Speech Processing PDF Author: Jacob Benesty
Publisher: Springer Science & Business Media
ISBN: 3540491252
Category : Technology & Engineering
Languages : en
Pages : 1170

Get Book Here

Book Description
This handbook plays a fundamental role in sustainable progress in speech research and development. With an accessible format and with accompanying DVD-Rom, it targets three categories of readers: graduate students, professors and active researchers in academia, and engineers in industry who need to understand or implement some specific algorithms for their speech-related products. It is a superb source of application-oriented, authoritative and comprehensive information about these technologies, this work combines the established knowledge derived from research in such fast evolving disciplines as Signal Processing and Communications, Acoustics, Computer Science and Linguistics.

Dimension-based Quality Modeling of Transmitted Speech

Dimension-based Quality Modeling of Transmitted Speech PDF Author: Marcel Wältermann
Publisher: Springer Science & Business Media
ISBN: 3642350194
Category : Technology & Engineering
Languages : en
Pages : 208

Get Book Here

Book Description
In this book, speech transmission quality is modeled on the basis of perceptual dimensions. The author identifies those dimensions that are relevant for today's public-switched and packet-based telecommunication systems, regarding the complete transmission path from the mouth of the speaker to the ear of the listener. Both narrowband (300-3400 Hz) as well as wideband (50-7000 Hz) speech transmission is taken into account. A new analytical assessment method is presented that allows the dimensions to be rated by non-expert listeners in a direct way. Due to the efficiency of the test method, a relatively large number of stimuli can be assessed in auditory tests. The test method is applied in two auditory experiments. The book gives the evidence that this test method provides meaningful and reliable results. The resulting dimension scores together with respective overall quality ratings form the basis for a new parametric model for the quality estimation of transmitted speech based on the perceptual dimensions. In a two-step model approach, instrumental dimension models estimate dimension impairment factors in a first step. The resulting dimension estimates are combined by a Euclidean integration function in a second step in order to provide an estimate of the total impairment.

Uniform Non-Intrusive Speech Quality Assessment Model

Uniform Non-Intrusive Speech Quality Assessment Model PDF Author: Elez Shenhar
Publisher: LAP Lambert Academic Publishing
ISBN: 9783659189722
Category :
Languages : en
Pages : 80

Get Book Here

Book Description
Speech quality plays key role in defining the consumer's perception of the overall Quality of Service offered to him by his telecommunications service provider. For this reason, it is essential for service providers to have the ability of online assessment of the perceived quality of speech during live calls. New technologies constantly emerge in the world of telecommunications, and along with their benefits come new types of potential degradations to call quality, which need to be monitored for, discovered, and quantified in a perceptually meaningful way. This book presents a new, non-intrusive speech quality assessment model. Detection of no specific type of degradation is performed by the model, only the overall pleasantness and intelligibility is assessed. This design choice was made in order to create a quality assessment method that can capably assess the quality of speech, under never-before-seen degradation types. Performance of the proposed model is shown to be competitive with those of current state-of-the-art quality assessment models. This book should be useful to any researcher in the field of speech, as well as professionals in the telecommunications industry.