Author: Thilo Michael
Publisher: Springer Nature
ISBN: 3031318447
Category : Technology & Engineering
Languages : en
Pages : 157
Book Description
This book discusses the simulation of conversations through a novel approach of predicting speech quality based on the interactions of two simulated interlocutors. The author describes the setup of a simulation environment that is capable of simulating human dialogue on the speech level. The impact of delay and bursty packet loss on VoIP conversations is investigated and modeled for use in the simulation. Based on parameters extracted from simulated conversations, the author proposes extensions to the E-model, a parametric model standardized by the International Telecommunication Union, in order to predict the quality of the simulated conversations. The author shows that predictions based on the simulated conversations outperform models that rely on the transmission parameters alone.
Assessment and Prediction of Speech Quality in Telecommunications
Author: Sebastian Möller
Publisher: Springer Science & Business Media
ISBN: 1475731175
Category : Science
Languages : en
Pages : 253
Book Description
The quality of a telecommunication voice service is largely influenced by the quality of the transmission system. Nevertheless, the analysis, synthesis and prediction of quality should take into account its multidimensional aspects. Quality can be regarded as a point where the perceived characteristics and the desired or expected ones meet. A schematic is presented which classifies different entities which contribute to the quality of a service, taking into account conversational, user as well as service related contributions. Starting from this concept, perceptively relevant constituents of speech communication quality are identified. The perceptive factors result from elements of the transmission configuration. A simulation model is developed and implemented which allows the most relevant parameters of traditional transmission configurations to be manipulated, in real time and for the conversation situation. Inputs into the simulation are instrumentally measurable quality elements commonly used in transmission planning of telephone networks. A reduced set of these quality elements forms a basis for models which aim at predicting mouth-to-ear quality as it would be perceived by a user of the system. These models are an important tool for the planner of telecommunication networks, as they allow the expected quality to be estimated in advance, even before the network has been set up. Two well-known models (the SUBMOD and the E-model) are analyzed in more detail, with an emphasis on the psychoacoustic and psychophysical backgrounds.
Audiovisual Quality Assessment and Prediction for Videotelephony
Author: Benjamin Belmudez
Publisher: Springer
ISBN: 331914166X
Category : Technology & Engineering
Languages : en
Pages : 196
Book Description
The work presented in this book focuses on modeling audiovisual quality as perceived by the users of IP-based solutions for video communication like videotelephony. It also extends the current framework for the parametric prediction of audiovisual call quality. The book addresses several aspects related to the quality perception of entire video calls, namely, the quality estimation of the single audio and video modalities in an interactive context, the audiovisual quality integration of these modalities and the temporal pooling of short sample-based quality scores to account for the perceptual quality impact of time-varying degradations.
Simulating Conversations for the Prediction of Speech Quality
Author: Thilo Michael
Publisher:
ISBN: 9783031318450
Category :
Languages : en
Pages : 0
Book Description
This book discusses the simulation of conversations through a novel approach of predicting speech quality based on the interactions of two simulated interlocutors. The author describes the setup of a simulation environment that is capable of simulating human dialogue on the speech level. The impact of delay and bursty packet loss on VoIP conversations is investigated and modeled for use in the simulation. Based on parameters extracted from simulated conversations, the author proposes extensions to the E-model, a parametric model standardized by the International Telecommunication Union, in order to predict the quality of the simulated conversations. The author shows that predictions based on the simulated conversations outperform models that rely on the transmission parameters alone. The book presents an overview of the technical setup of a simulation able to replicate individual interactions, includes insights into the changes in individual interactions that occur due to delay and packet loss, and describes and extends the state of the art in parametric speech quality prediction.
Speech Quality of VoIP
Author: Alexander Raake
Publisher: John Wiley & Sons
ISBN: 0470032995
Category : Technology & Engineering
Languages : en
Pages : 336
Book Description
Finally, a comprehensive overview of speech quality in VoIP from the user's perspective! Speech Quality of VoIP is an essential guide to assessing the speech quality of VoIP networks, whilst addressing the implications for the design of VoIP networks and systems. This book bridges the gap between the technical network-world and the psychoacoustic world of quality perception. Alexander Raake’s unique perspective combines awareness of the technical characteristics of VoIP networks and original research concerning the perception of speech transmitted across them. Starting from the network designer’s point of view, the different characteristics of the network are addressed, and then linked to features perceived by users. This book provides an overview of the available knowledge on the principal, relevant aspects of speech and speech quality perception, of speech quality assessment, of transmission properties of telephone and VoIP networks, and of the related perceptual features and resulting speech quality. Discussing new research into the specific time-varying degradations VoIP brings along, but also the considerable potential of quality improvement to be achieved with wideband speech transmission, Alexander Raake demonstrates how network and service characteristics impact on the user's perception of quality. Speech Quality of VoIP: Offers an insight into speech quality of VoIP from a user's perspective. Presents an overview of different modelling approaches and a parametric network-planning model for quality prediction in VoIP networks. Draws on innovative new research on the quality degradation characteristic of VoIP. Explains in detail how telephone speech quality can be greatly enhanced with VoIP’s wideband speech transmission capability. Assesses the vast collection of references into the technical and scientific literature related to VoIP quality. Illustrates concepts throughout with mathematical models, algorithms and simulations.
Speech Quality of VoIP is the definitive guide for researchers, engineers and network planners working in the field of VoIP, Quality of Service, and speech communication processing in telecommunications. Advanced undergraduate and graduate students on telecommunication and networking courses will also find this text an invaluable resource.
Speech and Computer
Author: Alexey Karpov
Publisher: Springer Nature
ISBN: 3030602761
Category : Computers
Languages : en
Pages : 704
Book Description
This book constitutes the proceedings of the 22nd International Conference on Speech and Computer, SPECOM 2020, held in St. Petersburg, Russia, in October 2020. The 65 papers presented were carefully reviewed and selected from 160 submissions. The papers present current research in the area of computer speech processing including speech science, speech technology, natural language processing, human-computer interaction, language identification, multimedia processing, human-machine interaction, deep learning for audio processing, computational paralinguistics, affective computing, speech and language resources, speech translation systems, text mining and sentiment analysis, voice assistants, etc. Due to the COVID-19 pandemic, SPECOM 2020 was held as a virtual event.
Quality of Experience
Author: Sebastian Möller
Publisher: Springer
ISBN: 331902681X
Category : Technology & Engineering
Languages : en
Pages : 431
Book Description
This pioneering book develops definitions and concepts related to Quality of Experience in the context of multimedia- and telecommunications-related applications, systems and services and applies these to various fields of communication and media technologies. The editors bring together numerous key protagonists of the new discipline “Quality of Experience” and combine the state-of-the-art knowledge in a single volume.
Deep Learning Based Speech Quality Prediction
Author: Gabriel Mittag
Publisher: Springer Nature
ISBN: 3030914798
Category : Technology & Engineering
Languages : en
Pages : 171
Book Description
This book presents how to apply recent machine learning (deep learning) methods for the task of speech quality prediction. The author shows how recent advancements in machine learning can be leveraged for the task of speech quality prediction and provides an in-depth analysis of the suitability of different deep learning architectures for this task. The author then shows how the resulting model outperforms traditional speech quality models and provides additional information about the cause of a quality impairment through the prediction of the speech quality dimensions of noisiness, coloration, discontinuity, and loudness.
Assessment and Prediction of Speech Quality in Telecommunications
Author: Sebastian Möller
Publisher: Springer Science & Business Media
ISBN: 9780792378945
Category : Science
Languages : en
Pages : 268
Book Description
The quality of telecommunication voice services has become an important issue due to the evolving and liberalized market. With the advent of new technologies, however, a diversification takes place which makes it necessary to carefully plan and observe network quality. Speech communication quality - as it is perceived by the user or customer of a service - carries a multidimensional nature, a fact which must be reflected in its assessment and prediction with quality models. In this book a new schematic is developed which classifies different entities contributing to the quality of a service. It takes into account conversational, user as well as service-related contributions. Starting from this concept, perceptively relevant constituents of speech communication quality are identified. A simulation model is developed and implemented, based on physical elements of the transmission configuration. It allows the perceptively most relevant parameters to be simulated, in real time and for the conversation situation. The book gives a valuable overview on assessment needed for reliably measuring the different quality dimensions. For the planning of telephone networks, quality models are presented which aim at predicting mouth-to-ear quality as it would be perceived by a user of the system. These models are an important tool for the planner of telecommunication networks, as they allow the expected quality to be estimated in advance, even before the network has been set up. Two well-known models (the SUBMOD and the E-model) are analyzed in more detail, with an emphasis on the psychoacoustic and psychophysical backgrounds. It turns out that model predictions are satisfactory for many types of degradations, but they can still be improved, especially for new types of impairments. Proposals are made for quality model enhancement and combined approaches.
Due to its `handbook' character, this book is an invaluable source of background information for anyone working in the field of speech quality assessment as well as telephone network planning and operation.
Integral and Diagnostic Intrusive Prediction of Speech Quality
Author: Nicolas Côté
Publisher: Springer Science & Business Media
ISBN: 3642184634
Category : Technology & Engineering
Languages : en
Pages : 255
Book Description
This work deals with instrumental measurement methods for the perceived quality of transmitted speech. These measures simulate the speech perception process employed by human subjects during auditory experiments. The measure standardized by the International Telecommunication Union (ITU), called “Wideband Perceptual Evaluation of Speech Quality (WB-PESQ)”, is not able to quantify all these perceived characteristics on a unidimensional quality scale, the Mean Opinion Score (MOS) scale. Recent experimental studies showed that subjects use several perceptual dimensions to judge the quality of speech signals. In order to represent the signal at a higher stage of perception, a new model, called “Diagnostic Instrumental Assessment of Listening quality (DIAL)”, has been developed. It includes a perceptual and a cognitive model which simulate the whole quality judgment process. Except for strong discontinuities, DIAL predicts the speech quality of different speech processing and transmission systems very well, and it outperforms WB-PESQ.