Author:
Publisher:
ISBN:
Category : Electro-acoustics
Languages : en
Pages : 664
Book Description
ICASSP 93
Speech Recognition and Coding
Author: Antonio J. Rubio Ayuso
Publisher: Springer Science & Business Media
ISBN: 3642577458
Category : Technology & Engineering
Languages : en
Pages : 517
Book Description
Based on a NATO Advanced Study Institute held in 1993, this book addresses recent advances in automatic speech recognition and speech coding. The book contains contributions by many of the most outstanding researchers from the best laboratories worldwide in the field. The contributions have been grouped into five parts: on acoustic modeling; language modeling; speech processing, analysis and synthesis; speech coding; and vector quantization and neural nets. For each of these topics, some of the best-known researchers were invited to give a lecture. In addition to these lectures, the topics were complemented with discussions and presentations of the work of those attending. Altogether, the reader is given a wide perspective on recent advances in the field and will be able to see the trends for future work.
Automatic Speech Translation
Author: Akira Kurematsu
Publisher: CRC Press
ISBN: 1000673588
Category : Technology & Engineering
Languages : en
Pages : 136
Book Description
Automatic Speech Translation introduces recent results of Japanese research and development in speech translation and speech recognition. Topics covered include: fundamental concepts of speech recognition; speech pattern representation; phoneme-based HMM phoneme recognition; continuous speech recognition; speaker adaptation; speaker-independent speech recognition; utterance analysis, utterance transfer, utterance generation; contextual processing; speech synthesis and an experimental system of speech translation. This book presents the complicated technological aspects of machine translation and speech recognition, and outlines the future directions of this rapidly developing area of technology.
Modern Methods of Speech Processing
Author: Ravi P. Ramachandran
Publisher: Springer Science & Business Media
ISBN: 1461522811
Category : Technology & Engineering
Languages : en
Pages : 471
Book Description
The term speech processing refers to the scientific discipline concerned with the analysis and processing of speech signals for getting the best benefit in various practical scenarios. These different practical scenarios correspond to a large variety of applications of speech processing research. Examples of some applications include enhancement, coding, synthesis, recognition and speaker recognition. A very rapid growth, particularly during the past ten years, has resulted due to the efforts of many leading scientists. The ideal aim is to develop algorithms for a certain task that maximize performance, are computationally feasible and are robust to a wide class of conditions. The purpose of this book is to provide a cohesive collection of articles that describe recent advances in various branches of speech processing. The main focus is in describing specific research directions through a detailed analysis and review of both the theoretical and practical settings. The intended audience includes graduate students who are embarking on speech research as well as the experienced researcher already working in the field. For graduate students taking a course, this book serves as a supplement to the course material. As the student focuses on a particular topic, the corresponding set of articles in this book will serve as an initiation through exposure to research issues and by providing an extensive reference list to commence a literature survey. Experienced researchers can utilize this book as a reference guide and can expand their horizons in this rather broad area.
Automatic Speech and Speaker Recognition
Author: Chin-Hui Lee
Publisher: Springer Science & Business Media
ISBN: 1461313678
Category : Technology & Engineering
Languages : en
Pages : 524
Book Description
Research in the field of automatic speech and speaker recognition has made a number of significant advances in the last two decades, influenced by advances in signal processing, algorithms, architectures, and hardware. These advances include: the adoption of a statistical pattern recognition paradigm; the use of the hidden Markov modeling framework to characterize both the spectral and the temporal variations in the speech signal; the use of a large set of speech utterance examples from a large population of speakers to train the hidden Markov models of some fundamental speech units; the organization of speech and language knowledge sources into a structural finite state network; and the use of dynamic programming-based heuristic search methods to find the best word sequence in the lexical network corresponding to the spoken utterance. Automatic Speech and Speaker Recognition: Advanced Topics groups together in a single volume a number of important topics on speech and speaker recognition, topics which are of fundamental importance, but not yet covered in detail in existing textbooks. Although no explicit partition is given, the book is divided into five parts: Chapters 1-2 are devoted to technology overviews; Chapters 3-12 discuss acoustic modeling of fundamental speech units and lexical modeling of words and pronunciations; Chapters 13-15 address the issues related to flexibility and robustness; Chapters 16-18 concern the theoretical and practical issues of search; Chapters 19-20 give two examples of algorithm and implementational aspects for recognition system realization. Audience: A reference book for speech researchers and graduate students interested in pursuing potential research on the topic. May also be used as a text for advanced courses on the subject.
1993 IEEE International Symposium on Circuits and Systems
Author:
Publisher:
ISBN:
Category : Electric filters
Languages : en
Pages : 1032
Book Description
ICANN ’93
Author: Stan Gielen
Publisher: Springer Science & Business Media
ISBN: 1447120639
Category : Computers
Languages : en
Pages : 1116
Book Description
This book contains the proceedings of the International Conference on Artificial Neural Networks which was held between September 13 and 16 in Amsterdam. It is the third in a series which started two years ago in Helsinki and which last year took place in Brighton. Thanks to the European Neural Network Society, ICANN has emerged as the leading conference on neural networks in Europe. Neural networks is a field of research which has enjoyed a rapid expansion and great popularity in both the academic and industrial research communities. The field is motivated by the commonly held belief that applications in the fields of artificial intelligence and robotics will benefit from a good understanding of the neural information processing properties that underlie human intelligence. Essential aspects of neural information processing are highly parallel execution of computation, integration of memory and process, and robustness against fluctuations. It is believed that intelligent skills, such as perception, motion and cognition, can be more easily realized in neuro-computers than in a conventional computing paradigm. This requires active research in neurobiology to extract computational principles from experimental neurobiological findings, in physics and mathematics to study the relation between architecture and function in neural networks, and in cognitive science to study higher brain functions, such as language and reasoning. Neural networks technology has already led to practical methods that solve real problems in a wide area of industrial applications. The clusters on robotics and applications contain sessions on various sub-topics in these fields.
Analysis, Synthesis, and Perception of Musical Sounds
Author: James Beauchamp
Publisher: Springer Science & Business Media
ISBN: 038732576X
Category : Science
Languages : en
Pages : 348
Book Description
This book contains a complete and accurate mathematical treatment of the sounds of music with an emphasis on musical timbre. The book spans the range from tutorial introduction to advanced research and application to speculative assessment of its various techniques. All the contributors use a generalized additive sine wave model for describing musical timbre which gives a conceptual unity, but is of sufficient utility to be adapted to many different tasks.
Cosine-/Sine-Modulated Filter Banks
Author: Vladimir Britanak
Publisher: Springer
ISBN: 3319610805
Category : Technology & Engineering
Languages : en
Pages : 664
Book Description
This book covers in detail various algorithmic developments in perfect reconstruction cosine/sine-modulated filter banks (TDAC-MDCT/MDST or MLT, MCLT, low delay MDCT, complex exponential/cosine/sine-modulated QMF filter banks) and near-perfect reconstruction QMF banks (pseudo-QMF banks), including their general mathematical properties, matrix representations, fast algorithms and various methods of integer approximation, which have recently become a new transform technology for lossless audio coding. Each chapter contains a number of examples and concludes with problems and exercises. The book reflects the authors' research efforts and results over the last 20 years.
Hidden Semi-Markov Models
Author: Shun-Zheng Yu
Publisher: Morgan Kaufmann
ISBN: 0128027711
Category : Mathematics
Languages : en
Pages : 209
Book Description
Hidden semi-Markov models (HSMMs) are among the most important models in the area of artificial intelligence / machine learning. Since the first HSMM was introduced in 1980 for machine recognition of speech, three other HSMMs have been proposed, with various definitions of duration and observation distributions. Those models have different expressions, algorithms, computational complexities, and applicable areas, without explicitly interchangeable forms. Hidden Semi-Markov Models: Theory, Algorithms and Applications provides a unified and foundational approach to HSMMs, including various HSMMs (such as the explicit duration, variable transition, and residential time of HSMMs), inference and estimation algorithms, implementation methods and application instances. Learn new developments and state-of-the-art emerging topics as they relate to HSMMs, presented with examples drawn from medicine, engineering and computer science.
- Discusses the latest developments and emerging topics in the field of HSMMs
- Includes a description of applications in various areas, including Human Activity Recognition, Handwriting Recognition, Network Traffic Characterization and Anomaly Detection, and Functional MRI Brain Mapping
- Shows how to master the basic techniques needed for using HSMMs and how to apply them