Author:
Publisher:
ISBN:
Category : Psycholinguistics
Languages : en
Pages : 764
Book Description
ICSLP ...
ICSLP 96
Author:
Publisher:
ISBN:
Category : Automatic speech recognition
Languages : en
Pages : 576
Book Description
Publisher:
ISBN:
Category : Automatic speech recognition
Languages : en
Pages : 576
Book Description
ICSLP 92 Proceedings
Author:
Publisher:
ISBN:
Category : Automatic speech recognition
Languages : en
Pages : 872
Book Description
Publisher:
ISBN:
Category : Automatic speech recognition
Languages : en
Pages : 872
Book Description
Automatic Speech Translation
Author: Akira Kurematsu
Publisher: CRC Press
ISBN: 1000673588
Category : Technology & Engineering
Languages : en
Pages : 136
Book Description
Automatic Speech Translation introduces recent results of Japanese research and development in speech translation and speech recognition. Topics covered include: fundamental concepts of speech recognition; speech pattern representation; phoneme-based HMM phoneme recognition; continuous speech recognition; speaker adaptation; speaker-independent speech recognition; utterance analysis, utterance transfer, utterance generation; contextual processĀing; speech synthesis and an experimental system of speech translation. This book presents the complicated technological aspects of machine translation and speech recognition, and outlines the future directions of this rapidly developing area of technology.
Publisher: CRC Press
ISBN: 1000673588
Category : Technology & Engineering
Languages : en
Pages : 136
Book Description
Automatic Speech Translation introduces recent results of Japanese research and development in speech translation and speech recognition. Topics covered include: fundamental concepts of speech recognition; speech pattern representation; phoneme-based HMM phoneme recognition; continuous speech recognition; speaker adaptation; speaker-independent speech recognition; utterance analysis, utterance transfer, utterance generation; contextual processĀing; speech synthesis and an experimental system of speech translation. This book presents the complicated technological aspects of machine translation and speech recognition, and outlines the future directions of this rapidly developing area of technology.
Text- and Speech-Triggered Information Access
Author: Steve Renals
Publisher: Springer Science & Business Media
ISBN: 3540406352
Category : Computers
Languages : en
Pages : 203
Book Description
This book presents revised versions of the lectures given at the 8th ELSNET European Summer School on Language and Speech Communication held on the Island of Chios, Greece, in summer 2000. Besides an introductory survey, the book presents lectures on data analysis for multimedia libraries, pronunciation modeling for large vocabulary speech recognition, statistical language modeling, very large scale information retrieval, reduction of information variation in text, and a concluding chapter on open questions in research for linguistics in information access. The book gives newcomers to language and speech communication a clear overview of the main technologies and problems in the area. Researchers and professionals active in the area will appreciate the book as a concise review of the technologies used in text- and speech-triggered information access.
Publisher: Springer Science & Business Media
ISBN: 3540406352
Category : Computers
Languages : en
Pages : 203
Book Description
This book presents revised versions of the lectures given at the 8th ELSNET European Summer School on Language and Speech Communication held on the Island of Chios, Greece, in summer 2000. Besides an introductory survey, the book presents lectures on data analysis for multimedia libraries, pronunciation modeling for large vocabulary speech recognition, statistical language modeling, very large scale information retrieval, reduction of information variation in text, and a concluding chapter on open questions in research for linguistics in information access. The book gives newcomers to language and speech communication a clear overview of the main technologies and problems in the area. Researchers and professionals active in the area will appreciate the book as a concise review of the technologies used in text- and speech-triggered information access.
Springer Handbook of Speech Processing
Author: Jacob Benesty
Publisher: Springer Science & Business Media
ISBN: 3540491252
Category : Technology & Engineering
Languages : en
Pages : 1170
Book Description
This handbook plays a fundamental role in sustainable progress in speech research and development. With an accessible format and with accompanying DVD-Rom, it targets three categories of readers: graduate students, professors and active researchers in academia, and engineers in industry who need to understand or implement some specific algorithms for their speech-related products. It is a superb source of application-oriented, authoritative and comprehensive information about these technologies, this work combines the established knowledge derived from research in such fast evolving disciplines as Signal Processing and Communications, Acoustics, Computer Science and Linguistics.
Publisher: Springer Science & Business Media
ISBN: 3540491252
Category : Technology & Engineering
Languages : en
Pages : 1170
Book Description
This handbook plays a fundamental role in sustainable progress in speech research and development. With an accessible format and with accompanying DVD-Rom, it targets three categories of readers: graduate students, professors and active researchers in academia, and engineers in industry who need to understand or implement some specific algorithms for their speech-related products. It is a superb source of application-oriented, authoritative and comprehensive information about these technologies, this work combines the established knowledge derived from research in such fast evolving disciplines as Signal Processing and Communications, Acoustics, Computer Science and Linguistics.
Automatic Assessment of Children Speech to Support Language Learning
Author: Christian Hacker
Publisher: Logos Verlag Berlin GmbH
ISBN: 3832522581
Category : Computers
Languages : en
Pages : 272
Book Description
Focus of this work are pattern recognition related aspects of computer assisted pronunciation training (CAPT) for second language learning. An overview of commercial systems shows that pronunciation training is being addressed by the growing field of computer assisted language learning only to a small extend, although in the state-of-the-art section a number of such approaches for automatic assessment can already be presented. In the present thesis different approaches are extended and combined. In particular a large set of nearly 200 pronunciation and prosodic features is developed. By this approach pronunciation scoring is regarded as classification task in high-dimensional feature space. Automatic speech recognition is the basis of most pronunciation scoring algorithms. In this thesis a system is presented, which supports second language learning at school, i.e. the target users are children. For this reason a state-of-the-art speech recognition engine is adapted to children speech, since young speakers are only hardly recognised by automatic systems. Phonetically motivated rules for typical mispronunciation errors are integrated into the system to make it suitable for pronunciation scoring. Evaluating an algorithm for pronunciation assessment is more difficult than simply counting the correctly recognised mistakes, since there exists no objective ground truth. This can be shown by evaluating the annotations of 14 teachers. However, with different measures it can be verified that the accuracy of the system (in comparison with teachers) thoroughly reaches the agreement among teachers. The evaluation is conducted with native German speakers learning English.
Publisher: Logos Verlag Berlin GmbH
ISBN: 3832522581
Category : Computers
Languages : en
Pages : 272
Book Description
Focus of this work are pattern recognition related aspects of computer assisted pronunciation training (CAPT) for second language learning. An overview of commercial systems shows that pronunciation training is being addressed by the growing field of computer assisted language learning only to a small extend, although in the state-of-the-art section a number of such approaches for automatic assessment can already be presented. In the present thesis different approaches are extended and combined. In particular a large set of nearly 200 pronunciation and prosodic features is developed. By this approach pronunciation scoring is regarded as classification task in high-dimensional feature space. Automatic speech recognition is the basis of most pronunciation scoring algorithms. In this thesis a system is presented, which supports second language learning at school, i.e. the target users are children. For this reason a state-of-the-art speech recognition engine is adapted to children speech, since young speakers are only hardly recognised by automatic systems. Phonetically motivated rules for typical mispronunciation errors are integrated into the system to make it suitable for pronunciation scoring. Evaluating an algorithm for pronunciation assessment is more difficult than simply counting the correctly recognised mistakes, since there exists no objective ground truth. This can be shown by evaluating the annotations of 14 teachers. However, with different measures it can be verified that the accuracy of the system (in comparison with teachers) thoroughly reaches the agreement among teachers. The evaluation is conducted with native German speakers learning English.
Smart Grapics
Author: Andreas Butz
Publisher: Springer Science & Business Media
ISBN: 3540405577
Category : Art
Languages : en
Pages : 271
Book Description
This book constitutes the refereed proceedings of the Third International Symposium on Smart Graphics, SG 2003, held in Heidelberg, Germany in July 2003. The 19 revised full papers and 7 poster papers presented were carefully reviewed and selected for presentation. The papers address smart graphics issues from the points of view of computer science, artificial intelligence, cognitive psychology, and fine art. The papers are organized in topical sections on graphical interaction, visualization techniques, virtual characters, and camera planning.
Publisher: Springer Science & Business Media
ISBN: 3540405577
Category : Art
Languages : en
Pages : 271
Book Description
This book constitutes the refereed proceedings of the Third International Symposium on Smart Graphics, SG 2003, held in Heidelberg, Germany in July 2003. The 19 revised full papers and 7 poster papers presented were carefully reviewed and selected for presentation. The papers address smart graphics issues from the points of view of computer science, artificial intelligence, cognitive psychology, and fine art. The papers are organized in topical sections on graphical interaction, visualization techniques, virtual characters, and camera planning.
Automatic Speech Recognition on Mobile Devices and over Communication Networks
Author: Zheng-Hua Tan
Publisher: Springer Science & Business Media
ISBN: 1848001436
Category : Technology & Engineering
Languages : en
Pages : 408
Book Description
The advances in computing and networking have sparked an enormous interest in deploying automatic speech recognition on mobile devices and over communication networks. This book brings together academic researchers and industrial practitioners to address the issues in this emerging realm and presents the reader with a comprehensive introduction to the subject of speech recognition in devices and networks. It covers network, distributed and embedded speech recognition systems.
Publisher: Springer Science & Business Media
ISBN: 1848001436
Category : Technology & Engineering
Languages : en
Pages : 408
Book Description
The advances in computing and networking have sparked an enormous interest in deploying automatic speech recognition on mobile devices and over communication networks. This book brings together academic researchers and industrial practitioners to address the issues in this emerging realm and presents the reader with a comprehensive introduction to the subject of speech recognition in devices and networks. It covers network, distributed and embedded speech recognition systems.
Robustness in Automatic Speech Recognition
Author: Jean-Claude Junqua
Publisher: Springer Science & Business Media
ISBN: 1461312973
Category : Technology & Engineering
Languages : en
Pages : 457
Book Description
Foreword Looking back the past 30 years. we have seen steady progress made in the area of speech science and technology. I still remember the excitement in the late seventies when Texas Instruments came up with a toy named "Speak-and-Spell" which was based on a VLSI chip containing the state-of-the-art linear prediction synthesizer. This caused a speech technology fever among the electronics industry. Particularly. applications of automatic speech recognition were rigorously attempt ed by many companies. some of which were start-ups founded just for this purpose. Unfortunately. it did not take long before they realized that automatic speech rec ognition technology was not mature enough to satisfy the need of customers. The fever gradually faded away. In the meantime. constant efforts have been made by many researchers and engi neers to improve the automatic speech recognition technology. Hardware capabilities have advanced impressively since that time. In the past few years. we have been witnessing and experiencing the advent of the "Information Revolution." What might be called the second surge of interest to com mercialize speech technology as a natural interface for man-machine communication began in much better shape than the first one. With computers much more powerful and faster. many applications look realistic this time. However. there are still tremendous practical issues to be overcome in order for speech to be truly the most natural interface between humans and machines.
Publisher: Springer Science & Business Media
ISBN: 1461312973
Category : Technology & Engineering
Languages : en
Pages : 457
Book Description
Foreword Looking back the past 30 years. we have seen steady progress made in the area of speech science and technology. I still remember the excitement in the late seventies when Texas Instruments came up with a toy named "Speak-and-Spell" which was based on a VLSI chip containing the state-of-the-art linear prediction synthesizer. This caused a speech technology fever among the electronics industry. Particularly. applications of automatic speech recognition were rigorously attempt ed by many companies. some of which were start-ups founded just for this purpose. Unfortunately. it did not take long before they realized that automatic speech rec ognition technology was not mature enough to satisfy the need of customers. The fever gradually faded away. In the meantime. constant efforts have been made by many researchers and engi neers to improve the automatic speech recognition technology. Hardware capabilities have advanced impressively since that time. In the past few years. we have been witnessing and experiencing the advent of the "Information Revolution." What might be called the second surge of interest to com mercialize speech technology as a natural interface for man-machine communication began in much better shape than the first one. With computers much more powerful and faster. many applications look realistic this time. However. there are still tremendous practical issues to be overcome in order for speech to be truly the most natural interface between humans and machines.