Author: K. Church
Publisher: Springer Science & Business Media
ISBN: 1461320135
Category : Technology & Engineering
Languages : en
Pages : 272
Book Description
It is well-known that phonemes have different acoustic realizations depending on the context. Thus, for example, the phoneme /t! is typically realized with a heavily aspirated strong burst at the beginning of a syllable as in the word Tom, but without a burst at the end of a syllable in a word like cat. Variation such as this is often considered to be problematic for speech recogni tion: (1) "In most systems for sentence recognition, such modifications must be viewed as a kind of 'noise' that makes it more difficult to hypothesize lexical candidates given an in put phonetic transcription. To see that this must be the case, we note that each phonological rule [in a certain example] results in irreversible ambiguity-the phonological rule does not have a unique inverse that could be used to recover the underlying phonemic representation for a lexical item. For example, . . . schwa vowels could be the first vowel in a word like 'about' or the surface realization of almost any English vowel appearing in a sufficiently destressed word. The tongue flap [(] could have come from a /t! or a /d/. " [65, pp. 548-549] This view of allophonic variation is representative of much of the speech recognition literature, especially during the late 1970's. One can find similar statements by Cole and Jakimik [22] and by Jelinek [50].
Phonological Parsing in Speech Recognition
Author: K. Church
Publisher: Springer Science & Business Media
ISBN: 1461320135
Category : Technology & Engineering
Languages : en
Pages : 272
Book Description
It is well-known that phonemes have different acoustic realizations depending on the context. Thus, for example, the phoneme /t! is typically realized with a heavily aspirated strong burst at the beginning of a syllable as in the word Tom, but without a burst at the end of a syllable in a word like cat. Variation such as this is often considered to be problematic for speech recogni tion: (1) "In most systems for sentence recognition, such modifications must be viewed as a kind of 'noise' that makes it more difficult to hypothesize lexical candidates given an in put phonetic transcription. To see that this must be the case, we note that each phonological rule [in a certain example] results in irreversible ambiguity-the phonological rule does not have a unique inverse that could be used to recover the underlying phonemic representation for a lexical item. For example, . . . schwa vowels could be the first vowel in a word like 'about' or the surface realization of almost any English vowel appearing in a sufficiently destressed word. The tongue flap [(] could have come from a /t! or a /d/. " [65, pp. 548-549] This view of allophonic variation is representative of much of the speech recognition literature, especially during the late 1970's. One can find similar statements by Cole and Jakimik [22] and by Jelinek [50].
Publisher: Springer Science & Business Media
ISBN: 1461320135
Category : Technology & Engineering
Languages : en
Pages : 272
Book Description
It is well-known that phonemes have different acoustic realizations depending on the context. Thus, for example, the phoneme /t! is typically realized with a heavily aspirated strong burst at the beginning of a syllable as in the word Tom, but without a burst at the end of a syllable in a word like cat. Variation such as this is often considered to be problematic for speech recogni tion: (1) "In most systems for sentence recognition, such modifications must be viewed as a kind of 'noise' that makes it more difficult to hypothesize lexical candidates given an in put phonetic transcription. To see that this must be the case, we note that each phonological rule [in a certain example] results in irreversible ambiguity-the phonological rule does not have a unique inverse that could be used to recover the underlying phonemic representation for a lexical item. For example, . . . schwa vowels could be the first vowel in a word like 'about' or the surface realization of almost any English vowel appearing in a sufficiently destressed word. The tongue flap [(] could have come from a /t! or a /d/. " [65, pp. 548-549] This view of allophonic variation is representative of much of the speech recognition literature, especially during the late 1970's. One can find similar statements by Cole and Jakimik [22] and by Jelinek [50].
Time Map Phonology
Author: J. Carson-Berndsen
Publisher: Springer Science & Business Media
ISBN: 9401735344
Category : Language Arts & Disciplines
Languages : en
Pages : 225
Book Description
This book is a revised version of my doctoral thesis which was submitted in April 1993. The main extension is a chapter on evaluation of the system de scribed in Chapter 8 as this is clearly an issue which was not treated in the original version. This required the collection of data, the development of a concept for diagnostic evaluation of linguistic word recognition systems and, of course, the actual evaluation of the system itself. The revisions made primarily concern the presentation of the latest version of the SILPA system described in an additional Subsection 8. 3, the development environment for SILPA in Sec tion 8. 4, the diagnostic evaluation of the system as an additional Chapter 9. Some updates are included in the discussion of phonology and computation in Chapter 2 and finite state techniques in computational phonology in Chapter 3. The thesis was designed primarily as a contribution to the area of compu tational phonology. However, it addresses issues which are relevant within the disciplines of general linguistics, computational linguistics and, in particular, speech technology, in providing a detailed declarative, computationally inter preted linguistic model for application in spoken language processing. Time Map Phonology is a novel, constraint-based approach based on a two-stage temporal interpretation of phonological categories as events.
Publisher: Springer Science & Business Media
ISBN: 9401735344
Category : Language Arts & Disciplines
Languages : en
Pages : 225
Book Description
This book is a revised version of my doctoral thesis which was submitted in April 1993. The main extension is a chapter on evaluation of the system de scribed in Chapter 8 as this is clearly an issue which was not treated in the original version. This required the collection of data, the development of a concept for diagnostic evaluation of linguistic word recognition systems and, of course, the actual evaluation of the system itself. The revisions made primarily concern the presentation of the latest version of the SILPA system described in an additional Subsection 8. 3, the development environment for SILPA in Sec tion 8. 4, the diagnostic evaluation of the system as an additional Chapter 9. Some updates are included in the discussion of phonology and computation in Chapter 2 and finite state techniques in computational phonology in Chapter 3. The thesis was designed primarily as a contribution to the area of compu tational phonology. However, it addresses issues which are relevant within the disciplines of general linguistics, computational linguistics and, in particular, speech technology, in providing a detailed declarative, computationally inter preted linguistic model for application in spoken language processing. Time Map Phonology is a novel, constraint-based approach based on a two-stage temporal interpretation of phonological categories as events.
The Integration of Phonetic Knowledge in Speech Technology
Author: William J. Barry
Publisher: Springer Science & Business Media
ISBN: 9781402026362
Category : Computers
Languages : en
Pages : 196
Book Description
Continued progress in Speech Technology in the face of ever-increasing demands on the performance levels of applications is a challenge to the whole speech and language science community. Robust recognition and understanding of spontaneous speech in varied environments, good comprehensibility and naturalness of expressive speech synthesis are goals that cannot be achieved without a change of paradigm. This book argues for interdisciplinary communication and cooperation in problem-solving in general, and discusses the interaction between speech and language engineering and phonetics in particular. With a number of reports on innovative speech technology research as well as more theoretical discussions, it addresses the practical, scientific and sometimes the philosophical problems that stand in the way of cross-disciplinary collaboration and illuminates some of the many possible ways forward. Audience: Researchers and professionals in speech technology and computational linguists.
Publisher: Springer Science & Business Media
ISBN: 9781402026362
Category : Computers
Languages : en
Pages : 196
Book Description
Continued progress in Speech Technology in the face of ever-increasing demands on the performance levels of applications is a challenge to the whole speech and language science community. Robust recognition and understanding of spontaneous speech in varied environments, good comprehensibility and naturalness of expressive speech synthesis are goals that cannot be achieved without a change of paradigm. This book argues for interdisciplinary communication and cooperation in problem-solving in general, and discusses the interaction between speech and language engineering and phonetics in particular. With a number of reports on innovative speech technology research as well as more theoretical discussions, it addresses the practical, scientific and sometimes the philosophical problems that stand in the way of cross-disciplinary collaboration and illuminates some of the many possible ways forward. Audience: Researchers and professionals in speech technology and computational linguists.
Author:
Publisher: IOS Press
ISBN:
Category :
Languages : en
Pages : 7289
Book Description
Publisher: IOS Press
ISBN:
Category :
Languages : en
Pages : 7289
Book Description
Robustness in Automatic Speech Recognition
Author: Jean-Claude Junqua
Publisher: Springer Science & Business Media
ISBN: 1461312973
Category : Technology & Engineering
Languages : en
Pages : 457
Book Description
Foreword Looking back the past 30 years. we have seen steady progress made in the area of speech science and technology. I still remember the excitement in the late seventies when Texas Instruments came up with a toy named "Speak-and-Spell" which was based on a VLSI chip containing the state-of-the-art linear prediction synthesizer. This caused a speech technology fever among the electronics industry. Particularly. applications of automatic speech recognition were rigorously attempt ed by many companies. some of which were start-ups founded just for this purpose. Unfortunately. it did not take long before they realized that automatic speech rec ognition technology was not mature enough to satisfy the need of customers. The fever gradually faded away. In the meantime. constant efforts have been made by many researchers and engi neers to improve the automatic speech recognition technology. Hardware capabilities have advanced impressively since that time. In the past few years. we have been witnessing and experiencing the advent of the "Information Revolution." What might be called the second surge of interest to com mercialize speech technology as a natural interface for man-machine communication began in much better shape than the first one. With computers much more powerful and faster. many applications look realistic this time. However. there are still tremendous practical issues to be overcome in order for speech to be truly the most natural interface between humans and machines.
Publisher: Springer Science & Business Media
ISBN: 1461312973
Category : Technology & Engineering
Languages : en
Pages : 457
Book Description
Foreword Looking back the past 30 years. we have seen steady progress made in the area of speech science and technology. I still remember the excitement in the late seventies when Texas Instruments came up with a toy named "Speak-and-Spell" which was based on a VLSI chip containing the state-of-the-art linear prediction synthesizer. This caused a speech technology fever among the electronics industry. Particularly. applications of automatic speech recognition were rigorously attempt ed by many companies. some of which were start-ups founded just for this purpose. Unfortunately. it did not take long before they realized that automatic speech rec ognition technology was not mature enough to satisfy the need of customers. The fever gradually faded away. In the meantime. constant efforts have been made by many researchers and engi neers to improve the automatic speech recognition technology. Hardware capabilities have advanced impressively since that time. In the past few years. we have been witnessing and experiencing the advent of the "Information Revolution." What might be called the second surge of interest to com mercialize speech technology as a natural interface for man-machine communication began in much better shape than the first one. With computers much more powerful and faster. many applications look realistic this time. However. there are still tremendous practical issues to be overcome in order for speech to be truly the most natural interface between humans and machines.
Readings in Speech Recognition
Author: Alexander Waibel
Publisher: Elsevier
ISBN: 0080515843
Category : Computers
Languages : en
Pages : 640
Book Description
After more than two decades of research activity, speech recognition has begun to live up to its promise as a practical technology and interest in the field is growing dramatically. Readings in Speech Recognition provides a collection of seminal papers that have influenced or redirected the field and that illustrate the central insights that have emerged over the years. The editors provide an introduction to the field, its concerns and research problems. Subsequent chapters are devoted to the main schools of thought and design philosophies that have motivated different approaches to speech recognition system design. Each chapter includes an introduction to the papers that highlights the major insights or needs that have motivated an approach to a problem and describes the commonalities and differences of that approach to others in the book.
Publisher: Elsevier
ISBN: 0080515843
Category : Computers
Languages : en
Pages : 640
Book Description
After more than two decades of research activity, speech recognition has begun to live up to its promise as a practical technology and interest in the field is growing dramatically. Readings in Speech Recognition provides a collection of seminal papers that have influenced or redirected the field and that illustrate the central insights that have emerged over the years. The editors provide an introduction to the field, its concerns and research problems. Subsequent chapters are devoted to the main schools of thought and design philosophies that have motivated different approaches to speech recognition system design. Each chapter includes an introduction to the papers that highlights the major insights or needs that have motivated an approach to a problem and describes the commonalities and differences of that approach to others in the book.
Living on the Edge
Author: Stefan Ploch
Publisher: Walter de Gruyter
ISBN: 3110890569
Category : Language Arts & Disciplines
Languages : en
Pages : 757
Book Description
This collection of papers by an international group of authors honors Jonathan Kaye's contributions to phonology by expanding some of Kaye's ideas to a variety of theoretical topics and languages. The set of ideas discussed or used in this collection includes: empty categories, licensing relationships and constraints, a restrictive two-levelled approach to phonology (without rule ordering or constraint ranking), a restrictive theory of syllabic representation (without the codas constituent and with exclusively binary branching), theories of the phonology-phonetics interface in which phonology is motivated independently of phonetics, and the metatheoretical flaws in a number of widely accepted but rarely questioned views on phonology.
Publisher: Walter de Gruyter
ISBN: 3110890569
Category : Language Arts & Disciplines
Languages : en
Pages : 757
Book Description
This collection of papers by an international group of authors honors Jonathan Kaye's contributions to phonology by expanding some of Kaye's ideas to a variety of theoretical topics and languages. The set of ideas discussed or used in this collection includes: empty categories, licensing relationships and constraints, a restrictive two-levelled approach to phonology (without rule ordering or constraint ranking), a restrictive theory of syllabic representation (without the codas constituent and with exclusively binary branching), theories of the phonology-phonetics interface in which phonology is motivated independently of phonetics, and the metatheoretical flaws in a number of widely accepted but rarely questioned views on phonology.
Lexical Representation and Process
Author: William Marslen-Wilson
Publisher: MIT Press
ISBN: 9780262631426
Category : Computers
Languages : en
Pages : 596
Book Description
The 18 contributions in Lexical Representation and Process provide a coherent and well-documented frame of reference for a field of study that is becoming central to both linguistics and psycholinguistics.
Publisher: MIT Press
ISBN: 9780262631426
Category : Computers
Languages : en
Pages : 596
Book Description
The 18 contributions in Lexical Representation and Process provide a coherent and well-documented frame of reference for a field of study that is becoming central to both linguistics and psycholinguistics.
Speech Processing
Author: Li Deng
Publisher: CRC Press
ISBN: 1482276232
Category : Technology & Engineering
Languages : en
Pages : 752
Book Description
Based on years of instruction and field expertise, this volume offers the necessary tools to understand all scientific, computational, and technological aspects of speech processing. The book emphasizes mathematical abstraction, the dynamics of the speech process, and the engineering optimization practices that promote effective problem solving in this area of research and covers many years of the authors' personal research on speech processing. Speech Processing helps build valuable analytical skills to help meet future challenges in scientific and technological advances in the field and considers the complex transition from human speech processing to computer speech processing.
Publisher: CRC Press
ISBN: 1482276232
Category : Technology & Engineering
Languages : en
Pages : 752
Book Description
Based on years of instruction and field expertise, this volume offers the necessary tools to understand all scientific, computational, and technological aspects of speech processing. The book emphasizes mathematical abstraction, the dynamics of the speech process, and the engineering optimization practices that promote effective problem solving in this area of research and covers many years of the authors' personal research on speech processing. Speech Processing helps build valuable analytical skills to help meet future challenges in scientific and technological advances in the field and considers the complex transition from human speech processing to computer speech processing.
Phonology
Author: Charles W. Kreidler
Publisher: Taylor & Francis
ISBN: 9780415237901
Category : Language Arts & Disciplines
Languages : en
Pages : 434
Book Description
Phonology: Critical Concepts, the first such anthology to appear in thirty years and the largest ever published, brings together over a hundred previously published book chapters and articles from professional journals. These have been chosen for their importance in the exploration of theoretical questions, with some preference for essays that are not easily accessible.Divided into sections, each part is preceded by a brief introduction which aims to point out the problems addressed by the various articles and show their relations to one another.-
Publisher: Taylor & Francis
ISBN: 9780415237901
Category : Language Arts & Disciplines
Languages : en
Pages : 434
Book Description
Phonology: Critical Concepts, the first such anthology to appear in thirty years and the largest ever published, brings together over a hundred previously published book chapters and articles from professional journals. These have been chosen for their importance in the exploration of theoretical questions, with some preference for essays that are not easily accessible.Divided into sections, each part is preceded by a brief introduction which aims to point out the problems addressed by the various articles and show their relations to one another.-