Audiovisual Speech Recognition: Correspondence between Brain and Behavior

Audiovisual Speech Recognition: Correspondence between Brain and Behavior PDF Author: Nicholas Altieri
Publisher: Frontiers E-books
ISBN: 2889192512
Category : Brain
Languages : en
Pages : 102

Get Book

Book Description
Perceptual processes mediating recognition, including the recognition of objects and spoken words, is inherently multisensory. This is true in spite of the fact that sensory inputs are segregated in early stages of neuro-sensory encoding. In face-to-face communication, for example, auditory information is processed in the cochlea, encoded in auditory sensory nerve, and processed in lower cortical areas. Eventually, these “sounds” are processed in higher cortical pathways such as the auditory cortex where it is perceived as speech. Likewise, visual information obtained from observing a talker’s articulators is encoded in lower visual pathways. Subsequently, this information undergoes processing in the visual cortex prior to the extraction of articulatory gestures in higher cortical areas associated with speech and language. As language perception unfolds, information garnered from visual articulators interacts with language processing in multiple brain regions. This occurs via visual projections to auditory, language, and multisensory brain regions. The association of auditory and visual speech signals makes the speech signal a highly “configural” percept. An important direction for the field is thus to provide ways to measure the extent to which visual speech information influences auditory processing, and likewise, assess how the unisensory components of the signal combine to form a configural/integrated percept. Numerous behavioral measures such as accuracy (e.g., percent correct, susceptibility to the “McGurk Effect”) and reaction time (RT) have been employed to assess multisensory integration ability in speech perception. On the other hand, neural based measures such as fMRI, EEG and MEG have been employed to examine the locus and or time-course of integration. The purpose of this Research Topic is to find converging behavioral and neural based assessments of audiovisual integration in speech perception. A further aim is to investigate speech recognition ability in normal hearing, hearing-impaired, and aging populations. As such, the purpose is to obtain neural measures from EEG as well as fMRI that shed light on the neural bases of multisensory processes, while connecting them to model based measures of reaction time and accuracy in the behavioral domain. In doing so, we endeavor to gain a more thorough description of the neural bases and mechanisms underlying integration in higher order processes such as speech and language recognition.

Audiovisual Speech Processing

Audiovisual Speech Processing PDF Author: Gérard Bailly
Publisher: Cambridge University Press
ISBN: 1107006821
Category : Computers
Languages : en
Pages : 507

Get Book

Book Description
This book presents a complete overview of all aspects of audiovisual speech including perception, production, brain processing and technology.

Audiovisual Speech Processing

Audiovisual Speech Processing PDF Author: Gérard Bailly
Publisher: Cambridge University Press
ISBN: 110737815X
Category : Language Arts & Disciplines
Languages : en
Pages : 507

Get Book

Book Description
When we speak, we configure the vocal tract which shapes the visible motions of the face and the patterning of the audible speech acoustics. Similarly, we use these visible and audible behaviors to perceive speech. This book showcases a broad range of research investigating how these two types of signals are used in spoken communication, how they interact, and how they can be used to enhance the realistic synthesis and recognition of audible and visible speech. The volume begins by addressing two important questions about human audiovisual performance: how auditory and visual signals combine to access the mental lexicon and where in the brain this and related processes take place. It then turns to the production and perception of multimodal speech and how structures are coordinated within and across the two modalities. Finally, the book presents overviews and recent developments in machine-based speech recognition and synthesis of AV speech.

Speech Recognition in Adverse Conditions

Speech Recognition in Adverse Conditions PDF Author: Sven Mattys
Publisher: Psychology Press
ISBN: 1317836812
Category : Psychology
Languages : en
Pages : 326

Get Book

Book Description
Speech recognition in ‘adverse conditions’ has been a familiar area of research in computer science, engineering, and hearing sciences for several decades. In contrast, most psycholinguistic theories of speech recognition are built upon evidence gathered from tasks performed by healthy listeners on carefully recorded speech, in a quiet environment, and under conditions of undivided attention. Building upon the momentum initiated by the Psycholinguistic Approaches to Speech Recognition in Adverse Conditions workshop held in Bristol, UK, in 2010, the aim of this volume is to promote a multi-disciplinary, yet unified approach to the perceptual, cognitive, and neuro-physiological mechanisms underpinning the recognition of degraded speech, variable speech, speech experienced under cognitive load, and speech experienced by theoretically relevant populations. This collection opens with a review of the literature and a formal classification of adverse conditions. The research articles then highlight those adverse conditions with the greatest potential for constraining theory, showing that some speech phenomena often believed to be immutable can be affected by noise, surface variations, or attentional set in ways that will force researchers to rethink their theory. This volume is essential for those interested in speech recognition outside laboratory constraints.

Neural Mechanisms of Perceptual Categorization as Precursors to Speech Perception

Neural Mechanisms of Perceptual Categorization as Precursors to Speech Perception PDF Author: Einat Liebenthal
Publisher: Frontiers Media SA
ISBN: 2889451585
Category : Electronic book
Languages : en
Pages : 188

Get Book

Book Description
Perceptual categorization is fundamental to the brain’s remarkable ability to process large amounts of sensory information and efficiently recognize objects including speech. Perceptual categorization is the neural bridge between lower-level sensory and higher-level language processing. A long line of research on the physical properties of the speech signal as determined by the anatomy and physiology of the speech production apparatus has led to descriptions of the acoustic information that is used in speech recognition (e.g., stop consonants place and manner of articulation, voice onset time, aspiration). Recent research has also considered what visual cues are relevant to visual speech recognition (i.e., the visual counter-parts used in lipreading or audiovisual speech perception). Much of the theoretical work on speech perception was done in the twentieth century without the benefit of neuroimaging technologies and models of neural representation. Recent progress in understanding the functional organization of sensory and association cortices based on advances in neuroimaging presents the possibility of achieving a comprehensive and far reaching account of perception in the service of language. At the level of cell assemblies, research in animals and humans suggests that neurons in the temporal cortex are important for encoding biological categories. On the cellular level, different classes of neurons (interneurons and pyramidal neurons) have been suggested to play differential roles in the neural computations underlying auditory and visual categorization. The moment is ripe for a research topic focused on neural mechanisms mediating the emergence of speech representations (including auditory, visual and even somatosensory based forms). Important progress can be achieved by juxtaposing within the same research topic the knowledge that currently exists, the identified lacunae, and the theories that can support future investigations. This research topic provides a snapshot and platform for discussion of current understanding of neural mechanisms underlying the formation of perceptual categories and their relationship to language from a multidisciplinary and multisensory perspective. It includes contributions (reviews, original research, methodological developments) pertaining to the neural substrates, dynamics, and mechanisms underlying perceptual categorization and their interaction with neural processes governing speech perception.

Speech Perception and Spoken Word Recognition

Speech Perception and Spoken Word Recognition PDF Author: Gareth Gaskell
Publisher: Psychology Press
ISBN: 1317677420
Category : Psychology
Languages : en
Pages : 206

Get Book

Book Description
Speech Perception and Spoken Word Recognition features contributions from the field’s leading scientists, and covers recent developments and current issues in the study of cognitive and neural mechanisms that take patterns of air vibrations and turn them ‘magically’ into meaning. The volume makes a unique theoretical contribution in linking behavioural and cognitive neuroscience research, and cutting across traditional strands of study, such as adult and developmental processing. The book: Focusses on the state of the art in the study of speech perception and spoken word recognition Discusses the interplay between behavioural and cognitive neuroscience evidence, and between adult and developmental research Evaluates key theories in the field and relates them to recent empirical advances, including the relationship between speech perception and speech production, meaning representation and real-time activation, and bilingual and monolingual spoken word recognition Examines emerging areas of study such as word learning and time-course of memory consolidation, and how the science of human speech perception can help computer speech recognition Overall this book presents a renewed focus on theoretical and developmental issues, as well as a multifaceted and broad review of the state of research, in speech perception and spoken word recognition. Particularly interested readers will be researchers of psycholinguistics and adjoining fields as well as advanced undergraduate and postgraduate students.

The Handbook of Multisensory Processes

The Handbook of Multisensory Processes PDF Author: Gemma Calvert
Publisher: MIT Press
ISBN: 9780262033213
Category : Anatomy
Languages : en
Pages : 952

Get Book

Book Description
Research is suggesting that rather than our senses being independent, perception is fundamentally a multisensory experience. This handbook reviews the evidence and explores the theory of broad underlying principles that govern sensory interactions, regardless of the specific senses involved.

The Role of Letter-Speech Sound Integration in Typical and Atypical Reading Development

The Role of Letter-Speech Sound Integration in Typical and Atypical Reading Development PDF Author: Jurgen Tijms
Publisher: Frontiers Media SA
ISBN: 2889636984
Category :
Languages : en
Pages : 249

Get Book

Book Description
Fluency is the quintessence of effective reading. To obtain socio-economic success, fluent reading is of primordial importance and reading is considered a crucial marker of an individual’s life course. Approximately 5% of children are affected by developmental dyslexia, exhibiting inaccurate word recognition, spelling, phonological decoding, and most importantly, severely dysfluent reading, which remains as their most characterizing and persistent deficit. Unable to attain society’s literacy demands, individuals with dyslexia are at severe risk for adverse academic, economic, and psychosocial consequences. Recently, it has been posed that the development of automatic letter-speech sound (LSS) integration is critical in the acquisition of fluent reading skills, and in particular that a failure to develop automatic LSS integration results in an impairment of reading fluency. In support, neurocognitive research has suggested that the development of automatized processing of LSS associations is an essential step in the formation of a functional neural network for reading. Furthermore, both neurocognitive and behavioural studies have suggested a less efficient LSS integration in children with dyslexia than in typical readers. Finally, results from intervention studies have suggested that training LSS might be a promising approach to ameliorate dysfluent reading in children with dyslexia. Nonetheless, there is still a considerable gap of knowledge in our understanding of the mechanisms by which learning LSS associations relate to (dys)fluent reading.

The Oxford Handbook of Cognitive Neuroscience, Volume 2

The Oxford Handbook of Cognitive Neuroscience, Volume 2 PDF Author: Kevin Ochsner
Publisher: Oxford University Press
ISBN: 0199988706
Category : Psychology
Languages : en
Pages : 638

Get Book

Book Description
A rich source of authoritative information that supports reading and study in the field of cognitive neuroscience, this two-volume handbook reviews the current state-of-the-science in all major areas of the field.

Association and Auditory Cortices

Association and Auditory Cortices PDF Author: Alan Peters
Publisher: Springer Science & Business Media
ISBN: 1475796196
Category : Medical
Languages : en
Pages : 366

Get Book

Book Description
This volume deals with some of the association areas of the cerebral cortex and with the auditory cortex. In the first chapter, by Deepak Pandya and Edward Yeterian, the general architectural features and connections of cortical associ ation areas are considered; as these authors point out, in primates the association areas take up a considerable portion of the total cortical surface. Indeed, it is the development of the association areas that accounts for the greatest differ ences between the brains of primate and non primate species, and these areas have long been viewed as crucial in the formation of higher cognitive and be havioral functions. In the following chapter, Irving Diamond, David Fitzpatrick, and James Sprague consider the question of whether the functions of the as sociation areas depend on projections from the sensory areas of the cortex. They use the visual cortex to examine this question and show that there is a great deal of difference between species in the amount of dependence, the differences being paralleled by variations in the manner in which the geniculate and pulvinar nuclei of the thalamus project to the striate and extra striate cortical areas. One of the more interesting and perhaps least understood of the association areas is the cingulate cortex, discussed by Brent Vogt. Cingulate cortex has been linked with emotion and with affective responses to pain, and in his chapter Vogt gives an account of its cytoarchitecture, connections, and functions.