Author: Herve Bourlard
Publisher: CRC Press
ISBN: 1482212137
Category : Science
Languages : en
Pages : 367
Book Description
This book provides a synthesis of the multifaceted field of interactive multimodal information management. The subjects treated include spoken language processing, image and video processing, document and handwriting analysis, identity information and interfaces. The book concludes with an overview of the highlights of the progress of the field dur
Multimodal Interactive Systems Management
Author: Herve Bourlard
Publisher: CRC Press
ISBN: 1482212137
Category : Science
Languages : en
Pages : 367
Book Description
This book provides a synthesis of the multifaceted field of interactive multimodal information management. The subjects treated include spoken language processing, image and video processing, document and handwriting analysis, identity information and interfaces. The book concludes with an overview of the highlights of the progress of the field dur
Publisher: CRC Press
ISBN: 1482212137
Category : Science
Languages : en
Pages : 367
Book Description
This book provides a synthesis of the multifaceted field of interactive multimodal information management. The subjects treated include spoken language processing, image and video processing, document and handwriting analysis, identity information and interfaces. The book concludes with an overview of the highlights of the progress of the field dur
Multimodal Human Computer Interaction and Pervasive Services
Author: Grifoni, Patrizia
Publisher: IGI Global
ISBN: 1605663875
Category : Computers
Languages : en
Pages : 537
Book Description
"This book provides concepts, methodologies, and applications used to design and develop multimodal systems"--Provided by publisher.
Publisher: IGI Global
ISBN: 1605663875
Category : Computers
Languages : en
Pages : 537
Book Description
"This book provides concepts, methodologies, and applications used to design and develop multimodal systems"--Provided by publisher.
Interactive Multimodal Information Management
Author: Hervé Bourlard
Publisher: EPFL Press
ISBN: 2940222711
Category : Reference
Languages : en
Pages : 369
Book Description
In the past twenty years, computers and networks have gained a prominent role in supporting human communications. This book presents recent research in multimodal information processing, which demonstrates that computers can achieve more than what telephone calls or videoconferencing can do. The book offers a snapshot of current capabilities for the analysis of human communications in several modalities – audio, speech, language, images, video, and documents – and for accessing this information interactively. The book has a clear application goal, which is the capture, automatic analysis, storage, and retrieval of multimodal signals from human interaction in meetings. This goal provides a controlled experimental framework and helps generating shared data, which is required for methods based on machine learning. This goal has shaped the vision of the contributors to the book and of many other researchers cited in it. It has also received significant long-term support through a series of projects, including the Swiss National Center of Competence in Research (NCCR) in Interactive Multimodal Information Management (IM2), to which the contributors to the book have been connected.
Publisher: EPFL Press
ISBN: 2940222711
Category : Reference
Languages : en
Pages : 369
Book Description
In the past twenty years, computers and networks have gained a prominent role in supporting human communications. This book presents recent research in multimodal information processing, which demonstrates that computers can achieve more than what telephone calls or videoconferencing can do. The book offers a snapshot of current capabilities for the analysis of human communications in several modalities – audio, speech, language, images, video, and documents – and for accessing this information interactively. The book has a clear application goal, which is the capture, automatic analysis, storage, and retrieval of multimodal signals from human interaction in meetings. This goal provides a controlled experimental framework and helps generating shared data, which is required for methods based on machine learning. This goal has shaped the vision of the contributors to the book and of many other researchers cited in it. It has also received significant long-term support through a series of projects, including the Swiss National Center of Competence in Research (NCCR) in Interactive Multimodal Information Management (IM2), to which the contributors to the book have been connected.
Interactive Multi-modal Question-Answering
Author: Antal van den Bosch
Publisher: Springer Science & Business Media
ISBN: 3642175252
Category : Computers
Languages : en
Pages : 279
Book Description
This book is the result of a group of researchers from different disciplines asking themselves one question: what does it take to develop a computer interface that listens, talks, and can answer questions in a domain? First, obviously, it takes specialized modules for speech recognition and synthesis, human interaction management (dialogue, input fusion, and multimodal output fusion), basic question understanding, and answer finding. While all modules are researched as independent subfields, this book describes the development of state-of-the-art modules and their integration into a single, working application capable of answering medical (encyclopedic) questions such as "How long is a person with measles contagious?" or "How can I prevent RSI?". The contributions in this book, which grew out of the IMIX project funded by the Netherlands Organisation for Scientific Research, document the development of this system, but also address more general issues in natural language processing, such as the development of multidimensional dialogue systems, the acquisition of taxonomic knowledge from text, answer fusion, sequence processing for domain-specific entity recognition, and syntactic parsing for question answering. Together, they offer an overview of the most important findings and lessons learned in the scope of the IMIX project, making the book of interest to both academic and commercial developers of human-machine interaction systems in Dutch or any other language. Highlights include: integrating multi-modal input fusion in dialogue management (Van Schooten and Op den Akker), state-of-the-art approaches to the extraction of term variants (Van der Plas, Tiedemann, and Fahmi; Tjong Kim Sang, Hofmann, and De Rijke), and multi-modal answer fusion (two chapters by Van Hooijdonk, Bosma, Krahmer, Maes, Theune, and Marsi). Watch the IMIX movie at www.nwo.nl/imix-film. Like IBM's Watson, the IMIX system described in the book gives naturally phrased responses to naturally posed questions. Where Watson can only generate synthetic speech, the IMIX system also recognizes speech. On the other hand, Watson is able to win a television quiz, while the IMIX system is domain-specific, answering only to medical questions. "The Netherlands has always been one of the leaders in the general field of Human Language Technology, and IMIX is no exception. It was a very ambitious program, with a remarkably successful performance leading to interesting results. The teams covered a remarkable amount of territory in the general sphere of multimodal question answering and information delivery, question answering, information extraction and component technologies." Eduard Hovy, USC, USA, Jon Oberlander, University of Edinburgh, Scotland, and Norbert Reithinger, DFKI, Germany
Publisher: Springer Science & Business Media
ISBN: 3642175252
Category : Computers
Languages : en
Pages : 279
Book Description
This book is the result of a group of researchers from different disciplines asking themselves one question: what does it take to develop a computer interface that listens, talks, and can answer questions in a domain? First, obviously, it takes specialized modules for speech recognition and synthesis, human interaction management (dialogue, input fusion, and multimodal output fusion), basic question understanding, and answer finding. While all modules are researched as independent subfields, this book describes the development of state-of-the-art modules and their integration into a single, working application capable of answering medical (encyclopedic) questions such as "How long is a person with measles contagious?" or "How can I prevent RSI?". The contributions in this book, which grew out of the IMIX project funded by the Netherlands Organisation for Scientific Research, document the development of this system, but also address more general issues in natural language processing, such as the development of multidimensional dialogue systems, the acquisition of taxonomic knowledge from text, answer fusion, sequence processing for domain-specific entity recognition, and syntactic parsing for question answering. Together, they offer an overview of the most important findings and lessons learned in the scope of the IMIX project, making the book of interest to both academic and commercial developers of human-machine interaction systems in Dutch or any other language. Highlights include: integrating multi-modal input fusion in dialogue management (Van Schooten and Op den Akker), state-of-the-art approaches to the extraction of term variants (Van der Plas, Tiedemann, and Fahmi; Tjong Kim Sang, Hofmann, and De Rijke), and multi-modal answer fusion (two chapters by Van Hooijdonk, Bosma, Krahmer, Maes, Theune, and Marsi). Watch the IMIX movie at www.nwo.nl/imix-film. Like IBM's Watson, the IMIX system described in the book gives naturally phrased responses to naturally posed questions. Where Watson can only generate synthetic speech, the IMIX system also recognizes speech. On the other hand, Watson is able to win a television quiz, while the IMIX system is domain-specific, answering only to medical questions. "The Netherlands has always been one of the leaders in the general field of Human Language Technology, and IMIX is no exception. It was a very ambitious program, with a remarkably successful performance leading to interesting results. The teams covered a remarkable amount of territory in the general sphere of multimodal question answering and information delivery, question answering, information extraction and component technologies." Eduard Hovy, USC, USA, Jon Oberlander, University of Edinburgh, Scotland, and Norbert Reithinger, DFKI, Germany
Quantifying Quality Aspects of Multimodal Interactive Systems
Author: Christine Kühnel
Publisher: Springer Science & Business Media
ISBN: 3642296025
Category : Technology & Engineering
Languages : en
Pages : 188
Book Description
This book systematically addresses the quantification of quality aspects of multimodal interactive systems. The conceptual structure is based on a schematic view on human-computer interaction where the user interacts with the system and perceives it via input and output interfaces. Thus, aspects of multimodal interaction are analyzed first, followed by a discussion of the evaluation of output and input and concluding with a view on the evaluation of a complete system.
Publisher: Springer Science & Business Media
ISBN: 3642296025
Category : Technology & Engineering
Languages : en
Pages : 188
Book Description
This book systematically addresses the quantification of quality aspects of multimodal interactive systems. The conceptual structure is based on a schematic view on human-computer interaction where the user interacts with the system and perceives it via input and output interfaces. Thus, aspects of multimodal interaction are analyzed first, followed by a discussion of the evaluation of output and input and concluding with a view on the evaluation of a complete system.
Human Machine Interaction
Author: Denis Lalanne
Publisher: Springer
ISBN: 3642004377
Category : Computers
Languages : en
Pages : 319
Book Description
Human Machine Interaction, or more commonly Human Computer Interaction, is the study of interaction between people and computers. It is an interdisciplinary field, connecting computer science with many other disciplines such as psychology, sociology and the arts. The present volume documents the results of the MMI research program on Human Machine Interaction involving 8 projects (selected from a total of 80 proposals) funded by the Hasler Foundation between 2005 and 2008. These projects were also partially funded by the associated universities and other third parties such as the Swiss National Science Foundation. This state-of-the-art survey begins with three chapters giving overviews of the domains of multimodal user interfaces, interactive visualization, and mixed reality. These are followed by eight chapters presenting the results of the projects, grouped according to the three aforementioned themes.
Publisher: Springer
ISBN: 3642004377
Category : Computers
Languages : en
Pages : 319
Book Description
Human Machine Interaction, or more commonly Human Computer Interaction, is the study of interaction between people and computers. It is an interdisciplinary field, connecting computer science with many other disciplines such as psychology, sociology and the arts. The present volume documents the results of the MMI research program on Human Machine Interaction involving 8 projects (selected from a total of 80 proposals) funded by the Hasler Foundation between 2005 and 2008. These projects were also partially funded by the associated universities and other third parties such as the Swiss National Science Foundation. This state-of-the-art survey begins with three chapters giving overviews of the domains of multimodal user interfaces, interactive visualization, and mixed reality. These are followed by eight chapters presenting the results of the projects, grouped according to the three aforementioned themes.
Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments
Author: Julie A. Jacko
Publisher: Springer
ISBN: 3540731105
Category : Computers
Languages : en
Pages : 1049
Book Description
Here is the third of a four-volume set that constitutes the refereed proceedings of the 12th International Conference on Human-Computer Interaction, HCII 2007, held in Beijing, China, in July 2007, jointly with eight other thematically similar conferences. It covers multimodality and conversational dialogue; adaptive, intelligent and emotional user interfaces; gesture and eye gaze recognition; and interactive TV and media.
Publisher: Springer
ISBN: 3540731105
Category : Computers
Languages : en
Pages : 1049
Book Description
Here is the third of a four-volume set that constitutes the refereed proceedings of the 12th International Conference on Human-Computer Interaction, HCII 2007, held in Beijing, China, in July 2007, jointly with eight other thematically similar conferences. It covers multimodality and conversational dialogue; adaptive, intelligent and emotional user interfaces; gesture and eye gaze recognition; and interactive TV and media.
Multimodal Signal Processing
Author: Jean-Philippe Thiran
Publisher: Academic Press
ISBN: 0080888690
Category : Computers
Languages : en
Pages : 343
Book Description
Multimodal signal processing is an important research and development field that processes signals and combines information from a variety of modalities – speech, vision, language, text – which significantly enhance the understanding, modelling, and performance of human-computer interaction devices or systems enhancing human-human communication. The overarching theme of this book is the application of signal processing and statistical machine learning techniques to problems arising in this multi-disciplinary field. It describes the capabilities and limitations of current technologies, and discusses the technical challenges that must be overcome to develop efficient and user-friendly multimodal interactive systems. With contributions from the leading experts in the field, the present book should serve as a reference in multimodal signal processing for signal processing researchers, graduate students, R&D engineers, and computer engineers who are interested in this emerging field. - Presents state-of-art methods for multimodal signal processing, analysis, and modeling - Contains numerous examples of systems with different modalities combined - Describes advanced applications in multimodal Human-Computer Interaction (HCI) as well as in computer-based analysis and modelling of multimodal human-human communication scenes.
Publisher: Academic Press
ISBN: 0080888690
Category : Computers
Languages : en
Pages : 343
Book Description
Multimodal signal processing is an important research and development field that processes signals and combines information from a variety of modalities – speech, vision, language, text – which significantly enhance the understanding, modelling, and performance of human-computer interaction devices or systems enhancing human-human communication. The overarching theme of this book is the application of signal processing and statistical machine learning techniques to problems arising in this multi-disciplinary field. It describes the capabilities and limitations of current technologies, and discusses the technical challenges that must be overcome to develop efficient and user-friendly multimodal interactive systems. With contributions from the leading experts in the field, the present book should serve as a reference in multimodal signal processing for signal processing researchers, graduate students, R&D engineers, and computer engineers who are interested in this emerging field. - Presents state-of-art methods for multimodal signal processing, analysis, and modeling - Contains numerous examples of systems with different modalities combined - Describes advanced applications in multimodal Human-Computer Interaction (HCI) as well as in computer-based analysis and modelling of multimodal human-human communication scenes.
The Handbook of Multimodal-Multisensor Interfaces, Volume 1
Author: Sharon Oviatt
Publisher: Morgan & Claypool
ISBN: 1970001666
Category : Computers
Languages : en
Pages : 598
Book Description
The Handbook of Multimodal-Multisensor Interfaces provides the first authoritative resource on what has become the dominant paradigm for new computer interfaces— user input involving new media (speech, multi-touch, gestures, writing) embedded in multimodal-multisensor interfaces. These interfaces support smart phones, wearables, in-vehicle and robotic applications, and many other areas that are now highly competitive commercially. This edited collection is written by international experts and pioneers in the field. It provides a textbook, reference, and technology roadmap for professionals working in this and related areas. This first volume of the handbook presents relevant theory and neuroscience foundations for guiding the development of high-performance systems. Additional chapters discuss approaches to user modeling and interface designs that support user choice, that synergistically combine modalities with sensors, and that blend multimodal input and output. This volume also highlights an in-depth look at the most common multimodal-multisensor combinations—for example, touch and pen input, haptic and non-speech audio output, and speech-centric systems that co-process either gestures, pen input, gaze, or visible lip movements. A common theme throughout these chapters is supporting mobility and individual differences among users. These handbook chapters provide walk-through examples of system design and processing, information on tools and practical resources for developing and evaluating new systems, and terminology and tutorial support for mastering this emerging field. In the final section of this volume, experts exchange views on a timely and controversial challenge topic, and how they believe multimodal-multisensor interfaces should be designed in the future to most effectively advance human performance.
Publisher: Morgan & Claypool
ISBN: 1970001666
Category : Computers
Languages : en
Pages : 598
Book Description
The Handbook of Multimodal-Multisensor Interfaces provides the first authoritative resource on what has become the dominant paradigm for new computer interfaces— user input involving new media (speech, multi-touch, gestures, writing) embedded in multimodal-multisensor interfaces. These interfaces support smart phones, wearables, in-vehicle and robotic applications, and many other areas that are now highly competitive commercially. This edited collection is written by international experts and pioneers in the field. It provides a textbook, reference, and technology roadmap for professionals working in this and related areas. This first volume of the handbook presents relevant theory and neuroscience foundations for guiding the development of high-performance systems. Additional chapters discuss approaches to user modeling and interface designs that support user choice, that synergistically combine modalities with sensors, and that blend multimodal input and output. This volume also highlights an in-depth look at the most common multimodal-multisensor combinations—for example, touch and pen input, haptic and non-speech audio output, and speech-centric systems that co-process either gestures, pen input, gaze, or visible lip movements. A common theme throughout these chapters is supporting mobility and individual differences among users. These handbook chapters provide walk-through examples of system design and processing, information on tools and practical resources for developing and evaluating new systems, and terminology and tutorial support for mastering this emerging field. In the final section of this volume, experts exchange views on a timely and controversial challenge topic, and how they believe multimodal-multisensor interfaces should be designed in the future to most effectively advance human performance.
Human-Computer Interaction. User Interface Design, Development and Multimodality
Author: Masaaki Kurosu
Publisher: Springer
ISBN: 331958071X
Category : Computers
Languages : en
Pages : 747
Book Description
The two-volume set LNCS 10271 and 10272 constitutes the refereed proceedings of the 19th International Conference on Human-Computer Interaction, HCII 2017, held in Vancouver, BC, Canada, in July 2017. The total of 1228 papers presented at the 15 colocated HCII 2017 conferences was carefully reviewed and selected from 4340 submissions. The papers address the latest research and development efforts and highlight the human aspects of design and use of computing systems. They cover the entire field of Human-Computer Interaction, addressing major advances in knowledge and effective use of computers in a variety of application areas. The papers included in this volume cover the following topics: HCI theory and education; HCI, innovation and technology acceptance; interaction design and evaluation methods; user interface development; methods, tools, and architectures; multimodal interaction; and emotions in HCI.
Publisher: Springer
ISBN: 331958071X
Category : Computers
Languages : en
Pages : 747
Book Description
The two-volume set LNCS 10271 and 10272 constitutes the refereed proceedings of the 19th International Conference on Human-Computer Interaction, HCII 2017, held in Vancouver, BC, Canada, in July 2017. The total of 1228 papers presented at the 15 colocated HCII 2017 conferences was carefully reviewed and selected from 4340 submissions. The papers address the latest research and development efforts and highlight the human aspects of design and use of computing systems. They cover the entire field of Human-Computer Interaction, addressing major advances in knowledge and effective use of computers in a variety of application areas. The papers included in this volume cover the following topics: HCI theory and education; HCI, innovation and technology acceptance; interaction design and evaluation methods; user interface development; methods, tools, and architectures; multimodal interaction; and emotions in HCI.