Active Vision for Scene Understanding
Author: Grotz, Markus
Publisher: KIT Scientific Publishing
ISBN: 3731511010
Category : Computers
Languages : en
Pages : 202
Book Description
Visual perception is one of the most important sources of information for both humans and robots. A particular challenge is the acquisition and interpretation of complex unstructured scenes. This work contributes to active vision for humanoid robots. A semantic model of the scene is created, which is extended by successively changing the robot's view in order to explore interaction possibilities of the scene.
Multimodal Scene Understanding
Author: Michael Ying Yang
Publisher: Academic Press
ISBN: 0128173599
Category : Technology & Engineering
Languages : en
Pages : 424
Book Description
Multimodal Scene Understanding: Algorithms, Applications and Deep Learning presents recent advances in multi-modal computing, with a focus on computer vision and photogrammetry. It provides the latest algorithms and applications that involve combining multiple sources of information and describes the role and approaches of multi-sensory data and multi-modal deep learning. The book is ideal for researchers from the fields of computer vision, remote sensing, robotics, and photogrammetry, thus helping foster interdisciplinary interaction and collaboration between these realms. Researchers collecting and analyzing multi-sensory data collections (for example, the KITTI benchmark, stereo+laser) from different platforms, such as autonomous vehicles, surveillance cameras, UAVs, planes and satellites, will find this book to be very useful.
- Contains state-of-the-art developments on multi-modal computing
- Shines a focus on algorithms and applications
- Presents novel deep learning topics on multi-sensor fusion and multi-modal deep learning
Active Vision
Author: John M Findlay
Publisher: Oxford University Press
ISBN: 019852479X
Category : Language Arts & Disciplines
Languages : en
Pages : 235
Book Description
This title focuses on vision as an active process rather than a passive activity, and provides an integrated account of seeing and looking. The authors give a thorough description of the basic details of the visual and oculomotor systems necessary to understand active vision.
Representations and Techniques for 3D Object Recognition and Scene Interpretation
Author: Derek Hoiem
Publisher: Morgan & Claypool Publishers
ISBN: 1608457281
Category : Computers
Languages : en
Pages : 172
Book Description
One of the grand challenges of artificial intelligence is to enable computers to interpret 3D scenes and objects from imagery. This book organizes and introduces major concepts in 3D scene and object representation and inference from still images, with a focus on recent efforts to fuse models of geometry and perspective with statistical machine learning. The book is organized into three sections: (1) Interpretation of Physical Space; (2) Recognition of 3D Objects; and (3) Integrated 3D Scene Interpretation. The first discusses representations of spatial layout and techniques to interpret physical scenes from images. The second section introduces representations for 3D object categories that account for the intrinsically 3D nature of objects and provide robustness to change in viewpoints. The third section discusses strategies to unite inference of scene geometry and object pose and identity into a coherent scene interpretation. Each section broadly surveys important ideas from cognitive science and artificial intelligence research, organizes and discusses key concepts and techniques from recent work in computer vision, and describes a few sample approaches in detail. Newcomers to computer vision will benefit from introductions to basic concepts, such as single-view geometry and image classification, while experts and novices alike may find inspiration from the book's organization and discussion of the most recent ideas in 3D scene understanding and 3D object recognition. Specific topics include: mathematics of perspective geometry; visual elements of the physical scene, structural 3D scene representations; techniques and features for image and region categorization; historical perspective, computational models, and datasets and machine learning techniques for 3D object recognition; inferences of geometrical attributes of objects, such as size and pose; and probabilistic and feature-passing approaches for contextual reasoning about 3D objects and scenes. 
Table of Contents: Background on 3D Scene Models / Single-view Geometry / Modeling the Physical Scene / Categorizing Images and Regions / Examples of 3D Scene Interpretation / Background on 3D Recognition / Modeling 3D Objects / Recognizing and Understanding 3D Objects / Examples of 2D 1/2 Layout Models / Reasoning about Objects and Scenes / Cascades of Classifiers / Conclusion and Future Directions
Dynamic Data Driven Applications Systems
Author: Frederica Darema
Publisher: Springer Nature
ISBN: 3030617254
Category : Computers
Languages : en
Pages : 356
Book Description
This book constitutes the refereed proceedings of the Third International Conference on Dynamic Data Driven Application Systems, DDDAS 2020, held in Boston, MA, USA, in October 2020. The 21 full papers and 14 short papers presented in this volume were carefully reviewed and selected from 40 submissions. They cover topics such as: digital twins; environment cognizant adaptive-planning systems; energy systems; materials systems; physics-based systems analysis; imaging methods and systems; and learning systems.
Artificial Neural Networks and Machine Learning – ICANN 2024
Author: Michael Wand
Publisher: Springer Nature
ISBN: 3031723597
Category :
Languages : en
Pages : 469
Book Description
Active Vision and Perception in Human-Robot Collaboration
Author: Dimitri Ognibene
Publisher: Frontiers Media SA
ISBN: 2889745996
Category : Science
Languages : en
Pages : 192
Book Description
Advanced Multimedia Content Processing
Author: Shojiro Nishio
Publisher: Springer Science & Business Media
ISBN: 3540657622
Category : Computers
Languages : en
Pages : 466
Book Description
This volume is the Proceedings of the First International Conference on Advanced Multimedia Content Processing (AMCP ’98). With the remarkable advances made in computer and communication hardware/software system technologies, we can now easily obtain large volumes of multimedia data through advanced computer networks and store and handle them in our own personal hardware. Sophisticated and integrated multimedia content processing technologies, which are essential to building a highly advanced information-based society, are attracting ever-increasing attention in various service areas, including broadcasting, publishing, medical treatment, entertainment, and communications. The prime concerns of these technologies are how to acquire multimedia content data from the real world, how to automatically organize and store these obtained data in databases for sharing and reuse, and how to generate and create new, attractive multimedia content using the stored data. This conference brings together researchers and practitioners from academia, industry, and public agencies to present and discuss recent advances in the acquisition, management, retrieval, creation, and utilization of large amounts of multimedia content. Artistic and innovative applications through the active use of multimedia content are also subjects of interest. The conference aims at covering the following particular areas: (1) Dynamic multimedia data modeling and intelligent structuring of content based on active, bottom-up, and self-organized strategies. (2) Access architecture, querying facilities, and distribution mechanisms for multimedia content.
Eye Guidance in Reading and Scene Perception
Author: G. Underwood
Publisher: Elsevier
ISBN: 0080506232
Category : Psychology
Languages : en
Pages : 481
Book Description
The distinguished contributors to this volume have been set the problem of describing how we know where to move our eyes. There is a great deal of current interest in the use of eye movement recordings to investigate various mental processes. The common theme is that variations in eye movements indicate variations in the processing of what is being perceived, whether in reading, driving or scene perception. However, a number of problems of interpretation are now emerging, and this edited volume sets out to address these problems. The book investigates controversies concerning the variations in eye movements associated with reading ability, concerning the extent to which text is used by the guidance mechanism while reading, concerning the relationship between eye movements and the control of other body movements, the relationship between what is inspected and what is perceived, and concerning the role of visual control attention in the acquisition of complex perceptual-motor skills, in addition to the nature of the guidance mechanism itself. The origins of the volume are in discussions held at a meeting of the European Society for Cognitive Psychology (ESCOP) in Würzburg in September 1996. The discussions concerned the landing effect in reading, an effect that, if substantiated, would provide evidence of the use of parafoveal information in eye guidance, and these discussions were explored in more detail at a small meeting in Chamonix in February 1997. Many of the contributors to this volume were present at the meeting, but the arguments were not resolved in Chamonix either. Other leaders in the field were invited to contribute to the discussion, and this volume is the product. The argument remains unresolved, but the problem is certainly clearer.
Machine Intelligence 15
Author: Koichi Furukawa
Publisher: Oxford University Press
ISBN: 9780198538677
Category : Business & Economics
Languages : en
Pages : 518
Book Description
The Machine Intelligence series was founded in 1965 by Donald Michie and has included many of the most important developments in the field over the past decades. This volume focuses on the theme of intelligent agents and features work by a number of eminent figures in artificial intelligence, including John McCarthy, Alan Robinson, Robert Kowalski, and Mike Genesereth. Topics include representations of consciousness, SoftBots, parallel implementations of logic, machine learning, machine vision, and machine-based scientific discovery in molecular biology.