Towards Recognizing New Semantic Concepts in New Visual Domains

Towards Recognizing New Semantic Concepts in New Visual Domains PDF Author: Massimiliano Mancini
Publisher: Sapienza Università Editrice
ISBN: 8893772485
Category : Computers
Languages : en
Pages : 285

Get Book Here

Book Description
Despite being the leading paradigm in computer vision, deep neural networks are inherently limited by the visual and semantic information contained in their training set. In this thesis, we aim to design deep models operating with previously unseen visual domains and semantic concepts. We first describe different solutions for generalizing to new visual domains, applying variants of normalization layers to multiple challenging settings e.g. where new domain data is not available but arrives online or is described by metadata. In the second part, we incorporate new semantic concepts into pretrained deep models. We propose specific solutions for different problems such as multi-task/incremental learning and open-world recognition. Finally, we merge the two challenges: given images of multiple domains and categories, can we recognize unseen concepts in unseen domains? We propose an approach that is the first, promising step, towards solving this problem. Winner of the Competition “Prize for PhD Thesis 2020” arranged by Sapienza University Press.

Towards Recognizing New Semantic Concepts in New Visual Domains

Towards Recognizing New Semantic Concepts in New Visual Domains PDF Author: Massimiliano Mancini
Publisher: Sapienza Università Editrice
ISBN: 8893772485
Category : Computers
Languages : en
Pages : 285

Get Book Here

Book Description
Despite being the leading paradigm in computer vision, deep neural networks are inherently limited by the visual and semantic information contained in their training set. In this thesis, we aim to design deep models operating with previously unseen visual domains and semantic concepts. We first describe different solutions for generalizing to new visual domains, applying variants of normalization layers to multiple challenging settings e.g. where new domain data is not available but arrives online or is described by metadata. In the second part, we incorporate new semantic concepts into pretrained deep models. We propose specific solutions for different problems such as multi-task/incremental learning and open-world recognition. Finally, we merge the two challenges: given images of multiple domains and categories, can we recognize unseen concepts in unseen domains? We propose an approach that is the first, promising step, towards solving this problem. Winner of the Competition “Prize for PhD Thesis 2020” arranged by Sapienza University Press.

Computer Vision – ECCV 2020

Computer Vision – ECCV 2020 PDF Author: Andrea Vedaldi
Publisher: Springer Nature
ISBN: 3030586219
Category : Computers
Languages : en
Pages : 817

Get Book Here

Book Description
The 30-volume set, comprising the LNCS books 12346 until 12375, constitutes the refereed proceedings of the 16th European Conference on Computer Vision, ECCV 2020, which was planned to be held in Glasgow, UK, during August 23-28, 2020. The conference was held virtually due to the COVID-19 pandemic. The 1360 revised papers presented in these proceedings were carefully reviewed and selected from a total of 5025 submissions. The papers deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; object recognition; motion estimation.

Visual Domain Adaptation in the Deep Learning Era

Visual Domain Adaptation in the Deep Learning Era PDF Author: Gabriela Csurka
Publisher: Springer Nature
ISBN: 3031791754
Category : Computers
Languages : en
Pages : 182

Get Book Here

Book Description
Solving problems with deep neural networks typically relies on massive amounts of labeled training data to achieve high performance. While in many situations huge volumes of unlabeled data can be and often are generated and available, the cost of acquiring data labels remains high. Transfer learning (TL), and in particular domain adaptation (DA), has emerged as an effective solution to overcome the burden of annotation, exploiting the unlabeled data available from the target domain together with labeled data or pre-trained models from similar, yet different source domains. The aim of this book is to provide an overview of such DA/TL methods applied to computer vision, a field whose popularity has increased significantly in the last few years. We set the stage by revisiting the theoretical background and some of the historical shallow methods before discussing and comparing different domain adaptation strategies that exploit deep architectures for visual recognition. We introduce the space of self-training-based methods that draw inspiration from the related fields of deep semi-supervised and self-supervised learning in solving the deep domain adaptation. Going beyond the classic domain adaptation problem, we then explore the rich space of problem settings that arise when applying domain adaptation in practice such as partial or open-set DA, where source and target data categories do not fully overlap, continuous DA where the target data comes as a stream, and so on. We next consider the least restrictive setting of domain generalization (DG), as an extreme case where neither labeled nor unlabeled target data are available during training. Finally, we close by considering the emerging area of learning-to-learn and how it can be applied to further improve existing approaches to cross domain learning problems such as DA and DG.

Neural Information Processing

Neural Information Processing PDF Author: Teddy Mantoro
Publisher: Springer Nature
ISBN: 3030922731
Category : Computers
Languages : en
Pages : 718

Get Book Here

Book Description
The four-volume proceedings LNCS 13108, 13109, 13110, and 13111 constitutes the proceedings of the 28th International Conference on Neural Information Processing, ICONIP 2021, which was held during December 8-12, 2021. The conference was planned to take place in Bali, Indonesia but changed to an online format due to the COVID-19 pandemic. The total of 226 full papers presented in these proceedings was carefully reviewed and selected from 1093 submissions. The papers were organized in topical sections as follows: Part I: Theory and algorithms; Part II: Theory and algorithms; human centred computing; AI and cybersecurity; Part III: Cognitive neurosciences; reliable, robust, and secure machine learning algorithms; theory and applications of natural computing paradigms; advances in deep and shallow machine learning algorithms for biomedical data and imaging; applications; Part IV: Applications.

Domain Adaptation in Computer Vision Applications

Domain Adaptation in Computer Vision Applications PDF Author: Gabriela Csurka
Publisher: Springer
ISBN: 3319583476
Category : Computers
Languages : en
Pages : 338

Get Book Here

Book Description
This comprehensive text/reference presents a broad review of diverse domain adaptation (DA) methods for machine learning, with a focus on solutions for visual applications. The book collects together solutions and perspectives proposed by an international selection of pre-eminent experts in the field, addressing not only classical image categorization, but also other computer vision tasks such as detection, segmentation and visual attributes. Topics and features: surveys the complete field of visual DA, including shallow methods designed for homogeneous and heterogeneous data as well as deep architectures; presents a positioning of the dataset bias in the CNN-based feature arena; proposes detailed analyses of popular shallow methods that addresses landmark data selection, kernel embedding, feature alignment, joint feature transformation and classifier adaptation, or the case of limited access to the source data; discusses more recent deep DA methods, including discrepancy-based adaptation networks and adversarial discriminative DA models; addresses domain adaptation problems beyond image categorization, such as a Fisher encoding adaptation for vehicle re-identification, semantic segmentation and detection trained on synthetic images, and domain generalization for semantic part detection; describes a multi-source domain generalization technique for visual attributes and a unifying framework for multi-domain and multi-task learning. This authoritative volume will be of great interest to a broad audience ranging from researchers and practitioners, to students involved in computer vision, pattern recognition and machine learning.

Pattern Recognition and Computer Vision

Pattern Recognition and Computer Vision PDF Author: Qingshan Liu
Publisher: Springer Nature
ISBN: 9819984351
Category : Computers
Languages : en
Pages : 532

Get Book Here

Book Description
The 13-volume set LNCS 14425-14437 constitutes the refereed proceedings of the 6th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2023, held in Xiamen, China, during October 13–15, 2023. The 532 full papers presented in these volumes were selected from 1420 submissions. The papers have been organized in the following topical sections: Action Recognition, Multi-Modal Information Processing, 3D Vision and Reconstruction, Character Recognition, Fundamental Theory of Computer Vision, Machine Learning, Vision Problems in Robotics, Autonomous Driving, Pattern Classification and Cluster Analysis, Performance Evaluation and Benchmarks, Remote Sensing Image Interpretation, Biometric Recognition, Face Recognition and Pose Recognition, Structural Pattern Recognition, Computational Photography, Sensing and Display Technology, Video Analysis and Understanding, Vision Applications and Systems, Document Analysis and Recognition, Feature Extraction and Feature Selection, Multimedia Analysis and Reasoning, Optimization and Learning methods, Neural Network and Deep Learning, Low-Level Vision and Image Processing, Object Detection, Tracking and Identification, Medical Image Processing and Analysis.

Robot Programming by Demonstration

Robot Programming by Demonstration PDF Author: Sylvain Calinon
Publisher: EPFL Press
ISBN: 9781439808672
Category : Computers
Languages : en
Pages : 248

Get Book Here

Book Description
Recent advances in RbD have identified a number of key issues for ensuring a generic approach to the transfer of skills across various agents and contexts. This book focuses on the two generic questions of what to imitate and how to imitate and proposes active teaching methods.

Transfer Learning

Transfer Learning PDF Author: Qiang Yang
Publisher: Cambridge University Press
ISBN: 1108860087
Category : Computers
Languages : en
Pages : 394

Get Book Here

Book Description
Transfer learning deals with how systems can quickly adapt themselves to new situations, tasks and environments. It gives machine learning systems the ability to leverage auxiliary data and models to help solve target problems when there is only a small amount of data available. This makes such systems more reliable and robust, keeping the machine learning model faced with unforeseeable changes from deviating too much from expected performance. At an enterprise level, transfer learning allows knowledge to be reused so experience gained once can be repeatedly applied to the real world. For example, a pre-trained model that takes account of user privacy can be downloaded and adapted at the edge of a computer network. This self-contained, comprehensive reference text describes the standard algorithms and demonstrates how these are used in different transfer learning paradigms. It offers a solid grounding for newcomers as well as new insights for seasoned researchers and developers.

Computer Vision – ECCV 2018

Computer Vision – ECCV 2018 PDF Author: Vittorio Ferrari
Publisher: Springer
ISBN: 3030012492
Category : Computers
Languages : en
Pages : 880

Get Book Here

Book Description
The sixteen-volume set comprising the LNCS volumes 11205-11220 constitutes the refereed proceedings of the 15th European Conference on Computer Vision, ECCV 2018, held in Munich, Germany, in September 2018.The 776 revised papers presented were carefully reviewed and selected from 2439 submissions. The papers are organized in topical sections on learning for vision; computational photography; human analysis; human sensing; stereo and reconstruction; optimization; matching and recognition; video attention; and poster sessions.

Intelligent Computing

Intelligent Computing PDF Author: Kohei Arai
Publisher: Springer Nature
ISBN: 3030801195
Category : Technology & Engineering
Languages : en
Pages : 1184

Get Book Here

Book Description
This book is a comprehensive collection of chapters focusing on the core areas of computing and their further applications in the real world. Each chapter is a paper presented at the Computing Conference 2021 held on 15-16 July 2021. Computing 2021 attracted a total of 638 submissions which underwent a double-blind peer review process. Of those 638 submissions, 235 submissions have been selected to be included in this book. The goal of this conference is to give a platform to researchers with fundamental contributions and to be a premier venue for academic and industry practitioners to share new ideas and development experiences. We hope that readers find this volume interesting and valuable as it provides the state-of-the-art intelligent methods and techniques for solving real-world problems. We also expect that the conference and its publications is a trigger for further related research and technology improvements in this important subject.