Towards Recognizing New Semantic Concepts in New Visual Domains

Towards Recognizing New Semantic Concepts in New Visual Domains PDF Author: Massimiliano Mancini
Publisher: Sapienza Università Editrice
ISBN: 8893772485
Category : Computers
Languages : en
Pages : 285

Get Book Here

Book Description
Despite being the leading paradigm in computer vision, deep neural networks are inherently limited by the visual and semantic information contained in their training set. In this thesis, we aim to design deep models operating with previously unseen visual domains and semantic concepts. We first describe different solutions for generalizing to new visual domains, applying variants of normalization layers to multiple challenging settings e.g. where new domain data is not available but arrives online or is described by metadata. In the second part, we incorporate new semantic concepts into pretrained deep models. We propose specific solutions for different problems such as multi-task/incremental learning and open-world recognition. Finally, we merge the two challenges: given images of multiple domains and categories, can we recognize unseen concepts in unseen domains? We propose an approach that is the first, promising step, towards solving this problem. Winner of the Competition “Prize for PhD Thesis 2020” arranged by Sapienza University Press.

Towards Recognizing New Semantic Concepts in New Visual Domains

Towards Recognizing New Semantic Concepts in New Visual Domains PDF Author: Massimiliano Mancini
Publisher: Sapienza Università Editrice
ISBN: 8893772485
Category : Computers
Languages : en
Pages : 285

Get Book Here

Book Description
Despite being the leading paradigm in computer vision, deep neural networks are inherently limited by the visual and semantic information contained in their training set. In this thesis, we aim to design deep models operating with previously unseen visual domains and semantic concepts. We first describe different solutions for generalizing to new visual domains, applying variants of normalization layers to multiple challenging settings e.g. where new domain data is not available but arrives online or is described by metadata. In the second part, we incorporate new semantic concepts into pretrained deep models. We propose specific solutions for different problems such as multi-task/incremental learning and open-world recognition. Finally, we merge the two challenges: given images of multiple domains and categories, can we recognize unseen concepts in unseen domains? We propose an approach that is the first, promising step, towards solving this problem. Winner of the Competition “Prize for PhD Thesis 2020” arranged by Sapienza University Press.

Machine Learning Techniques for Adaptive Multimedia Retrieval: Technologies Applications and Perspectives

Machine Learning Techniques for Adaptive Multimedia Retrieval: Technologies Applications and Perspectives PDF Author: Wei, Chia-Hung
Publisher: IGI Global
ISBN: 1616928611
Category : Computers
Languages : en
Pages : 409

Get Book Here

Book Description
"This book disseminates current information on multimedia retrieval, advancing the field of multimedia databases, and educating the multimedia database community on machine learning techniques for adaptive multimedia retrieval research, design and applications"--Provided by publisher.

Machine Learning and Data Mining in Pattern Recognition

Machine Learning and Data Mining in Pattern Recognition PDF Author: Petra Perner
Publisher: Springer
ISBN: 354044596X
Category : Computers
Languages : en
Pages : 373

Get Book Here

Book Description
This book constitutes the refereed proceedings of the Second International Workshop on Machine Learning and Data Mining in Pattern Recognition, MLDM 2001, held in Leipzig, Germany in July 2001. The 26 revised full papers presented together with two invited papers were carefully reviewed and selected for inclusion in the proceedings. The papers are organized in topical sections on case-based reasoning and associative memory; rule induction and grammars; clustering and conceptual clustering; data mining on signals, images, and spatio-temporal data; nonlinear function learning and neural net based learning; learning for handwriting recognition; statistical and evolutionary learning; and content-based image retrieval.

Image and Video Retrieval

Image and Video Retrieval PDF Author: Wee-Kheng Leow
Publisher: Springer Science & Business Media
ISBN: 3540278583
Category : Computers
Languages : en
Pages : 686

Get Book Here

Book Description
It was our great pleasure to host the 4th International Conference on Image and Video Retrieval (CIVR) at the National University of Singapore on 20–22 July 2005. CIVR aims to provide an international forum for the discussion of research challenges and exchange of ideas among researchers and practitioners in image/video retrieval technologies. It addresses innovative research in the broad ?eld of image and video retrieval. A unique feature of this conference is the high level of participation by researchers from both academia and industry. Another unique feature of CIVR this year was in its format – it o?ered both the traditional oral presentation sessions, as well as the short presentation cum poster sessions. The latter provided an informal alternative forum for animated discussions and exchanges of ideas among the participants. We are pleased to note that interest in CIVR has grown over the years. The number of submissions has steadily increased from 82 in 2002, to 119 in 2003, and 125 in 2004. This year, we received 128 submissions from the international communities:with81(63.3%)fromAsiaandAustralia,25(19.5%)fromEurope, and 22 (17.2%) from North America. After a rigorous review process, 20 papers were accepted for oral presentations, and 42 papers were accepted for poster presentations. In addition to the accepted submitted papers, the program also included 4 invited papers, 1 keynote industrial paper, and 4 invited industrial papers. Altogether, we o?ered a diverse and interesting program, addressing the current interests and future trends in this area.

Computer Vision – ECCV 2022

Computer Vision – ECCV 2022 PDF Author: Shai Avidan
Publisher: Springer Nature
ISBN: 3031200594
Category : Computers
Languages : en
Pages : 810

Get Book Here

Book Description
The 39-volume set, comprising the LNCS books 13661 until 13699, constitutes the refereed proceedings of the 17th European Conference on Computer Vision, ECCV 2022, held in Tel Aviv, Israel, during October 23–27, 2022. The 1645 papers presented in these proceedings were carefully reviewed and selected from a total of 5804 submissions. The papers deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; object recognition; motion estimation.

E-Business Applications for Product Development and Competitive Growth: Emerging Technologies

E-Business Applications for Product Development and Competitive Growth: Emerging Technologies PDF Author: Lee, In
Publisher: IGI Global
ISBN: 1609601343
Category : Business & Economics
Languages : en
Pages : 503

Get Book Here

Book Description
"This book will serve as an integrated e-business knowledge base for those who are interested in the advancement of e-business theory and practice through a variety of research methods including theoretical, experimental, case, and survey research methods"--Provided by publisher.

Computer Vision – ECCV 2020

Computer Vision – ECCV 2020 PDF Author: Andrea Vedaldi
Publisher: Springer Nature
ISBN: 3030585921
Category : Computers
Languages : en
Pages : 840

Get Book Here

Book Description
The 30-volume set, comprising the LNCS books 12346 until 12375, constitutes the refereed proceedings of the 16th European Conference on Computer Vision, ECCV 2020, which was planned to be held in Glasgow, UK, during August 23-28, 2020. The conference was held virtually due to the COVID-19 pandemic. The 1360 revised papers presented in these proceedings were carefully reviewed and selected from a total of 5025 submissions. The papers deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; object recognition; motion estimation.

Knowledge-Driven Multimedia Information Extraction and Ontology Evolution

Knowledge-Driven Multimedia Information Extraction and Ontology Evolution PDF Author: Georgios Paliouras
Publisher: Springer
ISBN: 3642207952
Category : Computers
Languages : en
Pages : 251

Get Book Here

Book Description
This book presents the state of the art in the areas of ontology evolution and knowledge-driven multimedia information extraction, placing an emphasis on how the two can be combined to bridge the semantic gap. This was also the goal of the EC-sponsored BOEMIE (Bootstrapping Ontology Evolution with Multimedia Information Extraction) project, to which the authors of this book have all contributed. The book addresses researchers and practitioners in the field of computer science and more specifically in knowledge representation and management, ontology evolution, and information extraction from multimedia data. It may also constitute an excellent guide to students attending courses within a computer science study program, addressing information processing and extraction from any type of media (text, images, and video). Among other things, the book gives concrete examples of how several of the methods discussed can be applied to athletics (track and field) events.

Image and Video Retrieval

Image and Video Retrieval PDF Author: Erwin M. Bakker
Publisher: Springer Science & Business Media
ISBN: 3540406344
Category : Computers
Languages : en
Pages : 528

Get Book Here

Book Description
Welcome to the 2nd International Conference on Image and Video Retrieval, CIVR2003. The goal of CIVR is to illuminate the state of the art in visual information retrieval and to stimulate collaboration between researchers and practitioners. This year we received 110 submissions from 26 countries. Based upon the reviews of at least 3 members of the program committee, 43 papers were accepted for the research track of the conference. First, we would like to thank all of the members of the Program Committee and the additional referees listed below. Their reviews of the submissions played a pivotal role in the quality of the conference. Moreover,we are grateful to Nicu Sebe and Xiang Zhou for helping to organize the review process; Shih-Fu Chang and Alberto del Bimbo for setting up the practitioner track; and Erwin Bakker for editing the proceedings and designing the conference poster. Special thanks go to our keynote and plenary speakers, Nevenka Dimitrova fromPhilipsResearch,RameshJainfromGeorgiaTech,ChrisPorterfromGetty Images,andAlanSmeatonfromDublinCityUniversity.Furthermore,wewishto acknowledge our sponsors, the Beckman Institute at the University of Illinois at Urbana-Champaign,TsingHuaUniversity,theInstitutionofElectricalEngineers (IEE),PhilipsResearch,andtheLeidenInstituteofAdvancedComputerScience at Leiden University. Finally, we would like to express our thanks to severalpeople who performed important work related to the organization of the conference: Jennifer Quirk and Catherine Zech for the localorganizationat the BeckmanInstitute; Richard Harvey for his help with promotional activity and sponsorship for CIVR2003; andtotheorganizingcommitteeofthe?rstCIVRforsettinguptheinternational mission and structure of the conference.

Multimedia Content Analysis

Multimedia Content Analysis PDF Author: Ajay Divakaran
Publisher: Springer Science & Business Media
ISBN: 0387765697
Category : Computers
Languages : en
Pages : 412

Get Book Here

Book Description
Multimedia Content Analysis: Theory and Applications covers the latest in multimedia content analysis and applications based on such analysis. As research has progressed, it has become clear that this field has to appeal to other disciplines such as psycho-physics, media production, etc. This book consists of invited chapters that cover the entire range of the field. Some of the topics covered include low-level audio-visual analysis based retrieval and indexing techniques, the TRECVID effort, video browsing interfaces, content creation and content analysis, and multimedia analysis-based applications, among others. The chapters are written by leading researchers in the multimedia field.