Active Learning of an Action Detector on Untrimmed Videos

Active Learning of an Action Detector on Untrimmed Videos PDF Author: Sunil Bandla
Publisher:
ISBN:
Category :
Languages : en
Pages : 66

Get Book Here

Book Description
Collecting and annotating videos of realistic human actions is tedious, yet critical for training action recognition systems. We propose a method to actively request the most useful video annotations among a large set of unlabeled videos. Predicting the utility of annotating unlabeled video is not trivial, since any given clip may contain multiple actions of interest, and it need not be trimmed to temporal regions of interest. To deal with this problem, we propose a detection-based active learner to train action category models. We develop a voting-based framework to localize likely intervals of interest in an unlabeled clip, and use them to estimate the total reduction in uncertainty that annotating that clip would yield. On three datasets, we show our approach can learn accurate action detectors more efficiently than alternative active learning strategies that fail to accommodate the "untrimmed" nature of real video data.

Active Learning of an Action Detector on Untrimmed Videos

Active Learning of an Action Detector on Untrimmed Videos PDF Author: Sunil Bandla
Publisher:
ISBN:
Category :
Languages : en
Pages : 66

Get Book Here

Book Description
Collecting and annotating videos of realistic human actions is tedious, yet critical for training action recognition systems. We propose a method to actively request the most useful video annotations among a large set of unlabeled videos. Predicting the utility of annotating unlabeled video is not trivial, since any given clip may contain multiple actions of interest, and it need not be trimmed to temporal regions of interest. To deal with this problem, we propose a detection-based active learner to train action category models. We develop a voting-based framework to localize likely intervals of interest in an unlabeled clip, and use them to estimate the total reduction in uncertainty that annotating that clip would yield. On three datasets, we show our approach can learn accurate action detectors more efficiently than alternative active learning strategies that fail to accommodate the "untrimmed" nature of real video data.

Computer Vision – ECCV 2018

Computer Vision – ECCV 2018 PDF Author: Vittorio Ferrari
Publisher: Springer
ISBN: 3030012522
Category : Computers
Languages : en
Pages : 875

Get Book Here

Book Description
The sixteen-volume set comprising the LNCS volumes 11205-11220 constitutes the refereed proceedings of the 15th European Conference on Computer Vision, ECCV 2018, held in Munich, Germany, in September 2018.The 776 revised papers presented were carefully reviewed and selected from 2439 submissions. The papers are organized in topical sections on learning for vision; computational photography; human analysis; human sensing; stereo and reconstruction; optimization; matching and recognition; video attention; and poster sessions.

Pattern Recognition and Computer Vision

Pattern Recognition and Computer Vision PDF Author: Zhouchen Lin
Publisher: Springer Nature
ISBN: 3030317234
Category : Computers
Languages : en
Pages : 813

Get Book Here

Book Description
The three-volume set LNCS 11857, 11858, and 11859 constitutes the refereed proceedings of the Second Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2019, held in Xi’an, China, in November 2019. The 165 revised full papers presented were carefully reviewed and selected from 412 submissions. The papers have been organized in the following topical sections: Part I: Object Detection, Tracking and Recognition, Part II: Image/Video Processing and Analysis, Part III: Data Analysis and Optimization.

Action Recognition, Temporal Localization and Detection in Trimmed and Untrimmed Videos

Action Recognition, Temporal Localization and Detection in Trimmed and Untrimmed Videos PDF Author: Rui Hou
Publisher:
ISBN:
Category :
Languages : en
Pages : 107

Get Book Here

Book Description
Automatic understanding of videos is one of the most active areas of computer vision research. It has applications in video surveillance, human computer interaction, video sports analysis, virtual and augmented reality, video retrieval etc. In this dissertation, we address four important tasks in video understanding, namely action recognition, temporal action localization, spatial-temporal action detection and video object/action segmentation. This dissertation makes contributions to above tasks by proposing. First, for video action recognition, we propose a category level feature learning method. Our proposed method automatically identifies such pairs of categories using a criterion of mutual pairwise proximity in the (kernelized) feature space, and a category-level similarity matrix where each entry corresponds to the one-vs-one SVM margin for pairs of categories. Second, for temporal action localization, we propose to exploit the temporal structure of actions by modeling an action as a sequence of sub-actions and present a computationally efficient approach. Third, we propose 3D Tube Convolutional Neural Network (TCNN) based pipeline for action detection. The proposed architecture is a unified deep network that is able to recognize and localize action based on 3D convolution features. It generalizes the popular faster R-CNN framework from images to videos. Last, an end-to-end encoder-decoder based 3D convolutional neural network pipeline is proposed, which is able to segment out the foreground objects from the background. Moreover, the action label can be obtained as well by passing the foreground object into an action classifier. Extensive experiments on several video datasets demonstrate the superior performance of the proposed approach for video understanding compared to the state-of-the-art.

Neural Information Processing

Neural Information Processing PDF Author: Mohammad Tanveer
Publisher: Springer Nature
ISBN: 3031301056
Category : Computers
Languages : en
Pages : 660

Get Book Here

Book Description
The three-volume set LNCS 13623, 13624, and 13625 constitutes the refereed proceedings of the 29th International Conference on Neural Information Processing, ICONIP 2022, held as a virtual event, November 22–26, 2022. The 146 papers presented in the proceedings set were carefully reviewed and selected from 810 submissions. They were organized in topical sections as follows: Theory and Algorithms; Cognitive Neurosciences; Human Centered Computing; and Applications. The ICONIP conference aims to provide a leading international forum for researchers, scientists, and industry professionals who are working in neuroscience, neural networks, deep learning, and related fields to share their new ideas, progress, and achievements.

Computer Vision – ECCV 2022 Workshops

Computer Vision – ECCV 2022 Workshops PDF Author: Leonid Karlinsky
Publisher: Springer Nature
ISBN: 3031250699
Category : Computers
Languages : en
Pages : 796

Get Book Here

Book Description
The 8-volume set, comprising the LNCS books 13801 until 13809, constitutes the refereed proceedings of 38 out of the 60 workshops held at the 17th European Conference on Computer Vision, ECCV 2022. The conference took place in Tel Aviv, Israel, during October 23-27, 2022; the workshops were held hybrid or online. The 367 full papers included in this volume set were carefully reviewed and selected for inclusion in the ECCV 2022 workshop proceedings. They were organized in individual parts as follows: Part I: W01 - AI for Space; W02 - Vision for Art; W03 - Adversarial Robustness in the Real World; W04 - Autonomous Vehicle Vision Part II: W05 - Learning With Limited and Imperfect Data; W06 - Advances in Image Manipulation; Part III: W07 - Medical Computer Vision; W08 - Computer Vision for Metaverse; W09 - Self-Supervised Learning: What Is Next?; Part IV: W10 - Self-Supervised Learning for Next-Generation Industry-Level Autonomous Driving; W11 - ISIC Skin Image Analysis; W12 - Cross-Modal Human-Robot Interaction; W13 - Text in Everything; W14 - BioImage Computing; W15 - Visual Object-Oriented Learning Meets Interaction: Discovery, Representations, and Applications; W16 - AI for Creative Video Editing and Understanding; W17 - Visual Inductive Priors for Data-Efficient Deep Learning; W18 - Mobile Intelligent Photography and Imaging; Part V: W19 - People Analysis: From Face, Body and Fashion to 3D Virtual Avatars; W20 - Safe Artificial Intelligence for Automated Driving; W21 - Real-World Surveillance: Applications and Challenges; W22 - Affective Behavior Analysis In-the-Wild; Part VI: W23 - Visual Perception for Navigation in Human Environments: The JackRabbot Human Body Pose Dataset and Benchmark; W24 - Distributed Smart Cameras; W25 - Causality in Vision; W26 - In-Vehicle Sensing and Monitorization; W27 - Assistive Computer Vision and Robotics; W28 - Computational Aspects of Deep Learning; Part VII: W29 - Computer Vision for Civil and Infrastructure Engineering; W30 - AI-Enabled Medical Image Analysis: Digital Pathology and Radiology/COVID19; W31 - Compositional and Multimodal Perception; Part VIII: W32 - Uncertainty Quantification for Computer Vision; W33 - Recovering 6D Object Pose; W34 - Drawings and Abstract Imagery: Representation and Analysis; W35 - Sign Language Understanding; W36 - A Challenge for Out-of-Distribution Generalization in Computer Vision; W37 - Vision With Biased or Scarce Data; W38 - Visual Object Tracking Challenge.

Neural Information Processing

Neural Information Processing PDF Author: Teddy Mantoro
Publisher: Springer Nature
ISBN: 3030922383
Category : Computers
Languages : en
Pages : 724

Get Book Here

Book Description
The four-volume proceedings LNCS 13108, 13109, 13110, and 13111 constitutes the proceedings of the 28th International Conference on Neural Information Processing, ICONIP 2021, which was held during December 8-12, 2021. The conference was planned to take place in Bali, Indonesia but changed to an online format due to the COVID-19 pandemic. The total of 226 full papers presented in these proceedings was carefully reviewed and selected from 1093 submissions. The papers were organized in topical sections as follows: Part I: Theory and algorithms; Part II: Theory and algorithms; human centred computing; AI and cybersecurity; Part III: Cognitive neurosciences; reliable, robust, and secure machine learning algorithms; theory and applications of natural computing paradigms; advances in deep and shallow machine learning algorithms for biomedical data and imaging; applications; Part IV: Applications.

Computational Science – ICCS 2019

Computational Science – ICCS 2019 PDF Author: João M. F. Rodrigues
Publisher: Springer
ISBN: 3030227413
Category : Computers
Languages : en
Pages : 690

Get Book Here

Book Description
The five-volume set LNCS 11536, 11537, 11538, 11539 and 11540 constitutes the proceedings of the 19th International Conference on Computational Science, ICCS 2019, held in Faro, Portugal, in June 2019. The total of 65 full papers and 168 workshop papers presented in this book set were carefully reviewed and selected from 573 submissions (228 submissions to the main track and 345 submissions to the workshops). The papers were organized in topical sections named: Part I: ICCS Main Track Part II: ICCS Main Track; Track of Advances in High-Performance Computational Earth Sciences: Applications and Frameworks; Track of Agent-Based Simulations, Adaptive Algorithms and Solvers; Track of Applications of Matrix Methods in Artificial Intelligence and Machine Learning; Track of Architecture, Languages, Compilation and Hardware Support for Emerging and Heterogeneous Systems Part III: Track of Biomedical and Bioinformatics Challenges for Computer Science; Track of Classifier Learning from Difficult Data; Track of Computational Finance and Business Intelligence; Track of Computational Optimization, Modelling and Simulation; Track of Computational Science in IoT and Smart Systems Part IV: Track of Data-Driven Computational Sciences; Track of Machine Learning and Data Assimilation for Dynamical Systems; Track of Marine Computing in the Interconnected World for the Benefit of the Society; Track of Multiscale Modelling and Simulation; Track of Simulations of Flow and Transport: Modeling, Algorithms and Computation Part V: Track of Smart Systems: Computer Vision, Sensor Networks and Machine Learning; Track of Solving Problems with Uncertainties; Track of Teaching Computational Science; Poster Track ICCS 2019 Chapter “Comparing Domain-decomposition Methods for the Parallelization of Distributed Land Surface Models” is available open access under a Creative Commons Attribution 4.0 International License via link.springer.com.

Image and Graphics

Image and Graphics PDF Author: Yao Zhao
Publisher: Springer Nature
ISBN: 3030341135
Category : Computers
Languages : en
Pages : 688

Get Book Here

Book Description
This three-volume set LNCS 11901, 11902, and 11903 constitutes the refereed conference proceedings of the 10thth International Conference on Image and Graphics, ICIG 2019, held in Beijing, China, in August 2019. The 183 full papers presented were selected from 384 submissions and focus on advances of theory, techniques and algorithms as well as innovative technologies of image, video and graphics processing and fostering innovation, entrepreneurship, and networking.

Biologically Motivated Computer Vision

Biologically Motivated Computer Vision PDF Author: Heinrich H. Bülthoff
Publisher: Springer
ISBN: 3540361812
Category : Computers
Languages : en
Pages : 676

Get Book Here

Book Description
This book constitutes the refereed proceedings of the Second International Workshop on Biologically Motivated Computer Vision, BMCV 2002, held in Tübingen, Germany, in November 2002. The 22 revised full papers and 37 revised short papers presented together with 6 invited papers were carefully reviewed and selected from 97 submissions. The papers are organized in topical sections on neurons and features, motion, mid-level vision, recognition - from scenes to neurons, attention, robotics, and cognitive vision.