Tensor-based Spatio-temporal Outlier Detection in Large Datasets

Tensor-based Spatio-temporal Outlier Detection in Large Datasets PDF Author: Yanan Sun
Publisher:
ISBN:
Category :
Languages : en
Pages : 322

Get Book Here

Book Description
Spatio-Temporal data is inherently large since each spatial node has spatial attributes and may also be associated with large amounts of measurement data captured over time. In such large and multi-dimensional data identifying anomalies can be a challenge due to the massive data size and relationships among spatial objects. Discovering anomalies in spatio-temporal data is relevant in several domains, such as detecting rare disease outbreaks, detecting oil spills, discovering regions with highway traffic congestion. Most existing techniques for discovering anomalies in spatio-temporal data may find the spatial outliers first and then identify the spatio-temporal anomalies from the data of that specific spatial location. Alternatively, some approaches may discover anomalous time periods and then discover the unusual spatial location in them. This may lead to identifying incorrect spatio-temporal outliers or missing important spatio-temporal phenomena due to the elimination of information after each step. Thus, there is a need to address capturing both space and time simultaneously. A tensor is a multi-dimensional array. It is considered as a powerful tool to manipulate multi-dimensional and multi-variate data. It has a concise mathematical framework for formulating and solving complex data problems efficiently. Tensors can handle complex relationship in spatio-temporal data and their mathematical framework can help us detect spatio-temporal outliers in an effective and efficient manner. Tensors multiplication can integrate spatial and temporal aspects in the data at the same time. In this dissertation, we present our novel approach addressing the key limitation of existing spatio-temporal outlier detection methods by using an efficient tensor based model that supports complex relationships in spatio-temporal data to detect outliers by looking at space and time simultaneously as well as handling the scalability issue when it comes to manipulating large datasets. In this dissertation, we are going to present our novel spatio-temporal tensor model. Based on the spatio-temporal tensor model, we present our clustering-based neighborhood discovery algorithm, neighborhood-based spatial, and spatio-temporal outlier detection algorithms to discover different types of spatio-temporal outliers namely point based and window based outliers. We discuss detailed experimental results for each of the algorithms proposed and also present comparative results.

Tensor-based Spatio-temporal Outlier Detection in Large Datasets

Tensor-based Spatio-temporal Outlier Detection in Large Datasets PDF Author: Yanan Sun
Publisher:
ISBN:
Category :
Languages : en
Pages : 322

Get Book Here

Book Description
Spatio-Temporal data is inherently large since each spatial node has spatial attributes and may also be associated with large amounts of measurement data captured over time. In such large and multi-dimensional data identifying anomalies can be a challenge due to the massive data size and relationships among spatial objects. Discovering anomalies in spatio-temporal data is relevant in several domains, such as detecting rare disease outbreaks, detecting oil spills, discovering regions with highway traffic congestion. Most existing techniques for discovering anomalies in spatio-temporal data may find the spatial outliers first and then identify the spatio-temporal anomalies from the data of that specific spatial location. Alternatively, some approaches may discover anomalous time periods and then discover the unusual spatial location in them. This may lead to identifying incorrect spatio-temporal outliers or missing important spatio-temporal phenomena due to the elimination of information after each step. Thus, there is a need to address capturing both space and time simultaneously. A tensor is a multi-dimensional array. It is considered as a powerful tool to manipulate multi-dimensional and multi-variate data. It has a concise mathematical framework for formulating and solving complex data problems efficiently. Tensors can handle complex relationship in spatio-temporal data and their mathematical framework can help us detect spatio-temporal outliers in an effective and efficient manner. Tensors multiplication can integrate spatial and temporal aspects in the data at the same time. In this dissertation, we present our novel approach addressing the key limitation of existing spatio-temporal outlier detection methods by using an efficient tensor based model that supports complex relationships in spatio-temporal data to detect outliers by looking at space and time simultaneously as well as handling the scalability issue when it comes to manipulating large datasets. In this dissertation, we are going to present our novel spatio-temporal tensor model. Based on the spatio-temporal tensor model, we present our clustering-based neighborhood discovery algorithm, neighborhood-based spatial, and spatio-temporal outlier detection algorithms to discover different types of spatio-temporal outliers namely point based and window based outliers. We discuss detailed experimental results for each of the algorithms proposed and also present comparative results.

Outlier Detection for Temporal Data

Outlier Detection for Temporal Data PDF Author: Manish Gupta
Publisher: Springer Nature
ISBN: 3031019059
Category : Computers
Languages : en
Pages : 110

Get Book Here

Book Description
Outlier (or anomaly) detection is a very broad field which has been studied in the context of a large number of research areas like statistics, data mining, sensor networks, environmental science, distributed systems, spatio-temporal mining, etc. Initial research in outlier detection focused on time series-based outliers (in statistics). Since then, outlier detection has been studied on a large variety of data types including high-dimensional data, uncertain data, stream data, network data, time series data, spatial data, and spatio-temporal data. While there have been many tutorials and surveys for general outlier detection, we focus on outlier detection for temporal data in this book. A large number of applications generate temporal datasets. For example, in our everyday life, various kinds of records like credit, personnel, financial, judicial, medical, etc., are all temporal. This stresses the need for an organized and detailed study of outliers with respect to such temporal data. In the past decade, there has been a lot of research on various forms of temporal data including consecutive data snapshots, series of data snapshots and data streams. Besides the initial work on time series, researchers have focused on rich forms of data including multiple data streams, spatio-temporal data, network data, community distribution data, etc. Compared to general outlier detection, techniques for temporal outlier detection are very different. In this book, we will present an organized picture of both recent and past research in temporal outlier detection. We start with the basics and then ramp up the reader to the main ideas in state-of-the-art outlier detection techniques. We motivate the importance of temporal outlier detection and brief the challenges beyond usual outlier detection. Then, we list down a taxonomy of proposed techniques for temporal outlier detection. Such techniques broadly include statistical techniques (like AR models, Markov models, histograms, neural networks), distance- and density-based approaches, grouping-based approaches (clustering, community detection), network-based approaches, and spatio-temporal outlier detection approaches. We summarize by presenting a wide collection of applications where temporal outlier detection techniques have been applied to discover interesting outliers. Table of Contents: Preface / Acknowledgments / Figure Credits / Introduction and Challenges / Outlier Detection for Time Series and Data Sequences / Outlier Detection for Data Streams / Outlier Detection for Distributed Data Streams / Outlier Detection for Spatio-Temporal Data / Outlier Detection for Temporal Network Data / Applications of Outlier Detection for Temporal Data / Conclusions and Research Directions / Bibliography / Authors' Biographies

Outlier Detection for Temporal Data

Outlier Detection for Temporal Data PDF Author: Manish Gupta
Publisher:
ISBN: 9781627053754
Category : Outliers (Statistics)
Languages : en
Pages : 0

Get Book Here

Book Description
Outlier (or anomaly) detection is a very broad field which has been studied in the context of a large number of research areas like statistics, data mining, sensor networks, environmental science, distributed systems, spatio-temporal mining, etc. Initial research in outlier detection focused on time series-based outliers (in statistics). Since then, outlier detection has been studied on a large variety of data types including high-dimensional data, uncertain data, stream data, network data, time series data, spatial data, and spatio-temporal data. While there have been many tutorials and surveys for general outlier detection, we focus on outlier detection for temporal data in this book. A large number of applications generate temporal datasets. For example, in our everyday life, various kinds of records like credit, personnel, financial, judicial, medical, etc., are all temporal. This stresses the need for an organized and detailed study of outliers with respect to such temporal data. In the past decade, there has been a lot of research on various forms of temporal data including consecutive data snapshots, series of data snapshots and data streams. Besides the initial work on time series, researchers have focused on rich forms of data including multiple data streams, spatio-temporal data, network data, community distribution data, etc. Compared to general outlier detection, techniques for temporal outlier detection are very different. In this book, we will present an organized picture of both recent and past research in temporal outlier detection. We start with the basics and then ramp up the reader to the main ideas in state-of-the-art outlier detection techniques. We motivate the importance of temporal outlier detection and brief the challenges beyond usual outlier detection. Then, we list down a taxonomy of proposed techniques for temporal outlier detection. Such techniques broadly include statistical techniques (like AR models, Markov models, histograms, neural networks), distance- and density-based approaches, grouping-based approaches (clustering, community detection), network-based approaches, and spatio-temporal outlier detection approaches. We summarize by presenting a wide collection of applications where temporal outlier detection techniques have been applied to discover interesting outliers.

Spatio-temporal Anomaly Detection

Spatio-temporal Anomaly Detection PDF Author: Mahashweta Das
Publisher:
ISBN:
Category :
Languages : en
Pages : 61

Get Book Here

Book Description
Abstract: Recent advances in computational sciences have led to the generation and utilization of enormous amounts of spatio-temporal data in numerous scientific disciplines, such as wireless sensor networks, bioinformatics, astrophysics and computational fluid dynamics. The need to efficiently handle and effectively analyze such humongous amount of data has led to the application of data mining techniques in this domain of research. Spatio-temporal data mining is the process of maneuvering such datasets to extract interesting knowledge and meaningful insights from the data. In this thesis, we propose a spatio-temporal and a temporal-spatial anomaly detection algorithm in the context of wireless sensor network application. The latter can also be interpreted as a prediction model for spatio-temporal datasets. The anomaly detection algorithms identify local abnormalities in sensor data collected over space and time, independent of the global analytical view presented by the entire dataset. The first of our proposed algorithms identifies candidate spatial outliers by computing an outlierness indicator, which we call ANOI (Antimonotonic Outlierness Indicator) and then determines the spatio-temporal outliers by checking the stability of the candidates over time. Our novel idea of antimonotonicity in an outlier detection framework helps us to prune the search space and reduce computational complexity, which is a serious bottleneck for neighborhood-based outlier detection methodologies. The second algorithm first models time series data and then refines the temporal estimates by integrating spatial association information. Since the model predicts short-term energy availability based on past historical records, it can also assist wireless sensor nodes to automatically adapt to changing environmental conditions and use its energy harvesting and activity scheduling abilities intelligently. Experimental results on climate data empirically demonstrate the effectiveness of both the approaches.

Advanced Analytics and Learning on Temporal Data

Advanced Analytics and Learning on Temporal Data PDF Author: Vincent Lemaire
Publisher: Springer Nature
ISBN: 3030390985
Category : Computers
Languages : en
Pages : 236

Get Book Here

Book Description
This book constitutes the refereed proceedings of the 4th ECML PKDD Workshop on Advanced Analytics and Learning on Temporal Data, AALTD 2019, held in Würzburg, Germany, in September 2019. The 7 full papers presented together with 9 poster papers were carefully reviewed and selected from 31 submissions. The papers cover topics such as temporal data clustering; classification of univariate and multivariate time series; early classification of temporal data; deep learning and learning representations for temporal data; modeling temporal dependencies; advanced forecasting and prediction models; space-temporal statistical analysis; functional data analysis methods; temporal data streams; interpretable time-series analysis methods; dimensionality reduction, sparsity, algorithmic complexity and big data challenge; and bio-informatics, medical, energy consumption, on temporal data.

Advances in Spatial and Temporal Databases

Advances in Spatial and Temporal Databases PDF Author: Michael Gertz
Publisher: Springer
ISBN: 3319643673
Category : Computers
Languages : en
Pages : 454

Get Book Here

Book Description
This book constitutes the refereed proceedings of the 15th International Symposium on Spatial and Temporal Databases, SSTD 2017, held in Arlington, VA, USA, in August 2017.The 19 full papers presented together with 8 demo papers and 5 vision papers were carefully reviewed and selected from 90 submissions. The papers are organized around the current research on concepts, tools, and techniques related to spatial and temporal databases.

Handbook of Mobility Data Mining, Volume 1

Handbook of Mobility Data Mining, Volume 1 PDF Author: Haoran Zhang
Publisher: Elsevier
ISBN: 0443184291
Category : Business & Economics
Languages : en
Pages : 224

Get Book Here

Book Description
Handbook of Mobility Data Mining, Volume One: Data Preprocessing and Visualization introduces the fundamental technologies of mobile big data mining (MDM), advanced AI methods, and upper-level applications, helping readers comprehensively understand MDM with a bottom-up approach. The book explains how to preprocess mobile big data, visualize urban mobility, simulate and predict human travel behavior, and assess urban mobility characteristics and their matching performance as conditions and constraints in transport, emergency management, and sustainability development systems. The book contains crucial information for researchers, engineers, operators, administrators, and policymakers seeking greater understanding of current technologies' infra-knowledge structure and limitations. Further, the book introduces how to design MDM platforms that adapt to the evolving mobility environment, new types of transportation, and users based on an integrated solution that utilizes sensing and communication capabilities to tackle significant challenges faced by the MDM field. This volume focuses on how to efficiently pre-process mobile big data to extract and utilize critical feature information of high-dimensional city people flow. The book first provides a conceptual theory and framework, then discusses data sources, trajectory map-matching, noise filtering, trajectory data segmentation, data quality assessment, and more, concluding with a chapter on privacy protection in mobile big data mining. Introduces the characteristics of different mobility data sources, like GPS, CDR, and sensor-based mobility data Summarizes existing visualization technologies of the current transportation system into a multi-view frame, covering the perspective of the three leading actors Provides recommendations for practical open-source tools and libraries for system visualization Stems from the editor’s strong network of global transport authorities and transport companies, providing a solid knowledge structure and data foundation as well as geographical and stakeholder coverage

Innovations in Smart Cities Applications Volume 5

Innovations in Smart Cities Applications Volume 5 PDF Author: Mohamed Ben Ahmed
Publisher: Springer Nature
ISBN: 3030941914
Category : Technology & Engineering
Languages : en
Pages : 1117

Get Book Here

Book Description
This book sets the innovative research contributions, works, and solutions for almost all the intelligent and smart applications in the smart cities. The smart city concept is a relevant topic for industrials, governments, and citizens. Due to this, the smart city, considered as a multi-domain context, attracts tremendously academics researchers and practitioners who provide efforts in theoretical proofs, approaches, architectures, and in applied researches. The importance of smart cities comes essentially from the significant growth of populations in the near future which conducts to a real need of smart applications that can support this evolution in the future cities. The main scope of this book covers new and original ideas for the next generations of cities using the new technologies. The book involves the application of the data science and AI, IoT technologies and architectures, smart earth and water management, smart education and E-learning systems, smart modeling systems, smart mobility, and renewable energy. It also reports recent research works on big data technologies, image processing and recognition systems, and smart security and privacy.

Urban Computing

Urban Computing PDF Author: Yu Zheng
Publisher: MIT Press
ISBN: 0262039087
Category : Computers
Languages : en
Pages : 633

Get Book Here

Book Description
An authoritative treatment of urban computing, offering an overview of the field, fundamental techniques, advanced models, and novel applications. Urban computing brings powerful computational techniques to bear on such urban challenges as pollution, energy consumption, and traffic congestion. Using today's large-scale computing infrastructure and data gathered from sensing technologies, urban computing combines computer science with urban planning, transportation, environmental science, sociology, and other areas of urban studies, tackling specific problems with concrete methodologies in a data-centric computing framework. This authoritative treatment of urban computing offers an overview of the field, fundamental techniques, advanced models, and novel applications. Each chapter acts as a tutorial that introduces readers to an important aspect of urban computing, with references to relevant research. The book outlines key concepts, sources of data, and typical applications; describes four paradigms of urban sensing in sensor-centric and human-centric categories; introduces data management for spatial and spatio-temporal data, from basic indexing and retrieval algorithms to cloud computing platforms; and covers beginning and advanced topics in mining knowledge from urban big data, beginning with fundamental data mining algorithms and progressing to advanced machine learning techniques. Urban Computing provides students, researchers, and application developers with an essential handbook to an evolving interdisciplinary field.

Graph Mining

Graph Mining PDF Author: Deepayan Chakrabarti
Publisher: Morgan & Claypool Publishers
ISBN: 160845116X
Category : Computers
Languages : en
Pages : 209

Get Book Here

Book Description
What does the Web look like? How can we find patterns, communities, outliers, in a social network? Which are the most central nodes in a network? These are the questions that motivate this work. Networks and graphs appear in many diverse settings, for example in social networks, computer-communication networks (intrusion detection, traffic management), protein-protein interaction networks in biology, document-text bipartite graphs in text retrieval, person-account graphs in financial fraud detection, and others. In this work, first we list several surprising patterns that real graphs tend to follow. Then we give a detailed list of generators that try to mirror these patterns. Generators are important, because they can help with "what if" scenarios, extrapolations, and anonymization. Then we provide a list of powerful tools for graph analysis, and specifically spectral methods (Singular Value Decomposition (SVD)), tensors, and case studies like the famous "pageRank" algorithm and the "HITS" algorithm for ranking web search results. Finally, we conclude with a survey of tools and observations from related fields like sociology, which provide complementary viewpoints. Table of Contents: Introduction / Patterns in Static Graphs / Patterns in Evolving Graphs / Patterns in Weighted Graphs / Discussion: The Structure of Specific Graphs / Discussion: Power Laws and Deviations / Summary of Patterns / Graph Generators / Preferential Attachment and Variants / Incorporating Geographical Information / The RMat / Graph Generation by Kronecker Multiplication / Summary and Practitioner's Guide / SVD, Random Walks, and Tensors / Tensors / Community Detection / Influence/Virus Propagation and Immunization / Case Studies / Social Networks / Other Related Work / Conclusions