Data Abstraction and Pattern Identification in Time-series Data

Author: Prithiviraj Muthumanickam
Publisher: Linköping University Electronic Press
ISBN: 9179299652
Category :
Languages : en
Pages : 73

Book Description
Data sources such as simulations and sensor networks, across many application domains, generate large volumes of time-series data whose characteristics evolve over time. Visual data analysis methods can help in exploring and understanding the underlying patterns in time-series data but, as the data grow ever larger, the visual analysis process can become complex. Large data sets can be handled with data abstraction techniques, which transform the raw data into a simpler format while preserving the features that are important to the user. When dealing with time-series data, abstraction techniques should also take the underlying temporal characteristics into account. This thesis focuses on data abstraction and pattern identification methods for large 1D time-series and for 2D spatio-temporal time-series data that exhibit spatio-temporal discontinuity. Based on the dimensionality and characteristics of the data, the thesis proposes a variety of efficient, data-adaptive and user-controlled data abstraction methods that transform the raw data into a symbol sequence. The resulting symbol sequence can serve as input to sequence analysis methods from the data mining and machine learning communities in order to identify interesting patterns of user behavior. For very long duration 1D time-series, locally adaptive and user-controlled data approximation methods were presented that simplify the data while retaining the perceptually important features. The simplified data were converted into a symbol sequence, and sketch-based pattern identification was then used to find patterns in the symbolic data by means of regular-expression-based pattern matching. The method was applied to financial time-series, where patterns such as head-and-shoulders, double-top and triple-top were identified interactively from hand-drawn sketches.
Through data smoothing, the data approximation step also enables visualization of inherent patterns in the time-series representation while retaining perceptually important points. Very long duration 2D spatio-temporal eye tracking data sets that exhibit spatio-temporal discontinuity were transformed into symbolic data using scalable clustering and hierarchical cluster merging processes, each of which can be parallelized. The raw data are transformed into a symbol sequence in which each symbol represents a region of interest in the eye gaze data. The identified regions of interest can also be displayed in a Space-Time Cube (STC) that captures both temporal and contextual information. Through interactive filtering, zooming and geometric transformation, the STC representation, together with linked views, enables interactive data exploration. Using different sequence analysis methods, the symbol sequences are analyzed further to identify temporal patterns in the data set. Data collected from air traffic control officers were used as application examples to demonstrate the results.
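The idea of turning a time-series into a symbol sequence and then searching it with regular expressions can be illustrated with a very small Python sketch. This is not the thesis's method: the three-letter alphabet (`u`, `d`, `f`), the `tolerance` parameter, the function names and the `DOUBLE_TOP` expression are all assumptions made purely for illustration, and the sample prices are invented.

```python
import re

def symbolize(values, tolerance=0.0):
    """Map each step between consecutive samples to 'u' (up), 'd' (down) or 'f' (flat)."""
    symbols = []
    for prev, curr in zip(values, values[1:]):
        delta = curr - prev
        if delta > tolerance:
            symbols.append("u")
        elif delta < -tolerance:
            symbols.append("d")
        else:
            symbols.append("f")
    return "".join(symbols)

# Hypothetical "double top" shape: a rise, a fall, a second rise, a second fall.
DOUBLE_TOP = re.compile(r"u+d+u+d+")

def find_pattern(values, pattern=DOUBLE_TOP, tolerance=0.0):
    """Return (start, end) spans, in symbol positions, where the pattern matches."""
    symbols = symbolize(values, tolerance)
    return [(m.start(), m.end()) for m in pattern.finditer(symbols)]

# Invented sample data, for illustration only.
prices = [1.0, 2.0, 3.0, 2.0, 1.5, 2.5, 3.0, 2.0, 1.0]
print(symbolize(prices))     # uudduudd
print(find_pattern(prices))  # [(0, 8)]
```

Symbol position i describes the step from sample i to sample i + 1, so a span (0, 8) covers samples 0 through 8. A data-adaptive approximation step, as described in the abstract, would normally run before symbolization; it is omitted here to keep the sketch short.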

Pattern Recognition and Classification in Time Series Data

Author: Volna, Eva
Publisher: IGI Global
ISBN: 1522505660
Category : Computers
Languages : en
Pages : 295

Book Description
Patterns can be any number of items that occur repeatedly, whether in the behaviour of animals, humans, traffic, or even in the appearance of a design. As technologies continue to advance, recognizing, mimicking, and responding to all types of patterns becomes more precise. Pattern Recognition and Classification in Time Series Data focuses on intelligent methods and techniques for recognizing and storing dynamic patterns. Emphasizing topics related to artificial intelligence, pattern management, and algorithm development, in addition to practical examples and applications, this publication is an essential reference source for graduate students, researchers, and professionals in a variety of computer-related disciplines.

Pattern Classification

Author: Richard O. Duda
Publisher: John Wiley & Sons
ISBN: 111858600X
Category : Technology & Engineering
Languages : en
Pages : 680

Book Description
The first edition, published in 1973, has become a classic reference in the field. Now with the second edition, readers will find information on key new topics such as neural networks and statistical pattern recognition, the theory of machine learning, and the theory of invariances. Also included are worked examples, comparisons between different methods, extensive graphics, expanded exercises and computer project topics. An Instructor's Manual presenting detailed solutions to all the problems in the book is available from the Wiley editorial department.

Machine Learning and Data Mining in Pattern Recognition

Author: Petra Perner
Publisher: Springer Science & Business Media
ISBN: 3540405046
Category : Computers
Languages : en
Pages : 452

Book Description
The International Conference on Machine Learning and Data Mining (MLDM) is the third meeting in a series of biennial events, which started in 1999, organized by the Institute of Computer Vision and Applied Computer Sciences (IBaI) in Leipzig. MLDM began as a workshop and is now a conference, and has brought the topic of machine learning and data mining to the attention of the research community. Seventy-five papers were submitted to the conference this year. The program committee worked hard to select the most progressive research in a fair and competent review process which led to the acceptance of 33 papers for presentation at the conference. The 33 papers in these proceedings cover a wide variety of topics related to machine learning and data mining. The two invited talks deal with learning in case-based reasoning and with mining for structural data. The contributed papers can be grouped into nine areas: support vector machines; pattern discovery; decision trees; clustering; classification and retrieval; case-based reasoning; Bayesian models and methods; association rules; and applications. We would like to express our appreciation to the reviewers for their precise and highly professional work. We are grateful to the German Science Foundation for its support of the Eastern European researchers. We appreciate the help and understanding of the editorial staff at Springer Verlag, and in particular Alfred Hofmann, who supported the publication of these proceedings in the LNAI series. Last, but not least, we wish to thank all the speakers and participants who contributed to the success of the conference.

Web Information Systems and Applications

Author: Weiwei Ni
Publisher: Springer Nature
ISBN: 3030309525
Category : Computers
Languages : en
Pages : 725

Book Description
This book constitutes the proceedings of the 16th International Conference on Web Information Systems and Applications, WISA 2019, held in Qingdao, China, in September 2019. The 39 revised full papers and 33 short papers presented were carefully reviewed and selected from 154 submissions. The papers are grouped in topical sections on machine learning and data mining, cloud computing and big data, information retrieval, natural language processing, data privacy and security, knowledge graphs and social networks, blockchain, query processing, and recommendations.

Advances in Intelligent Data Analysis

Author: David J Hand
Publisher: Springer
ISBN: 3540484124
Category : Computers
Languages : en
Pages : 529

Book Description
This book constitutes the refereed proceedings of the Third International Symposium on Intelligent Data Analysis, IDA-99 held in Amsterdam, The Netherlands in August 1999. The 21 revised full papers and 23 posters presented in the book were carefully reviewed and selected from a total of more than 100 submissions. The papers address all current aspects of intelligent data analysis; they are organized in sections on learning, visualization, classification and clustering, integration, applications and media mining.

Data Science

Author: Jing He
Publisher: Springer Nature
ISBN: 9811528101
Category : Computers
Languages : en
Pages : 720

Book Description
This book constitutes the refereed proceedings of the 6th International Conference on Data Science, ICDS 2019, held in Ningbo, China, during May 2019. The 64 revised full papers presented were carefully reviewed and selected from 210 submissions. The research papers cover the areas of Advancement of Data Science and Smart City Applications, Theory of Data Science, Data Science of People and Health, Web of Data, Data Science of Trust and Internet of Things.

Pattern Recognition

Author: Jesús Ariel Carrasco-Ochoa
Publisher: Springer
ISBN: 3642311490
Category : Computers
Languages : en
Pages : 358

Book Description
This book constitutes the refereed proceedings of the 4th Mexican Conference on Pattern Recognition, MCPR 2012, held in Huatulco, Mexico, in June 2012. The 31 revised full papers and 3 keynotes presented were carefully reviewed and selected from 64 submissions and are organized in topical sections on image processing; computer vision and image recognition; pattern recognition and neural networks; and document processing and speech recognition.

Information-Statistical Data Mining

Author: Bon K. Sy
Publisher: Springer Science & Business Media
ISBN: 1441990011
Category : Technology & Engineering
Languages : en
Pages : 301

Book Description
Information-Statistical Data Mining: Warehouse Integration with Examples of Oracle Basics is written to introduce basic concepts, advanced research techniques, and practical solutions of data warehousing and data mining for hosting large data sets and EDA. This book is unique because it is one of the few in the forefront that attempts to bridge statistics and information theory through a concept of patterns. Information-Statistical Data Mining: Warehouse Integration with Examples of Oracle Basics is designed for a professional audience composed of researchers and practitioners in industry. This book is also suitable as a secondary text for graduate-level students in computer science and engineering.

Advances in Pattern Recognition - ICAPR 2001

Author: Sameer Singh
Publisher: Springer
ISBN: 3540447326
Category : Computers
Languages : en
Pages : 491

Book Description
The paper is organized as follows. In section 2, we describe the norm-orientation-discontinuity interfering model, based on a Gaussian stochastic model, for analyzing the properties of the interfering strokes. In section 3, we describe the improved Canny edge detector with an edge-orientation constraint, used to detect the edges of the foreground words and characters and to recover the weak ones. In section 4, we illustrate, discuss and evaluate the experimental results of the proposed method, demonstrating that our algorithm significantly improves the segmentation quality. Section 5 concludes the paper.
2. The norm-orientation-discontinuity interfering stroke model
Figure 2 shows three typical samples of image segments from the original documents, together with the magnitude of their detected edges. The magnitude of the gradient is converted into a gray level value: the darker the edge, the larger the gradient magnitude. It is obvious that the strongest edges correspond to foreground edges. It should be noted that, while the foreground writing usually appears darker than the background image, as in Figure 2(a), there are cases where the foreground and background have similar intensities, as in Figure 2(b), or, worse still, where the background is more prominent than the foreground, as in Figure 2(c). Using the intensity value alone is therefore not enough to differentiate the foreground from the background.
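The mapping from gradient magnitude to gray level mentioned in this excerpt can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it uses plain central differences with clamped borders rather than the improved Canny detector the excerpt refers to, and the function names, the 0 to 255 output range and the tiny test image are all hypothetical.

```python
import math

def gradient_magnitude(image):
    """Central-difference gradient magnitude of a 2D list of intensities (borders clamped)."""
    rows, cols = len(image), len(image[0])
    mag = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            gx = image[r][min(c + 1, cols - 1)] - image[r][max(c - 1, 0)]
            gy = image[min(r + 1, rows - 1)][c] - image[max(r - 1, 0)][c]
            mag[r][c] = math.hypot(gx, gy)
    return mag

def to_gray_levels(mag):
    """Scale magnitudes to 0..255 so that a larger magnitude gives a darker (lower) value."""
    peak = max(max(row) for row in mag)
    if peak == 0:
        # A perfectly flat image has no edges, so every pixel stays white.
        return [[255] * len(row) for row in mag]
    return [[255 - round(255 * v / peak) for v in row] for row in mag]

# Invented 3x4 image with one vertical intensity step, for illustration only.
image = [[0, 0, 10, 10] for _ in range(3)]
print(to_gray_levels(gradient_magnitude(image))[0])  # [255, 0, 0, 255]
```

The two columns next to the intensity step come out black (0) and the flat columns stay white (255), which matches the "darker means larger gradient" convention described above.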