Principal Component Analysis and Randomness Test for Big Data Analysis

Principal Component Analysis and Randomness Test for Big Data Analysis PDF Author: Mieko Tanaka-Yamawaki
Publisher: Springer Nature
ISBN: 9811939675
Category : Business & Economics
Languages : en
Pages : 153

Get Book Here

Book Description
This book presents the novel approach of analyzing large-sized rectangular-shaped numerical data (so-called big data). The essence of this approach is to grasp the "meaning" of the data instantly, without getting into the details of individual data. Unlike conventional approaches of principal component analysis, randomness tests, and visualization methods, the authors' approach has the benefits of universality and simplicity of data analysis, regardless of data types, structures, or specific field of science. First, mathematical preparation is described. The RMT-PCA and the RMT-test utilize the cross-correlation matrix of time series, C = XXT, where X represents a rectangular matrix of N rows and L columns and XT represents the transverse matrix of X. Because C is symmetric, namely, C = CT, it can be converted to a diagonal matrix of eigenvalues by a similarity transformation SCS-1 = SCST using an orthogonal matrix S. When N is significantly large, the histogram of the eigenvalue distribution can be compared to the theoretical formula derived in the context of the random matrix theory (RMT, in abbreviation). Then the RMT-PCA applied to high-frequency stock prices in Japanese and American markets is dealt with. This approach proves its effectiveness in extracting "trendy" business sectors of the financial market over the prescribed time scale. In this case, X consists of N stock- prices of length L, and the correlation matrix C is an N by N square matrix, whose element at the i-th row and j-th column is the inner product of the price time series of the length L of the i-th stock and the j-th stock of the equal length L. Next, the RMT-test is applied to measure randomness of various random number generators, including algorithmically generated random numbers and physically generated random numbers. The book concludes by demonstrating two applications of the RMT-test: (1) a comparison of hash functions, and (2) stock prediction by means of randomness, including a new index of off-randomness related to market decline.

Principal Component Analysis and Randomness Test for Big Data Analysis

Principal Component Analysis and Randomness Test for Big Data Analysis PDF Author: Mieko Tanaka-Yamawaki
Publisher: Springer Nature
ISBN: 9811939675
Category : Business & Economics
Languages : en
Pages : 153

Get Book Here

Book Description
This book presents the novel approach of analyzing large-sized rectangular-shaped numerical data (so-called big data). The essence of this approach is to grasp the "meaning" of the data instantly, without getting into the details of individual data. Unlike conventional approaches of principal component analysis, randomness tests, and visualization methods, the authors' approach has the benefits of universality and simplicity of data analysis, regardless of data types, structures, or specific field of science. First, mathematical preparation is described. The RMT-PCA and the RMT-test utilize the cross-correlation matrix of time series, C = XXT, where X represents a rectangular matrix of N rows and L columns and XT represents the transverse matrix of X. Because C is symmetric, namely, C = CT, it can be converted to a diagonal matrix of eigenvalues by a similarity transformation SCS-1 = SCST using an orthogonal matrix S. When N is significantly large, the histogram of the eigenvalue distribution can be compared to the theoretical formula derived in the context of the random matrix theory (RMT, in abbreviation). Then the RMT-PCA applied to high-frequency stock prices in Japanese and American markets is dealt with. This approach proves its effectiveness in extracting "trendy" business sectors of the financial market over the prescribed time scale. In this case, X consists of N stock- prices of length L, and the correlation matrix C is an N by N square matrix, whose element at the i-th row and j-th column is the inner product of the price time series of the length L of the i-th stock and the j-th stock of the equal length L. Next, the RMT-test is applied to measure randomness of various random number generators, including algorithmically generated random numbers and physically generated random numbers. The book concludes by demonstrating two applications of the RMT-test: (1) a comparison of hash functions, and (2) stock prediction by means of randomness, including a new index of off-randomness related to market decline.

Places Rated Almanac

Places Rated Almanac PDF Author: David Savageau
Publisher: Prentice Hall
ISBN: 9780671849474
Category : Philosophy
Languages : en
Pages : 452

Get Book Here

Book Description
This sometimes controversial bestseller, completely updated with all new statistics, is packed with timely facts and unbiased information on more than 300 metropolitan areas in the U.S. and Canada. Each city is ranked according to costs of living, crime rates, cultural life, and environmental factors.

Principal Component Analysis

Principal Component Analysis PDF Author: I.T. Jolliffe
Publisher: Springer Science & Business Media
ISBN: 1475719043
Category : Mathematics
Languages : en
Pages : 283

Get Book Here

Book Description
Principal component analysis is probably the oldest and best known of the It was first introduced by Pearson (1901), techniques ofmultivariate analysis. and developed independently by Hotelling (1933). Like many multivariate methods, it was not widely used until the advent of electronic computers, but it is now weIl entrenched in virtually every statistical computer package. The central idea of principal component analysis is to reduce the dimen sionality of a data set in which there are a large number of interrelated variables, while retaining as much as possible of the variation present in the data set. This reduction is achieved by transforming to a new set of variables, the principal components, which are uncorrelated, and which are ordered so that the first few retain most of the variation present in all of the original variables. Computation of the principal components reduces to the solution of an eigenvalue-eigenvector problem for a positive-semidefinite symmetrie matrix. Thus, the definition and computation of principal components are straightforward but, as will be seen, this apparently simple technique has a wide variety of different applications, as weIl as a number of different deri vations. Any feelings that principal component analysis is a narrow subject should soon be dispelled by the present book; indeed some quite broad topics which are related to principal component analysis receive no more than a brief mention in the final two chapters.

Fintech with Artificial Intelligence, Big Data, and Blockchain

Fintech with Artificial Intelligence, Big Data, and Blockchain PDF Author: Paul Moon Sub Choi
Publisher: Springer Nature
ISBN: 9813361379
Category : Technology & Engineering
Languages : en
Pages : 306

Get Book Here

Book Description
This book introduces readers to recent advancements in financial technologies. The contents cover some of the state-of-the-art fields in financial technology, practice, and research associated with artificial intelligence, big data, and blockchain—all of which are transforming the nature of how products and services are designed and delivered, making less adaptable institutions fast become obsolete. The book provides the fundamental framework, research insights, and empirical evidence in the efficacy of these new technologies, employing practical and academic approaches to help professionals and academics reach innovative solutions and grow competitive strengths.

Information Management and Big Data

Information Management and Big Data PDF Author: Juan Antonio Lossio-Ventura
Publisher: Springer Nature
ISBN: 3031636163
Category :
Languages : en
Pages : 366

Get Book Here

Book Description


An Introduction to Applied Multivariate Analysis with R

An Introduction to Applied Multivariate Analysis with R PDF Author: Brian Everitt
Publisher: Springer Science & Business Media
ISBN: 1441996508
Category : Mathematics
Languages : en
Pages : 284

Get Book Here

Book Description
The majority of data sets collected by researchers in all disciplines are multivariate, meaning that several measurements, observations, or recordings are taken on each of the units in the data set. These units might be human subjects, archaeological artifacts, countries, or a vast variety of other things. In a few cases, it may be sensible to isolate each variable and study it separately, but in most instances all the variables need to be examined simultaneously in order to fully grasp the structure and key features of the data. For this purpose, one or another method of multivariate analysis might be helpful, and it is with such methods that this book is largely concerned. Multivariate analysis includes methods both for describing and exploring such data and for making formal inferences about them. The aim of all the techniques is, in general sense, to display or extract the signal in the data in the presence of noise and to find out what the data show us in the midst of their apparent chaos. An Introduction to Applied Multivariate Analysis with R explores the correct application of these methods so as to extract as much information as possible from the data at hand, particularly as some type of graphical representation, via the R software. Throughout the book, the authors give many examples of R code used to apply the multivariate techniques to multivariate data.

Data Mining and Big Data

Data Mining and Big Data PDF Author: Ying Tan
Publisher: Springer Nature
ISBN: 9811572054
Category : Computers
Languages : en
Pages : 130

Get Book Here

Book Description
This book constitutes refereed proceedings of the 5th International Conference on Data Mining and Big Data, DMBD 2020, held in July 2020. Due to the COVID-19 pandemic the conference was held in a fully virtual format. The 7 full papers and 3 short papers presented in this volume were carefully reviewed and selected from 39 submissions. The papers present the latest research on advantages in theories, technologies, and applications in data mining and big data. The volume covers many aspects of data mining and big data as well as intelligent computing methods applied to all fields of computer science, machine learning, data mining and knowledge discovery, data science, etc.

Big Data Analytics

Big Data Analytics PDF Author: Ümit Demirbaga
Publisher: Springer Nature
ISBN: 3031556399
Category :
Languages : en
Pages : 299

Get Book Here

Book Description


Groundwater Response to Changing Climate

Groundwater Response to Changing Climate PDF Author: Makoto Taniguchi
Publisher: CRC Press
ISBN: 0203852834
Category : Science
Languages : en
Pages : 245

Get Book Here

Book Description
Groundwater systems are vital to both society and the environment, supporting food production and many other ecosystem services. Sustainable management of this vital resource for future generations requires a sound understanding of how groundwater might respond to the inevitable changes in future climate. In this volume, recent developments within

Big Data, Machine Learning, and Applications

Big Data, Machine Learning, and Applications PDF Author: Malaya Dutta Borah
Publisher: Springer Nature
ISBN: 9819934818
Category : Computers
Languages : en
Pages : 758

Get Book Here

Book Description
This book constitutes refereed proceedings of the Second International Conference on Big Data, Machine Learning, and Applications, BigDML 2021. The volume focuses on topics such as computing methodology; machine learning; artificial intelligence; information systems; security and privacy. This volume will benefit research scholars, academicians, and industrial people who work on data storage and machine learning.