Predictive Dynamic Load Balancing of Parallel Hash-joins Over Heterogeneous Processors in the Presence of Data Skew

Author: Columbia University. Dept. of Computer Science
Publisher:
ISBN:
Category : Distributed databases
Languages : en
Pages : 18

Book Description
Abstract: "In this paper, we present new algorithms to balance the computation of parallel hash joins over heterogeneous processors in the presence of data skew and external loads. Heterogeneity in our model consists of disparate computing elements, as well as general purpose computing ensembles that are subject to external loading (e.g., a LAN connected workstation cluster). Data skew manifests itself as significant nonuniformities in the distribution of attribute values of underlying relations that are involved in a join. We develop cost models and predictive dynamic load balancing protocols to detect imbalance during the computation of a single large join. New predictive bucket scheduling algorithms are presented that smooth out the load over the entire ensemble by reallocating buckets whenever imbalance is detected. Our algorithms can account for imbalance due to data skew as well as heterogeneity in the computing environment. Significant performance gains are reported for a wide range of test cases on a prototype implementation of the system."
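
The abstract describes predicting imbalance and reallocating hash buckets among processors of different speeds. A minimal sketch of that general idea follows; it is not the paper's algorithm. The function names, the `tolerance` threshold, and the greedy "move one bucket from the slowest to the fastest processor" rule are all assumptions made for illustration, and processing rates are taken as given rather than measured.

```python
def predicted_finish(buckets, rate):
    """Predicted time to process all pending buckets at the given rate (tuples/sec)."""
    return sum(buckets) / rate


def rebalance(assignment, rates, tolerance=1.10):
    """Greedy reallocation sketch (hypothetical, not the paper's protocol).

    assignment: dict mapping processor name -> list of pending bucket sizes (tuples)
    rates:      dict mapping processor name -> observed processing rate (tuples/sec)
    tolerance:  assumed threshold; imbalance is flagged when the slowest predicted
                finish time exceeds tolerance * the fastest predicted finish time
    Returns a new assignment; the input is not modified.
    """
    assignment = {p: list(b) for p, b in assignment.items()}
    while True:
        finish = {p: predicted_finish(assignment[p], rates[p]) for p in assignment}
        slow = max(finish, key=finish.get)
        fast = min(finish, key=finish.get)
        # Stop when the predicted finish times are within tolerance of each other.
        if finish[slow] <= tolerance * finish[fast] or not assignment[slow]:
            return assignment
        # Choose the bucket whose move gives the smallest predicted finish time
        # for the (slow, fast) pair, and only if it improves on the current maximum.
        best = None
        for i, size in enumerate(assignment[slow]):
            new_slow = finish[slow] - size / rates[slow]
            new_fast = finish[fast] + size / rates[fast]
            new_max = max(new_slow, new_fast)
            if new_max < finish[slow] and (best is None or new_max < best[0]):
                best = (new_max, i)
        if best is None:
            return assignment  # no single move helps; leave the load as it is
        assignment[fast].append(assignment[slow].pop(best[1]))
```

In this sketch, heterogeneity and external load enter only through `rates`, and data skew enters only through unequal bucket sizes; a real system would re-estimate both while the join is running.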

Euro-Par 2001 Parallel Processing

Author: Rizos Sakellariou
Publisher: Springer
ISBN: 3540446818
Category : Computers
Languages : en
Pages : 993

Book Description
Euro-Par – the European Conference on Parallel Computing – is an international conference series dedicated to the promotion and advancement of all aspects of parallel computing. The major themes can be divided into the broad categories of hardware, software, algorithms, and applications for parallel computing. The objective of Euro-Par is to provide a forum within which to promote the development of parallel computing both as an industrial technique and an academic discipline, extending the frontiers of both the state of the art and the state of the practice. This is particularly important at a time when parallel computing is undergoing strong and sustained development and experiencing real industrial take-up. The main audience for and participants in Euro-Par are seen as researchers in academic departments, government laboratories, and industrial organisations. Euro-Par aims to become the primary choice of such professionals for the presentation of new results in their specific areas. Euro-Par is also interested in applications that demonstrate the effectiveness of the main Euro-Par themes. Euro-Par has its own Internet domain with a permanent web site where the history of the conference series is described: http://www.euro-par.org. The Euro-Par conference series is sponsored by the Association of Computer Machinery and the International Federation of Information Processing. Euro-Par 2001 was organised by the University of Manchester and UMIST.

Advances In Multimedia & Databases For The New Century - A Swiss/Japanese Perspective

Author: Yoshifumi Masunaga
Publisher: World Scientific
ISBN: 981454289X
Category : Computers
Languages : en
Pages : 225

Book Description
This Switzerland-Japan Joint Seminar on Multimedia and Databases was held to achieve at least three goals. First, it enabled us to present and discuss our recent research results and exchange our ideas for further promotion of science and technology. The second goal was to establish a friendly relationship between the Swiss and the Japanese. The last, but not least, aim was to disseminate information about our plans by publishing the proceedings of this seminar. We thought that publishing the outcome of the seminar would be essential in order not to store the treasure — the seminar results — secretly.

Proceedings of the ... International Conference on Parallel and Distributed Information Systems

Author:
Publisher:
ISBN:
Category : Distributed databases
Languages : en
Pages : 298

Book Description


Web-Age Information Management

Author: Hongjun Lu
Publisher: Springer
ISBN: 354045151X
Category : Computers
Languages : en
Pages : 458

Book Description
Database research and development has been remarkably successful over the past three decades. Now the field is facing new challenges posed by the rapid advances of technology, especially the penetration of the Web and Internet into everyone's daily life. The economical and financial environment where database systems are used has been changing dramatically. In addition to being able to efficiently manage a large volume of operational data generated internally, the ability to manage data in cyberspace, extract relevant information, and discover knowledge to support decision making is critical to the success of any organization. In order to provide researchers and practitioners with a forum to share their experiences in tackling problems in managing and using data, information, and knowledge in the age of the Internet and Web, the First International Conference on Web-Age Information Management (WAIM 2000) was held in Shanghai, China, June 21-23. The inaugural conference in its series was well received. Researchers from 17 countries and regions, including Austria, Australia, Bahrain, Canada, China, France, Germany, Japan, Korea, Malaysia, The Netherlands, Poland, Singapore, Spain, Taiwan, UK, and USA submitted their recent work. Twenty-seven regular and 14 short papers contained in these proceedings were presented during the two-day conference. These papers cover a large spectrum of issues, from classical data management such as object-oriented modeling, spatial and temporal databases to recent hits like data mining, data warehousing, semi-structured data, and XML.

Very Large Data Bases

Author:
Publisher:
ISBN:
Category : Data structures (Computer science)
Languages : en
Pages : 792

Book Description


EURO-PAR '...

Author:
Publisher:
ISBN:
Category : Parallel processing (Electronic computers)
Languages : en
Pages : 988

Book Description


Parallel Hash Join with Skew Handling on Multiprocessor Systems

Author: Walid R. Tout
Publisher:
ISBN:
Category : Computer architecture
Languages : en
Pages : 282

Book Description


Proceedings of the Twenty-fifth International Conference on Very Large Databases, Edinburgh, Scotland, UK, 7-10 September, 1999

Author: Malcolm Atkinson
Publisher: Morgan Kaufmann Publishers
ISBN:
Category : Data structures (Computer science)
Languages : en
Pages : 800

Book Description
This is the silver anniversary of one of the longest running database conferences. VLDB is among the best established forums for discussion in the international database community and is organized every year by the VLDB Endowment.

Web-age Information Management

Author:
Publisher:
ISBN:
Category : Database management
Languages : en
Pages : 492

Book Description