Hardware and Software Fault Tolerance in Parallel Computing Systems

Hardware and Software Fault Tolerance in Parallel Computing Systems PDF Author: Dimitri Ranguelov Avresky
Publisher: Prentice Hall
ISBN:
Category : Computers
Languages : en
Pages : 360

Get Book Here

Book Description

Hardware and Software Fault Tolerance in Parallel Computing Systems

Hardware and Software Fault Tolerance in Parallel Computing Systems PDF Author: Dimitri Ranguelov Avresky
Publisher: Prentice Hall
ISBN:
Category : Computers
Languages : en
Pages : 360

Get Book Here

Book Description


Fault-Tolerant Systems

Fault-Tolerant Systems PDF Author: Israel Koren
Publisher: Elsevier
ISBN: 0080492681
Category : Computers
Languages : en
Pages : 399

Get Book Here

Book Description
Fault-Tolerant Systems is the first book on fault tolerance design with a systems approach to both hardware and software. No other text on the market takes this approach, nor offers the comprehensive and up-to-date treatment that Koren and Krishna provide. This book incorporates case studies that highlight six different computer systems with fault-tolerance techniques implemented in their design. A complete ancillary package is available to lecturers, including online solutions manual for instructors and PowerPoint slides. Students, designers, and architects of high performance processors will value this comprehensive overview of the field. - The first book on fault tolerance design with a systems approach - Comprehensive coverage of both hardware and software fault tolerance, as well as information and time redundancy - Incorporated case studies highlight six different computer systems with fault-tolerance techniques implemented in their design - Available to lecturers is a complete ancillary package including online solutions manual for instructors and PowerPoint slides

Software-Implemented Hardware Fault Tolerance

Software-Implemented Hardware Fault Tolerance PDF Author: Olga Goloubeva
Publisher: Springer Science & Business Media
ISBN: 0387329374
Category : Technology & Engineering
Languages : en
Pages : 238

Get Book Here

Book Description
This book presents the theory behind software-implemented hardware fault tolerance, as well as the practical aspects needed to put it to work on real examples. By evaluating accurately the advantages and disadvantages of the already available approaches, the book provides a guide to developers willing to adopt software-implemented hardware fault tolerance in their applications. Moreover, the book identifies open issues for researchers willing to improve the already available techniques.

Hardware and Software Architectures for Fault Tolerance

Hardware and Software Architectures for Fault Tolerance PDF Author: Michel Banatre
Publisher: Springer Science & Business Media
ISBN: 9783540577676
Category : Computers
Languages : en
Pages : 332

Get Book Here

Book Description
Fault tolerance has been an active research area for many years. This volume presents papers from a workshop held in 1993 where a small number of key researchers and practitioners in the area met to discuss the experiences of industrial practitioners, to provide a perspective on the state of the art of fault tolerance research, to determine whether the subject is becoming mature, and to learn from the experiences so far in order to identify what might be important research topics for the coming years. The workshop provided a more intimate environment for discussions and presentations than usual at conferences. The papers in the volume were presented at the workshop, then updated and revised to reflect what was learned at the workshop.

Design And Analysis Of Reliable And Fault-tolerant Computer Systems

Design And Analysis Of Reliable And Fault-tolerant Computer Systems PDF Author: Mostafa I Abd-el-barr
Publisher: World Scientific
ISBN: 190897978X
Category : Computers
Languages : en
Pages : 463

Get Book Here

Book Description
Covering both the theoretical and practical aspects of fault-tolerant mobile systems, and fault tolerance and analysis, this book tackles the current issues of reliability-based optimization of computer networks, fault-tolerant mobile systems, and fault tolerance and reliability of high speed and hierarchical networks.The book is divided into six parts to facilitate coverage of the material by course instructors and computer systems professionals. The sequence of chapters in each part ensures the gradual coverage of issues from the basics to the most recent developments. A useful set of references, including electronic sources, is listed at the end of each chapter./a

Software Design for Resilient Computer Systems

Software Design for Resilient Computer Systems PDF Author: Igor Schagaev
Publisher: Springer Nature
ISBN: 3031551397
Category :
Languages : en
Pages : 414

Get Book Here

Book Description


Dependable Embedded Systems

Dependable Embedded Systems PDF Author: Jörg Henkel
Publisher: Springer Nature
ISBN: 303052017X
Category : Technology & Engineering
Languages : en
Pages : 606

Get Book Here

Book Description
This Open Access book introduces readers to many new techniques for enhancing and optimizing reliability in embedded systems, which have emerged particularly within the last five years. This book introduces the most prominent reliability concerns from today’s points of view and roughly recapitulates the progress in the community so far. Unlike other books that focus on a single abstraction level such circuit level or system level alone, the focus of this book is to deal with the different reliability challenges across different levels starting from the physical level all the way to the system level (cross-layer approaches). The book aims at demonstrating how new hardware/software co-design solution can be proposed to ef-fectively mitigate reliability degradation such as transistor aging, processor variation, temperature effects, soft errors, etc. Provides readers with latest insights into novel, cross-layer methods and models with respect to dependability of embedded systems; Describes cross-layer approaches that can leverage reliability through techniques that are pro-actively designed with respect to techniques at other layers; Explains run-time adaptation and concepts/means of self-organization, in order to achieve error resiliency in complex, future many core systems.

Patterns for Fault Tolerant Software

Patterns for Fault Tolerant Software PDF Author: Robert S. Hanmer
Publisher: John Wiley & Sons
ISBN: 1118351541
Category : Computers
Languages : en
Pages : 272

Get Book Here

Book Description
Software patterns have revolutionized the way developer’s and architects think about how software is designed, built and documented. This new title in Wiley’s prestigious Series in Software Design Patterns presents proven techniques to achieve patterns for fault tolerant software. This is a key reference for experts seeking to select a technique appropriate for a given system. Readers are guided from concepts and terminology, through common principles and methods, to advanced techniques and practices in the development of software systems. References will provide access points to the key literature, including descriptions of exemplar applications of each technique. Organized into a collection of software techniques, specific techniques can be easily found with sufficient detail to allow appropriate choices for the system being designed.

Fault Tolerant Computer Architecture

Fault Tolerant Computer Architecture PDF Author: Daniel Sorin
Publisher: Morgan & Claypool Publishers
ISBN: 1598299549
Category : Technology & Engineering
Languages : en
Pages : 116

Get Book Here

Book Description
For many years, most computer architects have pursued one primary goal: performance. Architects have translated the ever-increasing abundance of ever-faster transistors provided by Moore's law into remarkable increases in performance. Recently, however, the bounty provided by Moore's law has been accompanied by several challenges that have arisen as devices have become smaller, including a decrease in dependability due to physical faults. In this book, we focus on the dependability challenge and the fault tolerance solutions that architects are developing to overcome it. The two main purposes of this book are to explore the key ideas in fault-tolerant computer architecture and to present the current state-of-the-art - over approximately the past 10 years - in academia and industry. Table of Contents: Introduction / Error Detection / Error Recovery / Diagnosis / Self-Repair / The Future

Software Fault Tolerance Techniques and Implementation

Software Fault Tolerance Techniques and Implementation PDF Author: Laura L. Pullum
Publisher: Artech House
ISBN: 1580531377
Category : Computers
Languages : en
Pages : 358

Get Book Here

Book Description
Look to this innovative resource for the most-comprehensive coverage of software fault tolerance techniques available in a single volume. It offers you a thorough understanding of the operation of critical software fault tolerance techniques and guides you through their design, operation and performance. You get an in-depth discussion on the advantages and disadvantages of specific techniques, so you can decide which ones are best suited for your work.