Power-efficient Tightly-coupled Processor Arrays for Digital Signal Processing

Power-efficient Tightly-coupled Processor Arrays for Digital Signal Processing PDF Author: Dmitrij Kissler
Publisher:
ISBN:
Category :
Languages : en
Pages : 264

Get Book Here

Book Description

Power-efficient Tightly-coupled Processor Arrays for Digital Signal Processing

Power-efficient Tightly-coupled Processor Arrays for Digital Signal Processing PDF Author: Dmitrij Kissler
Publisher:
ISBN:
Category :
Languages : en
Pages : 264

Get Book Here

Book Description


Invasive Tightly Coupled Processor Arrays

Invasive Tightly Coupled Processor Arrays PDF Author: VAHID LARI
Publisher: Springer
ISBN: 9811010587
Category : Technology & Engineering
Languages : en
Pages : 165

Get Book Here

Book Description
This book introduces new massively parallel computer (MPSoC) architectures called invasive tightly coupled processor arrays. It proposes strategies, architecture designs, and programming interfaces for invasive TCPAs that allow invading and subsequently executing loop programs with strict requirements or guarantees of non-functional execution qualities such as performance, power consumption, and reliability. For the first time, such a configurable processor array architecture consisting of locally interconnected VLIW processing elements can be claimed by programs, either in full or in part, using the principle of invasive computing. Invasive TCPAs provide unprecedented energy efficiency for the parallel execution of nested loop programs by avoiding any global memory access such as GPUs and may even support loops with complex dependencies such as loop-carried dependencies that are not amenable to parallel execution on GPUs. For this purpose, the book proposes different invasion strategies for claiming a desired number of processing elements (PEs) or region within a TCPA exclusively for an application according to performance requirements. It not only presents models for implementing invasion strategies in hardware, but also proposes two distinct design flavors for dedicated hardware components to support invasion control on TCPAs.

Blocks, Towards Energy-efficient, Coarse-grained Reconfigurable Architectures

Blocks, Towards Energy-efficient, Coarse-grained Reconfigurable Architectures PDF Author: Mark Wijtvliet
Publisher: Springer Nature
ISBN: 3030797740
Category : Technology & Engineering
Languages : en
Pages : 225

Get Book Here

Book Description
This book describes a new, coarse-grained reconfigurable architecture (CGRA), called Blocks, and puts it in the context of computer architectures, and in particular of other CGRAs. The book starts with an extensive evaluation of historic and existing CGRAs and their strengths and weaknesses. This also leads to a better understanding and new definition of what distinguishes CGRAs from other architectural approaches. The authors introduce Blocks as unique due to its separate programmable control and data paths, allowing light-weight instruction decode units to be arbitrarily connected to one or more functional units (FUs) over a statically configured interconnect. The discussion includes an explanation of how to model architectures, resulting in an area and energy model for Blocks. The accuracy of this model is evaluated against fully implemented architectures, showing that although it is three orders of magnitude faster than synthesis the error margin is very acceptable. The book concludes with a case study on a real System-on-Chip, including a RISC architecture, the Blocks CGRA and peripherals.

Flexibility, Scalability, and Efficiency in Next-Generation Digital Signal Processors

Flexibility, Scalability, and Efficiency in Next-Generation Digital Signal Processors PDF Author: Uneeb Rathore
Publisher:
ISBN:
Category :
Languages : en
Pages : 184

Get Book Here

Book Description
Despite advancements in transistor density, the last decade has seen the slowing down of Moore's law, an increasing silicon area cost, and an increasing number of dedicated accelerators in modern System on Chips (SoCs) and System in Packages (SiPs), leading to dark silicon. In trying to find alternate ways to fit more compute on a package in a cost effective way, leading chip manufacturers are adopting designs with more flexible hardware and their integration on silicon interposer based multi-chip platform technologies.Flexible chips can reuse hardware resources shared across algorithms, increasing active utilization of silicon and reducing required chip area. Additionally they can accommodate frequent design changes for constantly evolving standards such as 5G, which would otherwise require costly chip re-designs and re-spins. However, existing flexible architectures such as coarse-grain DSPs and CGRA significantly lag behind their dedicated accelerator counterparts in terms of throughput and energy and area efficiencies (10x-25x). There is a significant need today for flexible designs that are re-usable, have high throughput, and are also efficient enough for the strict energy and cost requirements of mobile and edge devices, in addition to ensuring compliance with the evolving protocols. Multi-chip scaling and heterogeneous integration can significantly lower manufacturing costs and time-to-market due to higher chip yields and IP-design reuse across multiple nodes. However, large interposer bump pitch, bulky inter-chip communication links, individual custom timing circuity, and lower channel bandwidths stand in the way of widespread adoption. Moreover, in energy- and cost-sensitive mobile applications, high channel efficiency coupled with low channel area serve as additional constraints. To address these challenges, this dissertation presents a flexible, domain-specific, 784-Core, Universal Digital Signal Processor (UDSP) array, targeting DSP applications (such as FIR, IIR, FFT and Vector-Dot-Product), achieving a 4.2x energy-efficiency gap and 6.4x area-efficiency gap from their ASIC counterparts, with high throughput (1.1GHz). The UDSP is realized with a course-grain domain-specific core that balances granularity and utilization, interconnected via a network tailored to DSP kernels with the "right" amount of connectivity. In addition, the trade-off between silicon area and compile flexibility is explored for multi-layer sparse switchbox designs resulting in an area- and time-efficient hardware-compiler co-optimized switchbox to further enhance design productivity. For advancing multi-chip scaling, this dissertation presents the 1st functional, 2x2 UDSP processor on a 2-layer Silicon Interconnect Fabric (Si-IF) with 10-μm pitch I/O bumps. Utilizing the proposed Streaming Near Range - 10μm (SNR-10) channel, the inter-chip links archive 0.38pJ/bit efficiency at 1.1GHz, and the highest bandwidth density per layer at 149Gbps/mm/layer. In an effort to further increase SNR-10 bandwidth without sacrificing technology portability, a 2.1mW, very wide range, 0.0032mm2, fully synthesizable DLL is developed. The DLL uses a ring oscillator and counter based coarse delay line to reduce area and increase frequency range. It uses an active-preemptive fine delay line switching scheme to reduce DNL, and uses independent dual-edge delays to allow duty cycle tracking, enabling high-speed DDR links in future revisions of SNR-10.

Energy Efficient Hardware-Software Co-Synthesis Using Reconfigurable Hardware

Energy Efficient Hardware-Software Co-Synthesis Using Reconfigurable Hardware PDF Author: Jingzhao Ou
Publisher: CRC Press
ISBN: 1584887427
Category : Computers
Languages : en
Pages : 225

Get Book Here

Book Description
Rapid energy estimation for energy efficient applications using field-programmable gate arrays (FPGAs) remains a challenging research topic. Energy dissipation and efficiency have prevented the widespread use of FPGA devices in embedded systems, where energy efficiency is a key performance metric. Helping overcome these challenges, Energy Efficient

Low-power Heterogeneous Reconfigurable Digital Signal Processors with Energy-efficient Interconnect Network

Low-power Heterogeneous Reconfigurable Digital Signal Processors with Energy-efficient Interconnect Network PDF Author: Hui Zhang (Ph.D.)
Publisher:
ISBN:
Category :
Languages : en
Pages : 252

Get Book Here

Book Description


Processor Arrays

Processor Arrays PDF Author: Terry J. Fountain
Publisher:
ISBN:
Category : Computers
Languages : en
Pages : 240

Get Book Here

Book Description


Handbook of Signal Processing Systems

Handbook of Signal Processing Systems PDF Author: Shuvra S. Bhattacharyya
Publisher: Springer Science & Business Media
ISBN: 1441963456
Category : Technology & Engineering
Languages : en
Pages : 1099

Get Book Here

Book Description
It gives me immense pleasure to introduce this timely handbook to the research/- velopment communities in the ?eld of signal processing systems (SPS). This is the ?rst of its kind and represents state-of-the-arts coverage of research in this ?eld. The driving force behind information technologies (IT) hinges critically upon the major advances in both component integration and system integration. The major breakthrough for the former is undoubtedly the invention of IC in the 50’s by Jack S. Kilby, the Nobel Prize Laureate in Physics 2000. In an integrated circuit, all components were made of the same semiconductor material. Beginning with the pocket calculator in 1964, there have been many increasingly complex applications followed. In fact, processing gates and memory storage on a chip have since then grown at an exponential rate, following Moore’s Law. (Moore himself admitted that Moore’s Law had turned out to be more accurate, longer lasting and deeper in impact than he ever imagined. ) With greater device integration, various signal processing systems have been realized for many killer IT applications. Further breakthroughs in computer sciences and Internet technologies have also catalyzed large-scale system integration. All these have led to today’s IT revolution which has profound impacts on our lifestyle and overall prospect of humanity. (It is hard to imagine life today without mobiles or Internets!) The success of SPS requires a well-concerted integrated approach from mul- ple disciplines, such as device, design, and application.

Code Generation for Tightly Coupled Processor Arrays

Code Generation for Tightly Coupled Processor Arrays PDF Author: Srinivas Boppu
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description


Processor Description Languages

Processor Description Languages PDF Author: Prabhat Mishra
Publisher: Elsevier
ISBN: 0080558372
Category : Computers
Languages : en
Pages : 433

Get Book Here

Book Description
Efficient design of embedded processors plays a critical role in embedded systems design. Processor description languages and their associated specification, exploration and rapid prototyping methodologies are used to find the best possible design for a given set of applications under various design constraints, such as area, power and performance. This book is the first, comprehensive survey of modern architecture description languages and will be an invaluable reference for embedded system architects, designers, developers, and validation engineers. Readers will see that the use of particular architecture description languages will lead to productivity gains in designing particular (application-specific) types of embedded processors. * Comprehensive coverage of all modern architecture description languages... use the right ADL to design your processor to fit your application;* Most up-to-date information available about each architecture description language from the developers...save time chasing down reliable documentation;* Describes how each architecture desccription language enables key design automation tasks, such as simulation, synthesis and testing...fit the ADL to your design cycle;