Files in this item

FilesDescriptionFormat

application/pdf

application/pdfA Near-Memory P ... Manipulation Workloads.pdf (196kB)
(no description provided)PDF

Description

Title:A Near-Memory Processor for Vector, Streaming and Bit Manipulation Workloads
Author(s):Wei, Mingliang; Snir, Marc; Torrellas, Josep; Tremaine, R. Brett
Subject(s):Computer Architecture Parallel Computing
Abstract:Many important scientific and engineering applications execute sub-optimally on current commodity processors and servers. Specifically, since they frequently use caches ineffectively, they are often heavily bottlenecked by global memory bandwidth. In addition, they sometimes need to perform expensive bit manipulation operations that are not efficiently supported by commodity ISAs. Moreover, an analysis of technology trends suggests that, despite the criticality of some of these applications, future commodity processors and servers are unlikely to be tuned for them. To address this problem, this paper proposes the design of a simple co-processor on which the main processor can off-load vector, streaming, and bit-manipulation computation. The co-processor is a blocked-multithreaded narrow in-order core with support for vectors, streams, and bit manipulation. It has no caches and a high bandwidth to memory. For this reason, rather than for its actual physical location, we call it Near-Memory Processor (NMP). Our simulations show that a set of scientific applications run much faster on the NMP than on an aggressive conventional processor. Specifically, the speedups obtained reach 18, with a geometric mean of 5.8 for 10 applications.
Issue Date:2005-02
Genre:Technical Report
Type:Text
URI:http://hdl.handle.net/2142/11011
Other Identifier(s):UIUCDCS-R-2005-2557
Rights Information:You are granted permission for the non-commercial reproduction, distribution, display, and performance of this technical report in any format, BUT this permission is only for a period of 45 (forty-five) days from the most recent time that you verified that this technical report is still available from the University of Illinois at Urbana-Champaign Computer Science Department under terms that include this permission. All other rights are reserved by the author(s).
Date Available in IDEALS:2009-04-17


This item appears in the following Collection(s)

Item Statistics