Exploiting Execution Locality with a Decoupled Kilo-Instruction Processor

abstract

Overcoming increasing memory latency is one of the main problems that microprocessor designers have faced over the years. The two basic techniques introduced to mitigate latencies are caches and out-of-order execution. However, neither of these solutions is adequate for hiding off-chip memory accesses in the order of 200 cycles or more. Theoretically, increasing the size of the instruction window would allow much longer latencies to be hidden. But scaling the structures to support thousands of in-flight instructions would be prohibitively expensive. However, the distribution of instruction issue times under the presence of L2 cache misses is highly correlated. This paper describes this phenomenon of Execution Locality and shows how it can be exploited with an inexpensive microarchitecture consisting of two linked cores. This Decoupled Kilo-Instruction Processor (D-KIP) is very effective in recovering lost potential performance. Extensive simulations show that speed-ups of up to 379% are possible for numerical benchmarks thanks to the exploitation of impressive degrees of Memory-Level Parallelism (MLP) and the execution of independent instructions in the shadow of L2 misses. © Springer-Verlag Berlin Heidelberg 2008.

authors

Jimenez, Daniel

published proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

author list (cited authors)

Pericàs, M., Cristal, A., González, R., Jiménez, D. A., & Valero, M.

citation count

1

complete list of authors

Pericàs, Miquel||Cristal, Adrian||González, Ruben||Jiménez, Daniel A||Valero, Mateo

publication date

February 2008

publisher

Springer Nature Publisher

published in

ISSN 1611-3349 (Journal)

keywords

46 Information And Computing Sciences

Digital Object Identifier (DOI)

10.1007/978-3-540-77704-5_5

International Standard Book Number (ISBN) 10

3540777032

International Standard Book Number (ISBN) 13

9783540777038

start page

56

end page

67

volume

4759

URL

http://dx.doi.org/10.1007/978-3-540-77704-5_5

publication type

Conference Paper
