Bandwidth-efficient on-chip interconnect designs for GPGPUs (Conference Paper)

abstract

  • © 2015 ACM. Modern computational workloads require abundant thread-level parallelism (TLP), necessitating highly parallel, many-core accelerators such as General-Purpose Graphics Processing Units (GPGPUs). GPGPUs place a heavy demand on the on-chip interconnect between the many cores and the few memory controllers (MCs). The resulting traffic is highly asymmetric, impacting on-chip resource utilization and system performance. Here, we analyze the communication demands of typical GPGPU applications and propose efficient Network-on-Chip (NoC) designs to meet those demands. We show that the proposed schemes improve performance by up to 64.7%. Compared to the best-of-class prior work, our VC monopolizing and partitioning schemes improve performance by 25%.
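
  The abstract mentions virtual-channel (VC) monopolizing and partitioning schemes but does not detail how they work. The sketch below is a minimal, hypothetical illustration of the general VC-partitioning idea in a NoC router input port, where VCs are split between the two asymmetric GPGPU traffic classes (many-to-few requests from cores to MCs, few-to-many replies from MCs to cores). All names, the VC count, and the 2/6 split are illustrative assumptions, not the authors' actual design.

    // Illustrative sketch (not the paper's implementation): a router input port
    // that partitions its virtual channels between request and reply traffic,
    // giving the heavier reply class the larger share of VCs.
    #include <array>
    #include <cstdio>
    #include <optional>

    enum class TrafficClass { Request, Reply };

    constexpr int kNumVCs       = 8;  // hypothetical VC count per input port
    constexpr int kReplyVCStart = 2;  // VCs [0,2) for requests, [2,8) for replies

    struct VirtualChannel {
        bool busy = false;
    };

    class InputPort {
        std::array<VirtualChannel, kNumVCs> vcs_;
    public:
        // Allocate a free VC from the partition reserved for the given class.
        std::optional<int> allocate(TrafficClass cls) {
            const int begin = (cls == TrafficClass::Request) ? 0 : kReplyVCStart;
            const int end   = (cls == TrafficClass::Request) ? kReplyVCStart : kNumVCs;
            for (int v = begin; v < end; ++v) {
                if (!vcs_[v].busy) {
                    vcs_[v].busy = true;
                    return v;
                }
            }
            return std::nullopt;  // no free VC in this partition; the packet stalls
        }
        void release(int v) { vcs_[v].busy = false; }
    };

    int main() {
        InputPort port;
        auto req = port.allocate(TrafficClass::Request);
        auto rep = port.allocate(TrafficClass::Reply);
        std::printf("request VC: %d, reply VC: %d\n",
                    req.value_or(-1), rep.value_or(-1));
        return 0;
    }

  Keeping the two classes in disjoint VC partitions prevents the bursty reply traffic from starving request packets at routers near the memory controllers, which is the kind of asymmetry the abstract describes.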

author list (cited authors)

  • Jang, H., Kim, J., Gratz, P., Yum, K. H., & Kim, E. J.

citation count

  • 46

publication date

  • June 2015