Loading…

Effective cache bank placement for GPUs

The placement of the Last Level Cache (LLC) banks in the GPU on-chip network can significantly affect the performance of memory-intensive workloads. In this paper, we attempt to offer a placement methodology for the LLC banks to maximize the performance of the on-chip network connecting the LLC bank...

Full description

Saved in:

Bibliographic Details
Main Authors:	Sadrosadati, Mohammad, Mirhosseini, Amirhossein, Roozkhosh, Shahin, Bakhishi, Hazhir, Sarbazi-Azad, Hamid
Format:	Conference Proceeding
Language:	English
Subjects:	Bandwidth Graphics processing units Measurement Message systems Optimization System-on-chip Throughput
Online Access:	Request full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	The placement of the Last Level Cache (LLC) banks in the GPU on-chip network can significantly affect the performance of memory-intensive workloads. In this paper, we attempt to offer a placement methodology for the LLC banks to maximize the performance of the on-chip network connecting the LLC banks to the streaming multiprocessors in GPUs. We argue that an efficient placement needs to be derived based on a novel metric that considers the latency hiding capability of the GPUs through thread level parallelism. To this end, we propose a throughput aware metric, called Effective Latency Impact (ELI). Moreover, we define an optimization problem to formulate our placement approach based on the ELI metric mathematically. To solve this optimization problem, we deploy a heuristic solution as this optimization problem is NP-hard. Experimental results show that our placement approach improves the performance by up to 15.7% compared to the state-of-the-art placement.
ISSN:	1558-1101
DOI:	10.23919/DATE.2017.7926954