Characterizing the L1 data cache's vulnerability to transient errors in chip-multiprocessors

Document Type

Conference Proceeding

Publication Date

9-14-2011

Abstract

With continuous technology scaling, current and next generation microprocessors are becoming more vulnerable to transient errors such as soft errors induced by energetic particle strikes. While mainstream microprocessors are employing multi-/many-core architectures targeting at high-performance parallel computing applications, the transistor/area share of on-chip caches keeps increasing. As cache memories being the major victim of soft errors, it is of paramount importance to characterize on-chip cache's vulnerability in this context for devising potential reliability optimizations, especially under the interaction with cache coherence protocols. In this work, we develop a lifetime model for the private L1 data cache in chip-multiprocessors (CMPs), which is based on the cache activities and the states of cachelines. This lifetime model is then applied to characterize and predict cache's vulnerability trend in CMPs. Our experimental evaluation shows that cache vulnerable phases due to remote accesses increase dramatically as the number of processor cores increases. Based on vulnerable phase analysis, we propose a protocol enhancement to prematurely invalidate cachelines in modified (M) state for minimizing the vulnerability factor due to remote reads to modified cachelines. © 2011 IEEE.

Identifier

80052598450 (Scopus)

ISBN

[9780769544472]

Publication Title

Proceedings 2011 IEEE Computer Society Annual Symposium on VLSI Isvlsi 2011

External Full Text Location

https://doi.org/10.1109/ISVLSI.2011.23

First Page

266

Last Page

271

This document is currently not available here.

Share

COinS