Characterizing the L1 data cache's vulnerability to transient errors in chip-multiprocessors
Document Type
Conference Proceeding
Publication Date
9-14-2011
Abstract
With continuous technology scaling, current and next generation microprocessors are becoming more vulnerable to transient errors such as soft errors induced by energetic particle strikes. While mainstream microprocessors are employing multi-/many-core architectures targeting at high-performance parallel computing applications, the transistor/area share of on-chip caches keeps increasing. As cache memories being the major victim of soft errors, it is of paramount importance to characterize on-chip cache's vulnerability in this context for devising potential reliability optimizations, especially under the interaction with cache coherence protocols. In this work, we develop a lifetime model for the private L1 data cache in chip-multiprocessors (CMPs), which is based on the cache activities and the states of cachelines. This lifetime model is then applied to characterize and predict cache's vulnerability trend in CMPs. Our experimental evaluation shows that cache vulnerable phases due to remote accesses increase dramatically as the number of processor cores increases. Based on vulnerable phase analysis, we propose a protocol enhancement to prematurely invalidate cachelines in modified (M) state for minimizing the vulnerability factor due to remote reads to modified cachelines. © 2011 IEEE.
Identifier
80052598450 (Scopus)
ISBN
[9780769544472]
Publication Title
Proceedings 2011 IEEE Computer Society Annual Symposium on VLSI Isvlsi 2011
External Full Text Location
https://doi.org/10.1109/ISVLSI.2011.23
First Page
266
Last Page
271
Recommended Citation
Tang, Li; Wang, Shuai; Hu, Jie; and Hu, Xiaobo Sharon, "Characterizing the L1 data cache's vulnerability to transient errors in chip-multiprocessors" (2011). Faculty Publications. 11177.
https://digitalcommons.njit.edu/fac_pubs/11177
