EaseMiss: HW/SW Co-Optimization for Efficient Large Matrix-Matrix Multiply Operations
Document Type
Conference Proceeding
Publication Date
1-1-2022
Abstract
Due to the essential role of matrix multiplication in many scientific applications, especially in data and compute -intensive applications, we explore the efficiency of highly used matrix production algorithms. This paper proposes an HW/SW co-optimization technique, entitled EaseMiss, to reduce the cache miss ratio for large general matrix-matrix multiplications. First, we revise the algorithms by applying three software optimization techniques to improve performance. Choosing the proper algorithms to achieve the best performance is examined and formulated. By leveraging the proposed optimizations, the number of cache misses decreases by a factor of 3 in a conventional data cache. To further improve, we then propose SPLiTCACHE to virtually split data cache regarding matrices' dimensions for better data reuse. This method can be easily embedded into conventional general-purpose processors or GPUs at the cost of negligible logical circuit overhead. After using the correct and valid splitting, the obtained results show that the cache misses reduce by a factor of 2 compared to the conventional data cache on average in the machine learning workloads.
Identifier
85137681716 (Scopus)
ISBN
[9781665498852]
Publication Title
Proceedings of the 2022 IEEE Dallas Circuits and Systems Conference Dcas 2022
External Full Text Location
https://doi.org/10.1109/DCAS53974.2022.9845629
Recommended Citation
Nezhadi, Ali; Angizi, Shaahin; and Roohi, Arman, "EaseMiss: HW/SW Co-Optimization for Efficient Large Matrix-Matrix Multiply Operations" (2022). Faculty Publications. 3411.
https://digitalcommons.njit.edu/fac_pubs/3411