Matrix Multiplication with Straggler Tolerance in Coded Elastic Computing via Lagrange Code
Document Type
Conference Proceeding
Publication Date
1-1-2023
Abstract
In cloud computing systems, elastic events and stragglers increase the uncertainty of the system, leading to computation delays. Coded elastic computing (CEC), introduced by Yang et al. in 2018, is a framework that mitigates the impact of elastic events using Maximum Distance Separable (MDS) coded storage. That work proposed a CEC scheme for both matrix-vector multiplication and general matrix-matrix multiplication. However, in these applications, the CEC scheme cannot tolerate stragglers because of the limitations imposed by MDS codes. In this paper, we propose a new elastic computing scheme that uses uncoded storage and Lagrange coded computing. The proposed scheme effectively mitigates the effects of both elasticity and stragglers. Moreover, it achieves lower complexity and a smaller recovery threshold than existing coded-storage-based schemes.
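The abstract does not spell out the construction, so the following is only a rough sketch of the generic Lagrange coded computing idea applied to a matrix-vector product, not the scheme proposed in the paper. It assumes the matrix is split into K row blocks, each of N workers multiplies one Lagrange-encoded block by the vector, and any K finished workers suffice to decode. All function names, the evaluation points, and the use of exact rationals instead of a finite field are illustrative assumptions.

```python
from fractions import Fraction

def lagrange_basis(points, z):
    """Values of the Lagrange basis polynomials for `points`, evaluated at z."""
    vals = []
    for k, pk in enumerate(points):
        v = Fraction(1)
        for j, pj in enumerate(points):
            if j != k:
                v *= Fraction(z - pj, pk - pj)
        vals.append(v)
    return vals

def combine(weights, vectors):
    """Entrywise weighted sum of equally long vectors."""
    return [sum(w * vec[i] for w, vec in zip(weights, vectors))
            for i in range(len(vectors[0]))]

def encode(blocks, betas, alpha):
    """Coded block u(alpha), where u interpolates blocks[k] at betas[k]."""
    w = lagrange_basis(betas, alpha)
    return [combine(w, [blk[r] for blk in blocks])
            for r in range(len(blocks[0]))]

def matvec(rows, x):
    """Plain matrix-vector product on lists."""
    return [sum(a * b for a, b in zip(row, x)) for row in rows]

def decode(results, alphas_used, betas):
    """Interpolate the worker results and evaluate at each beta."""
    return [combine(lagrange_basis(alphas_used, b), results) for b in betas]

# Hypothetical setup: K = 3 data blocks, N = 5 workers.
A = [[1, 2], [3, 4], [5, 6], [7, 8], [9, 10], [11, 12]]
x = [2, 1]
K = 3
rows_per_block = len(A) // K
blocks = [A[i * rows_per_block:(i + 1) * rows_per_block] for i in range(K)]
betas = [0, 1, 2]            # interpolation points for the data blocks
alphas = [3, 4, 5, 6, 7]     # one evaluation point per worker

# Each worker multiplies its coded block by x.
worker_out = [matvec(encode(blocks, betas, a), x) for a in alphas]

# Workers 1 and 3 straggle; any K = 3 results are enough because the
# result polynomial has degree K - 1 for this linear computation.
finished = [0, 2, 4]
recovered = decode([worker_out[i] for i in finished],
                   [alphas[i] for i in finished], betas)
y = [entry for block in recovered for entry in block]
```

In this toy setting the recovery threshold equals K because the computation is linear in the stored data; how the paper combines this idea with uncoded storage and elastic events is not described in the abstract.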
Identifier
85178251832 (Scopus)
ISBN
9781538674628
Publication Title
IEEE International Conference on Communications
External Full Text Location
https://doi.org/10.1109/ICC45041.2023.10279134
ISSN
1550-3607
First Page
136
Last Page
141
Volume
2023-May
Grant
2145835
Fund Ref
National Science Foundation
Recommended Citation
Zhong, Xi; Kliewer, Jörg; and Ji, Mingyue, "Matrix Multiplication with Straggler Tolerance in Coded Elastic Computing via Lagrange Code" (2023). Faculty Publications. 2260.
https://digitalcommons.njit.edu/fac_pubs/2260