Parallel Adaptive Stochastic Gradient Descent Algorithms for Latent Factor Analysis of High-Dimensional and Incomplete Industrial Data
Document Type
Article
Publication Date
1-1-2024
Abstract
Latent factor analysis (LFA) is efficient in knowledge discovery from a high-dimensional and incomplete (HDI) matrix frequently encountered in industrial big data-related applications. A stochastic gradient descent (SGD) algorithm is commonly adopted as a learning algorithm for LFA owing to its high efficiency. However, its sequential nature makes it less scalable when processing large-scale data. Although alternating SGD decouples an LFA process to achieve parallelization, its performance relies on its hyper-parameters that are highly expensive to tune. To address this issue, this paper presents three extended alternating SGD algorithms whose hyper-parameters are made adaptive through particle swarm optimization. Correspondingly, three Parallel Adaptive LFA (PAL) models are proposed and achieve highly efficient latent factor acquisition from an HDI matrix. Experiments have been conducted on four HDI matrices collected from industrial applications, and the benchmark models are LFA models based on state-of-the-art parallel SGD algorithms including the alternative SGD, Hogwild!, distributed gradient descent, and sparse matrix factorization parallelization. The results demonstrate that compared with the benchmarks, with 32 threads, the proposed PAL models achieve much speedup gain. They achieve the highest prediction accuracy for missing data on most cases.
Identifier
85161536693 (Scopus)
Publication Title
IEEE Transactions on Automation Science and Engineering
External Full Text Location
https://doi.org/10.1109/TASE.2023.3267609
e-ISSN
15583783
ISSN
15455955
First Page
2716
Last Page
2729
Issue
3
Volume
21
Grant
62272078
Fund Ref
National Natural Science Foundation of China
Recommended Citation
Qin, Wen; Luo, Xin; Li, Shuai; and Zhou, Meng Chu, "Parallel Adaptive Stochastic Gradient Descent Algorithms for Latent Factor Analysis of High-Dimensional and Incomplete Industrial Data" (2024). Faculty Publications. 1168.
https://digitalcommons.njit.edu/fac_pubs/1168