Parallel LU factorization of sparse matrices on FPGA-based configurable computing engines
Document Type
Article
Publication Date
4-10-2004
Abstract
Configurable computing, where hardware resources are configured appropriately to match specific hardware designs, has recently demonstrated its ability to significantly improve performance for a wide range of computation-intensive applications. With steady advances in silicon technology, as predicted by Moore's Law, Field-Programmable Gate Array (FPGA) technologies have enabled the implementation of System-on-a-Programmable-Chip (SOPC or SOC) computing platforms, which, in turn, have given a significant boost to the field of configurable computing. It is possible to implement various specialized parallel machines in a single silicon chip. In this paper, we describe our design and implementation of a parallel machine on an SOPC development board, using multiple instances of a soft IP configurable processor; we use this machine for LU factorization. LU factorization is widely used in engineering and science to solve efficiently large systems of linear equations. Our implementation facilitates the efficient solution of linear equations at a cost much lower than that of supercomputers and networks of workstations. The intricacies of our FPGA-based design are presented along with tradeoff choices made for the purpose of illustration. Performance results prove the viability of our approach. Copyright © 2004 John Wiley & Sons, Ltd.
Identifier
1842533207 (Scopus)
Publication Title
Concurrency and Computation Practice and Experience
External Full Text Location
https://doi.org/10.1002/cpe.748
ISSN
15320626
First Page
319
Last Page
343
Issue
4
Volume
16
Recommended Citation
Wang, Xiaofang and Ziavras, Sotirios G., "Parallel LU factorization of sparse matrices on FPGA-based configurable computing engines" (2004). Faculty Publications. 20389.
https://digitalcommons.njit.edu/fac_pubs/20389
