Performance optimization of an FPGA-based configurable multiprocessor for matrix operations
Document Type
Conference Proceeding
Publication Date
1-1-2003
Abstract
Several driving forces have recently brought about significant advances in the field of configurable computing. They have also enabled parallel processing within a single field-programmable gate array (FPGA) chip. The ever-increasing complexity of application algorithms and the supercomputing crisis have made this new parallel-processing approach more important and pertinent. Its cost-effectiveness provides system designers with the greatest flexibility while imposing many challenges to current hardware and software codesign methodologies. This paper explores practical hardware and software design and implementation issues for FPGA-based configurable multiprocessors, based on the authors' first-hand experience with a shared-memory implementation of parallel LU factorization for sparse block-diagonal-bordered (BDB) matrices. We also propose a new dynamic load balancing strategy for parallel LU factorization on our system. Performance results are included to prove the viability of this new multiprocessor design approach.
Identifier
84946069788 (Scopus)
ISBN
[0780383206, 9780780383203]
Publication Title
Proceedings 2003 IEEE International Conference on Field Programmable Technology Fpt 2003
External Full Text Location
https://doi.org/10.1109/FPT.2003.1275763
First Page
303
Last Page
306
Grant
ER63384
Recommended Citation
Wang, Xiaofang and Ziavras, Sotirios G., "Performance optimization of an FPGA-based configurable multiprocessor for matrix operations" (2003). Faculty Publications. 14327.
https://digitalcommons.njit.edu/fac_pubs/14327
