FPGA implementation of a cholesky algorithm for a shared-memory multiprocessor architecture
Document Type
Article
Publication Date
12-1-2004
Abstract
Solving a system of linear equations is a key problem in engineering and science. Matrix factorization is a key component of many methods used to solve such equations. However, the factorization process is very time consuming, so these problems have often been targeted for parallel machines rather than sequential ones. Nevertheless, commercially available supercomputers are expensive and only large institutions have the resources to purchase them. Hence, efforts are on to develop more affordable alternatives. In this paper, we propose such an approach. We present an implementation of a parallel version of the Cholesky matrix factorization algorithm on a single-chip multiprocessor built inside an APEX20K series Field-Programmable Gate Array (FPGA) developed by Altera. Our multiprocessor system uses an asymmetric, shared-memory MIMD architecture and was built using the configurable Nios× processor core which was also developed by Altera. Our system was developed using Altera's System-On-a-Programmable-Chip (SOPC) Quartus II development environment. Our Cholesky implementation is based on an algorithm described by George et al. [6]. This algorithm is scalable and uses a "queue of tasks" approach to ensure dynamic load-balancing among the processing elements. Our implementation assumes dense matrices in the input. We present performance results for uniprocessor and multiprocessor implementations. Our results show that the implementation of multiprocessors inside FPGAs can benefit matrix operations, such as matrix factorization. Further benefits result from good dynamic load-balancing techniques.
Identifier
6344242990 (Scopus)
Publication Title
Parallel Algorithms and Applications
External Full Text Location
https://doi.org/10.1080/10637190412331279957
ISSN
10637192
First Page
211
Last Page
226
Issue
4
Volume
19
Grant
DE-FG02-03CH11171
Recommended Citation
Haridas, Satchidanand G. and Ziavras, Sotirios G., "FPGA implementation of a cholesky algorithm for a shared-memory multiprocessor architecture" (2004). Faculty Publications. 20053.
https://digitalcommons.njit.edu/fac_pubs/20053
