Performance optimization of an FPGA-based configurable multiprocessor for matrix operations

Document Type

Conference Proceeding

Publication Date

1-1-2003

Abstract

Several driving forces have recently brought about significant advances in the field of configurable computing. They have also enabled parallel processing within a single field-programmable gate array (FPGA) chip. The ever-increasing complexity of application algorithms and the supercomputing crisis have made this new parallel-processing approach more important and pertinent. Its cost-effectiveness provides system designers with the greatest flexibility while imposing many challenges to current hardware and software codesign methodologies. This paper explores practical hardware and software design and implementation issues for FPGA-based configurable multiprocessors, based on the authors' first-hand experience with a shared-memory implementation of parallel LU factorization for sparse block-diagonal-bordered (BDB) matrices. We also propose a new dynamic load balancing strategy for parallel LU factorization on our system. Performance results are included to prove the viability of this new multiprocessor design approach.

Identifier

84946069788 (Scopus)

ISBN

[0780383206, 9780780383203]

Publication Title

Proceedings 2003 IEEE International Conference on Field Programmable Technology Fpt 2003

External Full Text Location

https://doi.org/10.1109/FPT.2003.1275763

First Page

303

Last Page

306

Grant

ER63384

This document is currently not available here.

Share

COinS