Abstract
Streaming SIMD Extensions (SSE) is a unique feature embedded in the Pentium III and IV classes of microprocessors. By fully exploiting SSE, parallel algorithms can be implemented on a standard personal computer and a theoretical speedup of four can be achieved. In this paper, we demonstrate the implementation of a parallel LU matrix decomposition algorithm for solving linear systems with SSE and discuss advantages and disadvantages of this approach based on our experimental study.
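
The theoretical speedup of four follows from the width of an SSE register: 128 bits, or four single-precision floats, so one instruction can update four matrix entries at once. The following is a minimal sketch of how the row-elimination step of LU decomposition might be vectorized with SSE intrinsics; it is not the implementation from the paper. It assumes a row-major `float` matrix, no pivoting, and unaligned loads, and the function name `lu_sse` is hypothetical. Columns that do not fill a group of four fall back to a scalar loop, which is one reason measured speedups tend to stay below the theoretical factor.

```c
#include <assert.h>
#include <stddef.h>
#include <xmmintrin.h>  /* SSE intrinsics: __m128, _mm_loadu_ps, ... */

/* In-place LU decomposition without pivoting (assumption: the matrix
 * needs no row exchanges). a is an n x n row-major matrix. On return,
 * the strict lower triangle holds the multipliers of L (unit diagonal
 * implied) and the upper triangle holds U.
 * Returns 0 on success, -1 if a zero pivot is encountered. */
int lu_sse(float *a, size_t n)
{
    assert(a != NULL && n > 0);

    for (size_t k = 0; k < n; ++k) {
        float pivot = a[k * n + k];
        if (pivot == 0.0f)
            return -1;

        for (size_t i = k + 1; i < n; ++i) {
            /* Multiplier for row i; stored in place as L[i][k]. */
            float m = a[i * n + k] / pivot;
            a[i * n + k] = m;

            /* Broadcast the multiplier to all four lanes. */
            __m128 vm = _mm_set1_ps(m);

            /* Vector part: row_i[j..j+3] -= m * row_k[j..j+3]. */
            size_t j = k + 1;
            for (; j + 4 <= n; j += 4) {
                __m128 u = _mm_loadu_ps(&a[k * n + j]);
                __m128 r = _mm_loadu_ps(&a[i * n + j]);
                r = _mm_sub_ps(r, _mm_mul_ps(vm, u));
                _mm_storeu_ps(&a[i * n + j], r);
            }

            /* Scalar tail for the remaining (fewer than four) columns. */
            for (; j < n; ++j)
                a[i * n + j] -= m * a[k * n + j];
        }
    }
    return 0;
}
```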
Field | Value |
---|---|
Original language | English |
Pages (from-to) | 39-44 |
Number of pages | 6 |
Journal | Microprocessors and Microsystems |
Volume | 26 |
Issue number | 1 |
Publication status | Published - 25 Feb 2002 |
Keywords
- Instruction level parallelism
- LU decomposition
- Parallel algorithms
ASJC Scopus subject areas
- Software
- Hardware and Architecture
- Computer Networks and Communications
- Artificial Intelligence