TY - GEN
T1 - A fast and flexible sorting algorithm with CUDA
AU - Chen, Shifu
AU - Qin, Jing
AU - Xie, Yongming
AU - Zhao, Junping
AU - Heng, Pheng Ann
PY - 2009/9/21
Y1 - 2009/9/21
N2 - In this paper, we propose a fast and flexible sorting algorithm with CUDA. The proposed algorithm is much more practical than the previous GPU-based sorting algorithms, as it is able to handle the sorting of elements represented by integers, floats and structures. Meanwhile, our algorithm is optimized for the modern GPU architecture to obtain high performance. We use different strategies for sorting disorderly list and nearly-sorted list to make it adaptive. Extensive experiments demonstrate our algorithm has higher performance than previous GPU-based sorting algorithms and can support real-time applications.
AB - In this paper, we propose a fast and flexible sorting algorithm with CUDA. The proposed algorithm is much more practical than the previous GPU-based sorting algorithms, as it is able to handle the sorting of elements represented by integers, floats and structures. Meanwhile, our algorithm is optimized for the modern GPU architecture to obtain high performance. We use different strategies for sorting disorderly list and nearly-sorted list to make it adaptive. Extensive experiments demonstrate our algorithm has higher performance than previous GPU-based sorting algorithms and can support real-time applications.
KW - CUDA
KW - GPU-based sorting algorithm
KW - Parallel sorting algorithm
UR - http://www.scopus.com/inward/record.url?scp=70349127789&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-03095-6_28
DO - 10.1007/978-3-642-03095-6_28
M3 - Conference article published in proceeding or book
SN - 3642030947
SN - 9783642030949
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 281
EP - 290
BT - Algorithms and Architectures for Parallel Processing - 9th International Conference, ICA3PP 2009, Proceedings
T2 - 9th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2009
Y2 - 8 June 2009 through 11 June 2009
ER -