This sorting library gives programmers an easy way to increase sorting performance by harnessing the highly parallel computational power available in todays graphics cards.

Tests shows that it can outperform highly optimized CPU-based Quicksort with a factor of 10 on cards commonly available in 2007. Sorting 16 million floating point numbers or integers take less than half a second!

Benchmarks

 Benchmarks 

这个排序库可以让程序员利用现在GPU卡的高并行计算能力进行快速排序。

下面是测试报告:

Benchmarks

 Benchmarks

该图显示了一个均匀分布的浮点数数组进行排序时,在8800GTX显卡上性能。图上比较了在标准C库桑的几个算法:GPU快速排序库、基数排序,板蓝根/归并排序GPUSort双调排序)和Introsort(结合快速排序和堆排序)算法。

下载地址:http://www.cse.chalmers.se/research/group/dcs/gpuqsortdcs.html