Authors: Gajić, Dušan
Stanković, Radomir 
Title: The impact of address arithmetic on the gpu implementation of fast algorithms for the vilenkin-chrestenson transform
Journal: Proceedings of The International Symposium on Multiple-Valued Logic
First page: 296
Last page: 301
Conference: 43rd International Symposium on Multiple-Valued Logic, ISMVL 2013; Toyama; Japan; 22 May 2013 through 24 May 2013
Issue Date: 1-Aug-2013
ISBN: 978-0-769-54976-7
ISSN: 0195-623X
DOI: 10.1109/ISMVL.2013.59
This paper considers the impact of address arithmetic in the Cooley-Tukey and the constant geometry fast algorithms for the Vilenkin-Chrestenson transform on their implementation for the graphics processing unit (GPU). We consider issues such as using different transform radices and analyze the number of GPU instructions and register usage in the OpenCL implementations of the considered algorithms. Further, we compare the program running times on the GPU and on the central processing unit (CPU). Experiments show that the GPU implementations are from 10 to 22 times faster than the C/C++ CPU implementations, depending on the transform radix and the number of variables in the processed function. The OpenCL implementation of the constant geometry algorithm translates into a lower number of GPU arithmetic and fetch instructions and uses less registers. This implementation requires up to 21% shorter processing times than the corresponding Cooley-Tukey algorithm implementation.
Keywords: GPU computing | Multiple-valued logic | spectral techniques | Vilenkin-Chrestenson transform
Publisher: IEEE

Show full item record


checked on Apr 22, 2024

Page view(s)

checked on Apr 23, 2024

Google ScholarTM




Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.