A comparative analysis of OpenMP and CUDA performance as exemplified by the computation of Fourier transform

Authors

  • B. Medetov IETP, Al Farabi Kazakh National University, Kazakhstan, Almaty
  • A. Koishigarin IETP, Al Farabi Kazakh National University, Kazakhstan, Almaty
  • A. Yskak IETP, Al Farabi Kazakh National University, Kazakhstan, Almaty
  • K. Niazaliev IETP, Al Farabi Kazakh National University, Kazakhstan, Almaty
  • A. Naurzbayeva IETP, Al Farabi Kazakh National University, Kazakhstan, Almaty

Keywords:

parallel computing, Fourier transform, NVIDIA CUDA, OpenMP, digital processing

Abstract

A comparative analysis of the performance of two parallel computing technologies, OpenMP and NVIDIA CUDA, has been carried out, using the computation of the Fourier transform as an example. The execution time of the Fourier transform on a multi-core central processor was found to depend nonlinearly on the number of cores. The form of this dependence also changes with the number of threads: when the number of threads is lower than the number of physical cores the dependence follows a power law, whereas when it is higher than the number of physical cores the dependence is exponential. The maximum efficiency of computation with OpenMP is achieved when the number of threads used in the program is twice the number of physical cores. A comparison conducted for this case showed that for a small number of frames OpenMP is more efficient in terms of execution time; otherwise, CUDA offers an advantage.

Published

2018-04-02

Section

Plasma Physics
