|
Highly Efficient FFT for Exascale: HeFFTe v2.3
|
SYCL/DPC++ specific methods, vector-like container, error checking, etc. More...
Classes | |
| struct | cos_pre_pos_processor |
| Implementation of Cosine Transform pre-post processing methods using CUDA. More... | |
| struct | sin_pre_pos_processor |
| Implementation of Cosine Transform pre-post processing methods using CUDA. More... | |
Functions | |
| sycl::queue | make_sycl_queue () |
| Creates a new SYCL queue, try to use the GPU but if an issue is encountered then default to the CPU. | |
| template<typename precision_type , typename index > | |
| void | convert (sycl::queue &stream, index num_entries, precision_type const source[], std::complex< precision_type > destination[]) |
| Convert real numbers to complex when both are located on the GPU device. More... | |
| template<typename precision_type , typename index > | |
| void | convert (sycl::queue &stream, index num_entries, std::complex< precision_type > const source[], precision_type destination[]) |
| Convert complex numbers to real when both are located on the GPU device. More... | |
| template<typename scalar_type , typename index > | |
| void | scale_data (sycl::queue &stream, index num_entries, scalar_type *data, double scale_factor) |
| Scales real data (double or float) by the scaling factor. | |
| template<typename scalar_type , typename index > | |
| void | direct_pack (sycl::queue &stream, index nfast, index nmid, index nslow, index line_stride, index plane_stide, scalar_type const source[], scalar_type destination[]) |
| Performs a direct-pack operation for data sitting on the GPU device. More... | |
| template<typename scalar_type , typename index > | |
| void | direct_unpack (sycl::queue &stream, index nfast, index nmid, index nslow, index line_stride, index plane_stide, scalar_type const source[], scalar_type destination[]) |
| Performs a direct-unpack operation for data sitting on the GPU device. More... | |
| template<typename scalar_type , typename index > | |
| void | transpose_unpack (sycl::queue &stream, index nfast, index nmid, index nslow, index line_stride, index plane_stide, index buff_line_stride, index buff_plane_stride, int map0, int map1, int map2, scalar_type const source[], scalar_type destination[]) |
| Performs a transpose-unpack operation for data sitting on the GPU device. More... | |
Variables | |
| sycl::queue | internal_sycl_queue |
| Default queue to use in case the user does not provide one. More... | |
SYCL/DPC++ specific methods, vector-like container, error checking, etc.
The name is chosen distinct from the oneMKL name that use "oneapi".