Highly Efficient FFT for Exascale: HeFFTe v2.3
|
SYCL/DPC++ specific methods, vector-like container, error checking, etc. More...
Classes | |
struct | cos_pre_pos_processor |
Implementation of Cosine Transform pre-post processing methods using CUDA. More... | |
struct | sin_pre_pos_processor |
Implementation of Cosine Transform pre-post processing methods using CUDA. More... | |
Functions | |
sycl::queue | make_sycl_queue () |
Creates a new SYCL queue, try to use the GPU but if an issue is encountered then default to the CPU. | |
template<typename precision_type , typename index > | |
void | convert (sycl::queue &stream, index num_entries, precision_type const source[], std::complex< precision_type > destination[]) |
Convert real numbers to complex when both are located on the GPU device. More... | |
template<typename precision_type , typename index > | |
void | convert (sycl::queue &stream, index num_entries, std::complex< precision_type > const source[], precision_type destination[]) |
Convert complex numbers to real when both are located on the GPU device. More... | |
template<typename scalar_type , typename index > | |
void | scale_data (sycl::queue &stream, index num_entries, scalar_type *data, double scale_factor) |
Scales real data (double or float) by the scaling factor. | |
template<typename scalar_type , typename index > | |
void | direct_pack (sycl::queue &stream, index nfast, index nmid, index nslow, index line_stride, index plane_stide, scalar_type const source[], scalar_type destination[]) |
Performs a direct-pack operation for data sitting on the GPU device. More... | |
template<typename scalar_type , typename index > | |
void | direct_unpack (sycl::queue &stream, index nfast, index nmid, index nslow, index line_stride, index plane_stide, scalar_type const source[], scalar_type destination[]) |
Performs a direct-unpack operation for data sitting on the GPU device. More... | |
template<typename scalar_type , typename index > | |
void | transpose_unpack (sycl::queue &stream, index nfast, index nmid, index nslow, index line_stride, index plane_stide, index buff_line_stride, index buff_plane_stride, int map0, int map1, int map2, scalar_type const source[], scalar_type destination[]) |
Performs a transpose-unpack operation for data sitting on the GPU device. More... | |
Variables | |
sycl::queue | internal_sycl_queue |
Default queue to use in case the user does not provide one. More... | |
SYCL/DPC++ specific methods, vector-like container, error checking, etc.
The name is chosen distinct from the oneMKL name that use "oneapi".