Highly Efficient FFT for Exascale: HeFFTe v2.3
heffte::rocm Namespace Reference

ROCm-specific methods, vector-like container, error checking, etc.

Classes

struct  cos_pre_pos_processor
 Implementation of Cosine Transform pre- and post-processing methods using ROCm.
 
struct  sin_pre_pos_processor
 Implementation of Sine Transform pre- and post-processing methods using ROCm.
 

Functions

void check_error (hipError_t status, const char *function_name)
 Checks the status of a ROCm command and, in case of a failure, converts it to a C++ exception.
 
void check_error (rocfft_status status, const char *function_name)
 Checks the status of a rocFFT command and, in case of a failure, converts it to a C++ exception.
 
template<typename precision_type , typename index >
void convert (hipStream_t stream, index num_entries, precision_type const source[], std::complex< precision_type > destination[])
 Converts real numbers to complex when both are located on the GPU device.
 
template<typename precision_type , typename index >
void convert (hipStream_t stream, index num_entries, std::complex< precision_type > const source[], precision_type destination[])
 Converts complex numbers to real when both are located on the GPU device.
 
template<typename scalar_type , typename index >
void scale_data (hipStream_t stream, index num_entries, scalar_type *data, double scale_factor)
 Scales real data (double or float) by the scaling factor.
 
template<typename scalar_type , typename index >
void direct_pack (hipStream_t stream, index nfast, index nmid, index nslow, index line_stride, index plane_stide, scalar_type const source[], scalar_type destination[])
 Performs a direct-pack operation for data sitting on the GPU device.
 
template<typename scalar_type , typename index >
void direct_unpack (hipStream_t stream, index nfast, index nmid, index nslow, index line_stride, index plane_stide, scalar_type const source[], scalar_type destination[])
 Performs a direct-unpack operation for data sitting on the GPU device.
 
template<typename scalar_type , typename index >
void transpose_unpack (hipStream_t stream, index nfast, index nmid, index nslow, index line_stride, index plane_stide, index buff_line_stride, index buff_plane_stride, int map0, int map1, int map2, scalar_type const source[], scalar_type destination[])
 Performs a transpose-unpack operation for data sitting on the GPU device.
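
The direct-pack and direct-unpack operations can be illustrated with a host-side reference sketch. It assumes that packing copies an nfast x nmid x nslow sub-box out of a strided array into a contiguous buffer, and that unpacking performs the reverse copy. The names direct_pack_host and direct_unpack_host are hypothetical helpers written for this illustration; they are not part of HeFFTe, take no stream argument, and run on the CPU rather than the GPU device. The stride precondition checked by the assert is an assumption of the sketch.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical host-side reference for the assumed direct-pack semantics:
// gather an nfast x nmid x nslow sub-box from a strided source
// into a contiguous destination buffer.
template<typename scalar_type, typename index>
void direct_pack_host(index nfast, index nmid, index nslow,
                      index line_stride, index plane_stride,
                      scalar_type const source[], scalar_type destination[]){
    // Assumption of this sketch: lines and planes do not overlap.
    assert(line_stride >= nfast && plane_stride >= nmid * line_stride);
    scalar_type *out = destination;
    for(index slow = 0; slow < nslow; slow++){
        for(index mid = 0; mid < nmid; mid++){
            scalar_type const *line = source + slow * plane_stride + mid * line_stride;
            out = std::copy_n(line, nfast, out); // advance through the packed buffer
        }
    }
}

// Hypothetical host-side reference for the reverse operation:
// scatter a contiguous source buffer into a strided destination.
template<typename scalar_type, typename index>
void direct_unpack_host(index nfast, index nmid, index nslow,
                        index line_stride, index plane_stride,
                        scalar_type const source[], scalar_type destination[]){
    assert(line_stride >= nfast && plane_stride >= nmid * line_stride);
    scalar_type const *in = source;
    for(index slow = 0; slow < nslow; slow++){
        for(index mid = 0; mid < nmid; mid++){
            scalar_type *line = destination + slow * plane_stride + mid * line_stride;
            std::copy_n(in, nfast, line);
            in += nfast; // advance through the packed buffer
        }
    }
}
```

For a 4 x 3 x 2 array stored with line_stride = 4 and plane_stride = 12, packing a 2 x 2 x 2 sub-box that starts at the origin gathers the entries at offsets 0, 1, 4, 5, 12, 13, 16, and 17, in that order. Unpacking the same buffer with the same strides writes the values back to those offsets.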
 

Detailed Description

ROCm-specific methods, vector-like container, error checking, etc.
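
The error-checking helpers follow a status-to-exception pattern, which the following minimal sketch illustrates. To keep it buildable without a ROCm installation, it uses a stand-in enum, mock_status, in place of hipError_t or rocfft_status. The function name check_error_sketch, the use of std::runtime_error, and the message format are all assumptions of this sketch; the exception type and message produced by the actual check_error overloads are not stated on this page.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>

// Stand-in for hipError_t / rocfft_status (assumption: zero means success),
// used so the sketch compiles without the ROCm headers.
enum class mock_status { success = 0, failure = 1 };

// Hypothetical analogue of check_error(): returns silently on success,
// otherwise throws an exception naming the failing call.
inline void check_error_sketch(mock_status status, const char *function_name){
    assert(function_name != nullptr);
    if (status != mock_status::success){
        throw std::runtime_error(std::string(function_name)
                                 + " failed with error code: "
                                 + std::to_string(static_cast<int>(status)));
    }
}
```

A caller would wrap each runtime or library call in such a check, so that a failed call surfaces as a C++ exception at the call site instead of being silently ignored.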