The code runs a small benchmarks of your blas implementation using multiplication of square matrices. You can pass the size as an argument of the executable.
Two versions are provided, one of which is using the USM inferface.
If the environment is correctly set you should be able to run the sample with:
mkdir build;
cd build
CXX=clang++ cmake .. -DCMAKE_BUILD_TYPE=Release
cmake --build .
sycl_unique<T>is a unique pointer to a USM allocated memory which wraps astd::unique_ptr<T>with a custom deleter and holds the allocated size.fill_randfills astd::vector<T>orsycl_unique<T>with random values.
- Working example adapted from here