DD-AVX
2.0.0
|
DD-AVX_v3 is SIMD accelerated simple interface high precision BLAS Lv.1 and Sparse BLAS operations Library.
These operations can be performed by combining double and double-double precision.
Multi-threading (OpenMP) and SIMD AVX / AVX2 / AVX512 acceleration are available.
This library provides an easy way to implement a fast and accurate Krylov subspace method.
This library is extensions of Lis_DD_AVXKernals and DD-AVX_v2 (archived).
This library provides BLAS / Sparse BLAS functions for the following six types.
It has BLAS Lv.1 and Sparse BLAS functions for these types.
All combinations of BLAS functions are implemented. It works for both D and DD types.
See the axpy sample code and CG method sample code for more information on how to use it.
This library requires the QD library for scalar operations as a submodule. The QD library is downloaded and built automatically by make
.
You can specify the destination directory with DD_AVX_DIR
and make
. The QD libraries are installed in the same directory.
You can build and install the QD library with the following commands:
make avx
make install
make avx2
make install
make avx512
make install
Since github actions's runner does not support AVX512, github actions only verifies the build. We performed manual testing on an Intel® Xeon® Platinum 8280.
https://hishinuma-t.dev/DD-AVX_v3/ (generated by Doxygen)
This is a beta version, and there are some restrictions and changes planned.
The detailed todo is discussed in Issue
We have a complete set of tests for each feature in the test/
directory.
cd test/
make
make test_all
We have implemented an example of the CG method, which is available in sample/
.