Intel® Math Kernel Library 11.3 Update 4 Developer Guide
The Intel Optimized HPCG package includes a prebuilt HPCG benchmark for Intel MPI 4.1.3 or higher. All the benchmark files are located in the ./benchmarks/hpcg subdirectory of the Intel MKL directory. The following versions of the benchmark are available:
| File in ./benchmarks/hpcg/bin | Description |
|---|---|
| xhpcg_avx | The Intel AVX optimized version of the benchmark, optimized for systems based on the first and the second generations of Intel Xeon processor E3 family, Intel Xeon processor E5 family, or Intel Xeon processor E7 family. |
| xhpcg_avx2 | The Intel AVX2 optimized version of the benchmark, optimized for systems based on Intel Xeon E3-xxxx v3 processor and future Intel processors with Intel AVX2 support. Running the Intel AVX optimized version of the benchmark on an Intel AVX2 enabled system produces non-optimal performance. The Intel AVX2 optimized version of the benchmark does not run on systems that do not support Intel AVX2. |
| xhpcg_mic | The Intel Xeon Phi coprocessor optimized version of the benchmark, to be used for native runs on Intel Xeon Phi coprocessors. It can also be used along with the Intel AVX optimized version or the Intel AVX2 optimized version for symmetric runs. A symmetric run involves xhpcg_mic running on the Intel Xeon Phi coprocessors and xhpcg_avx or xhpcg_avx2 running on Intel Xeon processors. MPI ranks can be on both Intel Xeon processor hosts and Intel Xeon Phi coprocessors. This version works only with Intel® MPI. |
| xhpcg_offload | The Intel Xeon Phi coprocessor optimized version for the offload mode. This version runs on an Intel Xeon system and offloads computations to Intel Xeon Phi coprocessor(s). Unlike the Intel Xeon Phi coprocessor optimized version for native runs, MPI ranks are only on the Intel Xeon processor hosts and not on the Intel Xeon Phi coprocessors. Running this version of the benchmark requires the Redistributable Libraries package for the Intel® Parallel Studio XE Composer Edition (for details, see https://software.intel.com/en-us/articles/redistributables-for-intel-parallel-studio-xe-2015-composer-edition-for-linux). For the supported versions of Intel Parallel Studio XE Composer Edition, see Intel MKL System Requirements. |
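
The choice between the two host-only versions follows from the instruction sets the processor supports. The following is a minimal sketch of that choice, not part of the package: it assumes a Linux host that exposes CPU feature flags in /proc/cpuinfo, the helper names `read_cpu_flags` and `choose_hpcg_binary` are hypothetical, and the binary names use the xhpcg_ prefix that appears in the description of symmetric runs.

```python
from pathlib import Path


def read_cpu_flags(cpuinfo_path="/proc/cpuinfo"):
    """Return the CPU feature flags listed in a Linux-style cpuinfo file.

    Assumption: the file contains a line of the form "flags : fpu vme ...".
    Returns an empty set if no such line is found.
    """
    for line in Path(cpuinfo_path).read_text().splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            return set(value.split())
    return set()


def choose_hpcg_binary(cpu_flags):
    """Pick the prebuilt host binary name for a set of CPU feature flags.

    The AVX2 build is preferred when available, because the AVX build
    gives non-optimal performance on AVX2-enabled systems, and the AVX2
    build does not run on systems without AVX2.
    """
    if "avx2" in cpu_flags:
        return "xhpcg_avx2"
    if "avx" in cpu_flags:
        return "xhpcg_avx"
    raise RuntimeError("CPU reports neither AVX nor AVX2 support")


if __name__ == "__main__":
    # Prints the suggested file name from ./benchmarks/hpcg/bin.
    print(choose_hpcg_binary(read_cpu_flags()))
```

The sketch covers only the Intel Xeon processor versions; whether to use the native or offload coprocessor version depends on how the Intel Xeon Phi coprocessors are to be used and cannot be inferred from the host CPU flags alone.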
The Intel Optimized HPCG package also includes the source code and libraries necessary to build the following versions of the benchmark for other MPI implementations, such as SGI MPT*, MPICH2, or Open MPI: the Intel AVX optimized version, the Intel AVX2 optimized version, and the Intel Xeon Phi coprocessor optimized version for offload mode. Build instructions are available in the QUICKSTART file included with the package. The Intel Xeon Phi coprocessor optimized version for native runs is available only for the Intel MPI Library.
Intel's compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Notice revision #20110804