Available on x86-64 and target feature
avx512f
only.Expand description
Loads 512-bits (composed of 16 packed single-precision (32-bit)
floating-point elements) from memory into result.
mem_addr
does not need to be aligned on any particular boundary.