Available on (x86 or x86-64) and target feature
fma
and x86 only.Expand description
Multiplies packed single-precision (32-bit) floating-point elements in a
and b
, and add the negated intermediate result to packed elements in c
.