


For each element, performs MUL and ADD/SUB calculation with 3 operands and store the result in the first operand. (returns the result.)
Which operands to MUL, and which operand to ADD/SUB, depend on the order of the numbers (1, 2, 3) in the instruction name. depend on the order of arguments for intrinsic.
| ????? | instruction name | odd | even |
|---|---|---|---|
| FMADD | VFMADD132PS | (1) * (3) + (2) | |
| VFMADD213PS | (2) * (1) + (3) | ||
| VFMADD231PS | (2) * (3) + (1) | ||
| _mm_fmadd_ps _mm256_fmadd_ps _mm512_fmadd_ps |
a * b + c | ||
| FMSUB | VFMSUB132PS | (1) * (3) - (2) | |
| VFMSUB213PS | (2) * (1) - (3) | ||
| VFMSUB231PS | (2) * (3) - (1) | ||
| _mm_fmsub_ps _mm256_fmsub_ps _mm512_fmsub_ps |
a * b - c | ||
| FMADDSUB | VFMADDSUB132PS | (1) * (3) + (2) | (1) * (3) - (2) |
| VFMADDSUB213PS | (2) * (1) + (3) | (2) * (1) - (3) | |
| VFMADDSUB231PS | (2) * (3) + (1) | (2) * (3) - (1) | |
| _mm_fmaddsub_ps _mm256_fmaddsub_ps _mm512_fmaddsub_ps |
a * b + c | a * b - c | |
| FMSUBADD | VFMSUBADD132PS | (1) * (3) - (2) | (1) * (3) + (2) |
| VFMSUBADD213PS | (2) * (1) - (3) | (2) * (1) + (3) | |
| VFMSUBADD231PS | (2) * (3) - (1) | (2) * (3) + (1) | |
| _mm_fmsubadd_ps _mm256_fmsubadd_ps _mm512_fmsubadd_ps |
a * b - c | a * b + c | |
| FNMADD | VFNMADD132PS | - (1) * (3) + (2) | |
| VFNMADD213PS | - (2) * (1) + (3) | ||
| VFNMADD231PS | - (2) * (3) + (1) | ||
| _mm_fnmadd_ps _mm256_fnmadd_ps _mm512_fnmadd_ps |
- a * b + c | ||
| FNMSUB | VFNMSUB132PS | - (1) * (3) - (2) | |
| VFNMSUB213PS | - (2) * (1) - (3) | ||
| VFNMSUB231PS | - (2) * (3) - (1) | ||
| _mm_fnmsub_ps _mm256_fnmsub_ps _mm512_fnmsub_ps |
- a * b - c | ||
_mask_ if k bit is 0, a is copied.
_mask3_ if k bit is 0, c is copied.
_maskz_ if k bit is 0, zero cleared.