math/mpx-mul4-x86-sse2.S: `mmla4' only needs 48 bytes of stack.