From: Mark Wooding Date: Sat, 5 Nov 2016 21:28:22 +0000 (+0000) Subject: math/mpx-mul4-x86-sse2.S: `mmla4' only need 48 bytes of stack. X-Git-Tag: 2.3.0~14 X-Git-Url: https://git.distorted.org.uk/~mdw/catacomb/commitdiff_plain/847e1f3e083c82fa118244b427dc60ef83f74f49?hp=f48fb6a6b1e0f29a45b42beae638ef9886312579 math/mpx-mul4-x86-sse2.S: `mmla4' only need 48 bytes of stack. --- diff --git a/math/mpx-mul4-x86-sse2.S b/math/mpx-mul4-x86-sse2.S index a6613ed0..e466cfa9 100644 --- a/math/mpx-mul4-x86-sse2.S +++ b/math/mpx-mul4-x86-sse2.S @@ -522,7 +522,7 @@ INTFUNC(mmul4) // of the sum U V + N Y to [EDI], leaving the remaining carry in // XMM4, XMM5, and XMM6. The registers XMM0, XMM1, XMM2, XMM3, and // XMM7 are clobbered; the general-purpose registers are preserved. - stalloc 64 // space for the carries + stalloc 48 // space for the carries endprologue // Calculate W = U V, and leave it in the destination. Stash the @@ -547,7 +547,7 @@ INTFUNC(mmla4) // carry in XMM4, XMM5, and XMM6. The registers XMM0, XMM1, XMM2, // XMM3, and XMM7 are clobbered; the general-purpose registers are // preserved. - stalloc 64 // space for the carries + stalloc 48 // space for the carries endprologue movd xmm4, [edi + 0] @@ -616,7 +616,7 @@ INTFUNC(mmla4) paddq xmm6, [esp + 32] // And, with that, we're done. - stfree 64 + stfree 48 ret ENDFUNC