symm/{chacha,salsa20}-{arm64,arm-neon}.S: Improve rotation code.
authorMark Wooding <mdw@distorted.org.uk>
Fri, 8 Mar 2024 03:17:45 +0000 (03:17 +0000)
committerMark Wooding <mdw@distorted.org.uk>
Fri, 8 Mar 2024 03:17:45 +0000 (03:17 +0000)
commit717b28abfbfdfb49d1c676b75661dffec3e60911
tree77892e000ab60c16f88118936f67cad00b4baa4a
parentff1e7afff01b944b333b4dc1e5844aa35c41370a
symm/{chacha,salsa20}-{arm64,arm-neon}.S: Improve rotation code.

Apparently I was asleep when I read the architecture reference because I
missed the `sri' instruction and how it can be used to synthesize
rotations with only two instructions rather than three.

Also replace rotation by 16 with the obvious `rev32'.
symm/chacha-arm-neon.S
symm/chacha-arm64.S
symm/salsa20-arm-neon.S
symm/salsa20-arm64.S