[FFmpeg-devel] [PATCH 5/6] x86: lossless audio: SSE4 madd 32bits
christophe.gisquet at gmail.com
Wed Apr 20 08:43:14 CEST 2016
2016-04-20 2:01 GMT+02:00 Ronald S. Bultje <rsbultje at gmail.com>:
> This is typically only an issue if the data came from stack. On win64 as
> well as unix64, the 4th argument never comes from stack but is a direct
> register argument instead.
So no benefit except consistency. I don't mind either way, though.
On the other hand, this hand-coded function improves is only a slight
improvement over gcc's vectorized code, and only because it does a
poor job of it. Probably because the order is small (8) and gcc does
not have enough info on data. So, it's written, but it's not very
More information about the ffmpeg-devel