[FFmpeg-devel] [PATCH 07/10] x86: float dsp: unroll SSE versions

Christophe Gisquet christophe.gisquet at gmail.com
Sat Feb 15 12:14:32 CET 2014


Hi,

2014-02-14 17:00 GMT+01:00 Christophe Gisquet <christophe.gisquet at gmail.com>:
> vector_fmul and vector_fmac_scalar are guaranteed that they can process in
> batch of 16 elements, but their SSE versions only does 8 at a time.
>
> Therefore, unroll them a bit.
> 299 to 261c for 256 elements in vector_fmac_scalar on Arrandale/Win64.

This one should be pretty straightforward.


More information about the ffmpeg-devel mailing list