[FFmpeg-devel] [PATCH 07/10] x86: float dsp: unroll SSE versions

Michael Niedermayer michaelni at gmx.at
Sat Feb 15 19:01:06 CET 2014


On Fri, Feb 14, 2014 at 04:00:51PM +0000, Christophe Gisquet wrote:
> vector_fmul and vector_fmac_scalar are guaranteed that they can process in
> batch of 16 elements, but their SSE versions only does 8 at a time.
> 
> Therefore, unroll them a bit.
> 299 to 261c for 256 elements in vector_fmac_scalar on Arrandale/Win64.
> ---
>  libavutil/x86/float_dsp.asm | 40 ++++++++++++++++++++++++----------------
>  1 file changed, 24 insertions(+), 16 deletions(-)

applied

thanks

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

He who knows, does not speak. He who speaks, does not know. -- Lao Tsu
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 198 bytes
Desc: Digital signature
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20140215/b604206d/attachment.asc>


More information about the ffmpeg-devel mailing list