[FFmpeg-devel] [PATCH 2/2] x86/vp9: inital AVX2 intra_pred

Michael Niedermayer michaelni at gmx.at
Sun Jun 8 03:02:05 CEST 2014


On Mon, May 19, 2014 at 12:02:21AM -0300, James Almer wrote:
> tos3k-vp9-b10000.webm on a Core i5-4200U @1.6GHz
> 
> 1219 decicycles in ff_vp9_ipred_dc_32x32_ssse3, 131070 runs, 2 skips
> 439 decicycles in ff_vp9_ipred_dc_32x32_avx2, 131070 runs, 2 skips
> 
> 3570 decicycles in ff_vp9_ipred_dc_top_32x32_ssse3, 4096 runs, 0 skips
> 2494 decicycles in ff_vp9_ipred_dc_top_32x32_avx2, 4096 runs, 0 skips
> 
> 1419 decicycles in ff_vp9_ipred_dc_left_32x32_ssse3, 16384 runs, 0 skips
> 717 decicycles in ff_vp9_ipred_dc_left_32x32_avx2, 16384 runs, 0 skips
> 
> 2737 decicycles in ff_vp9_ipred_tm_32x32_avx, 1024 runs, 0 skips
> 2088 decicycles in ff_vp9_ipred_tm_32x32_avx2, 1024 runs, 0 skips
> 
> 3090 decicycles in ff_vp9_ipred_v_32x32_avx, 512 runs, 0 skips
> 2226 decicycles in ff_vp9_ipred_v_32x32_avx2, 512 runs, 0 skips
> 
> 1565 decicycles in ff_vp9_ipred_h_32x32_avx, 1024 runs, 0 skips
> 922 decicycles in ff_vp9_ipred_h_32x32_avx2, 1024 runs, 0 skips
> 
> Signed-off-by: James Almer <jamrial at gmail.com>
> ---
>  libavcodec/x86/constants.c      |   8 +-
>  libavcodec/x86/constants.h      |   4 +-
>  libavcodec/x86/vp9dsp_init.c    |  16 ++++
>  libavcodec/x86/vp9intrapred.asm | 157 +++++++++++++++++++++++++++++++++++++++-
>  libavutil/x86/asm.h             |   1 +
>  5 files changed, 177 insertions(+), 9 deletions(-)

applied

thanks

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Complexity theory is the science of finding the exact solution to an
approximation. Benchmarking OTOH is finding an approximation of the exact
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 181 bytes
Desc: Digital signature
URL: <https://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20140608/3ba8f0f6/attachment.asc>


More information about the ffmpeg-devel mailing list