[FFmpeg-devel] [PATCH 2/2] x86/vp9: initial AVX2 intra_pred
Michael Niedermayer
michaelni at gmx.at
Sun Jun 8 03:02:05 CEST 2014
On Mon, May 19, 2014 at 12:02:21AM -0300, James Almer wrote:
> tos3k-vp9-b10000.webm on a Core i5-4200U @1.6GHz
>
> 1219 decicycles in ff_vp9_ipred_dc_32x32_ssse3, 131070 runs, 2 skips
> 439 decicycles in ff_vp9_ipred_dc_32x32_avx2, 131070 runs, 2 skips
>
> 3570 decicycles in ff_vp9_ipred_dc_top_32x32_ssse3, 4096 runs, 0 skips
> 2494 decicycles in ff_vp9_ipred_dc_top_32x32_avx2, 4096 runs, 0 skips
>
> 1419 decicycles in ff_vp9_ipred_dc_left_32x32_ssse3, 16384 runs, 0 skips
> 717 decicycles in ff_vp9_ipred_dc_left_32x32_avx2, 16384 runs, 0 skips
>
> 2737 decicycles in ff_vp9_ipred_tm_32x32_avx, 1024 runs, 0 skips
> 2088 decicycles in ff_vp9_ipred_tm_32x32_avx2, 1024 runs, 0 skips
>
> 3090 decicycles in ff_vp9_ipred_v_32x32_avx, 512 runs, 0 skips
> 2226 decicycles in ff_vp9_ipred_v_32x32_avx2, 512 runs, 0 skips
>
> 1565 decicycles in ff_vp9_ipred_h_32x32_avx, 1024 runs, 0 skips
> 922 decicycles in ff_vp9_ipred_h_32x32_avx2, 1024 runs, 0 skips
>
> Signed-off-by: James Almer <jamrial at gmail.com>
> ---
> libavcodec/x86/constants.c | 8 +-
> libavcodec/x86/constants.h | 4 +-
> libavcodec/x86/vp9dsp_init.c | 16 ++++
> libavcodec/x86/vp9intrapred.asm | 157 +++++++++++++++++++++++++++++++++++++++-
> libavutil/x86/asm.h | 1 +
> 5 files changed, 177 insertions(+), 9 deletions(-)
applied
thanks
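
For anyone skimming the numbers quoted above, the relative gains can be worked out with a few lines of Python. This is only a rough sketch: the decicycle counts are copied from the quoted benchmark, the dictionary keys and the speedup() helper are names made up for illustration, and the baseline differs per function (ssse3 for the dc variants, avx for tm/v/h), so the ratios are not directly comparable across rows.

```python
# Decicycle counts copied from the quoted benchmark: (baseline, avx2).
# Baseline is ssse3 for the dc* functions and avx for tm/v/h.
# Keys and helper names are illustrative only, not FFmpeg identifiers.
timings = {
    "dc_32x32": (1219, 439),
    "dc_top_32x32": (3570, 2494),
    "dc_left_32x32": (1419, 717),
    "tm_32x32": (2737, 2088),
    "v_32x32": (3090, 2226),
    "h_32x32": (1565, 922),
}


def speedup(baseline, avx2):
    """Return how many times faster the AVX2 version is than the baseline."""
    return baseline / avx2


speedups = {name: speedup(b, a) for name, (b, a) in timings.items()}

for name, s in speedups.items():
    print(f"{name}: {s:.2f}x")
```

Keep in mind that these are single-run START_TIMER style measurements with very different run counts per function, so the ratios are at best indicative.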
[...]
--
Michael GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB
Complexity theory is the science of finding the exact solution to an
approximation. Benchmarking OTOH is finding an approximation of the exact