[FFmpeg-devel] [WIP] [PATCH 0/5] sse2/avx functions for 8-bit simple_idct

James Darnley jdarnley at obe.tv
Sat Jun 10 14:46:39 EEST 2017

So here is my second work-in-progress patch set.  These are the functions from
the second half of my previous set.

With Ronald's rounding contribution I am much closer to finishing.  As I
mentioned on IRC, I think the speed of decoding an MPEG-2 HD sample has
increased from 215 to 235 fps (roughly 9%).

I still have a small optimisation to make and I need to use the correct
coefficients.  This will require a large change to the macros.  I am sending
this so that people can nitpick my changes.

James Darnley (5):
  avcodec/x86: cleanup simple_idct10
  avcodec/x86: add x86-64 8-bit simple_idct function
  more cleanup
  avcodec/x86: add x86-64 8-bit simple_idct put function
  avcodec/x86: add x86-64 8-bit simple_idct add function

 libavcodec/tests/x86/dct.c                |   2 +
 libavcodec/x86/idctdsp_init.c             |  23 +++++++
 libavcodec/x86/proresdsp.asm              |   2 +-
 libavcodec/x86/simple_idct.h              |   9 +++
 libavcodec/x86/simple_idct10.asm          | 100 ++++++++++++++++++++++++++++--
 libavcodec/x86/simple_idct10_template.asm |  80 +++++++++++-------------
 6 files changed, 167 insertions(+), 49 deletions(-)
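
For readers unfamiliar with the "put" and "add" variants named in the patch
titles above, here is a minimal C sketch of how the two output steps usually
differ once the inverse transform has been applied to an 8x8 block.  It is not
taken from the patches: the function names, the 8x8 row-major block layout and
the stride parameter are assumptions made for illustration, and the real
functions are written in x86 assembly.

```c
#include <stddef.h>
#include <stdint.h>

/* Clamp an int to the 8-bit pixel range [0, 255]. */
uint8_t clip_uint8(int v)
{
    return v < 0 ? 0 : v > 255 ? 255 : (uint8_t)v;
}

/* "put" (hypothetical name): overwrite the 8x8 destination area with the
 * clamped, already inverse-transformed block. */
void put_block_8x8(uint8_t *dest, ptrdiff_t stride, const int16_t *block)
{
    for (int y = 0; y < 8; y++)
        for (int x = 0; x < 8; x++)
            dest[y * stride + x] = clip_uint8(block[y * 8 + x]);
}

/* "add" (hypothetical name): add the already inverse-transformed block to
 * the pixels that are in the destination and clamp the sum. */
void add_block_8x8(uint8_t *dest, ptrdiff_t stride, const int16_t *block)
{
    for (int y = 0; y < 8; y++)
        for (int x = 0; x < 8; x++)
            dest[y * stride + x] =
                clip_uint8(dest[y * stride + x] + block[y * 8 + x]);
}
```

In a decoder the put form would typically serve intra blocks, where the
transformed data is the picture itself, and the add form would serve inter
blocks, where it is a residual on top of a prediction.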
