[FFmpeg-cvslog] commit: H.264: split luma dc idct out and implement MMX/SSE2 versions ( Jason Garrett-Glaser )

git at videolan.org git
Fri Jan 14 23:11:16 CET 2011


ffmpeg | branch: master | Jason Garrett-Glaser <darkshikari at gmail.com> | Fri Jan 14 21:34:25 2011 +0000| [19fb234e4af1ff9f58ff2fdd604ac6f6bb87ad6b] | committer: Jason Garrett-Glaser 

H.264: split luma dc idct out and implement MMX/SSE2 versions
About 2.5x the speed.

NOTE: the way the asm code handles large qmul values is somewhat suboptimal.
If x264-style dequant were used (separate shift and qmul values), it might
be possible to get some extra speed.
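
For readers unfamiliar with the operation, here is a minimal C sketch of a
luma DC dequant + inverse transform with a single combined multiplier,
alongside a split shift/qmul dequant step of the kind the note alludes to.
It is not the code from this commit: the function names, the 8-bit
fixed-point scaling of qmul, and the rounding constants are assumptions
made for illustration only.

```c
#include <stdint.h>

/* Hypothetical helper: 1-D 4-point Hadamard butterfly using the matrix
 * [1 1 1 1; 1 1 -1 -1; 1 -1 -1 1; 1 -1 1 -1]. */
static void hadamard4_sketch(int y[4], int a, int b, int c, int d)
{
    const int e0 = a + b, e1 = a - b, e2 = c + d, e3 = c - d;
    y[0] = e0 + e2;
    y[1] = e0 - e2;
    y[2] = e1 - e3;
    y[3] = e1 + e3;
}

/* Hypothetical sketch: 4x4 inverse Hadamard on the 16 luma DC values,
 * then dequant with one combined multiplier. Assumes qmul carries 8
 * fractional bits and that >> on negative ints is arithmetic. */
static void luma_dc_dequant_idct_sketch(int16_t out[16],
                                        const int16_t in[16], int qmul)
{
    int tmp[16], y[4];

    for (int i = 0; i < 4; i++) {          /* transform each row */
        hadamard4_sketch(y, in[4*i], in[4*i+1], in[4*i+2], in[4*i+3]);
        for (int j = 0; j < 4; j++)
            tmp[4*i + j] = y[j];
    }
    for (int j = 0; j < 4; j++) {          /* transform each column */
        hadamard4_sketch(y, tmp[j], tmp[4+j], tmp[8+j], tmp[12+j]);
        for (int i = 0; i < 4; i++)
            out[4*i + j] = (int16_t)((y[i] * qmul + 128) >> 8);
    }
}

/* Hypothetical split form: a small qmul plus a separate signed shift,
 * so the multiplier itself never has to grow large. */
static int dequant_split_sketch(int v, int qmul, int shift)
{
    if (shift >= 0)
        return (v * qmul) << shift;
    return (v * qmul + (1 << (-shift - 1))) >> -shift;
}
```

With the split form the multiplier stays within a narrow range, which is
presumably what would let SIMD code use 16-bit multiplies throughout; that
reading of the note is an assumption, not something stated in the commit.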

Originally committed as revision 26336 to svn://svn.ffmpeg.org/ffmpeg/trunk

> http://git.videolan.org/gitweb.cgi/ffmpeg.git/?a=commit;h=19fb234e4af1ff9f58ff2fdd604ac6f6bb87ad6b
---

 libavcodec/dsputil.h         |    4 +
 libavcodec/h264.c            |   50 ++-----------
 libavcodec/h264.h            |    5 +-
 libavcodec/h264_cabac.c      |    8 +-
 libavcodec/h264_cavlc.c      |    8 +-
 libavcodec/h264dsp.c         |    1 +
 libavcodec/h264dsp.h         |    2 +
 libavcodec/h264idct.c        |   35 ++++++++++
 libavcodec/svq3.c            |   20 +++---
 libavcodec/x86/dsputil_mmx.c |    1 +
 libavcodec/x86/h264_idct.asm |  154 ++++++++++++++++++++++++++++++++++++++++++
 libavcodec/x86/h264dsp_mmx.c |    4 +
 12 files changed, 227 insertions(+), 65 deletions(-)

Diff:   http://git.videolan.org/gitweb.cgi/ffmpeg.git/?a=commitdiff;h=19fb234e4af1ff9f58ff2fdd604ac6f6bb87ad6b


