[FFmpeg-devel] [PATCH 02/23] vp3/x86: use full transpose for all IDCTs.

Michael Niedermayer michaelni at gmx.at
Tue Mar 12 23:01:00 CET 2013


On Tue, Mar 12, 2013 at 02:54:10PM -0700, Ronald S. Bultje wrote:
> Hi,
> 
> On Tue, Mar 12, 2013 at 11:56 AM, Michael Niedermayer <michaelni at gmx.at> wrote:
> > On Tue, Mar 12, 2013 at 07:28:12AM -0700, Ronald S. Bultje wrote:
> >> From: "Ronald S. Bultje" <rsbultje at gmail.com>
> >>
> >> This way, the special IDCT permutations are no longer needed. Bfin code
> >> is disabled until someone updates it. This is similar to how H264 does
> >> it, and removes the dsputil dependency imposed by the scantable code.
> >
> > does this have any speed/performace effect ?
> 
> sse2 idct_add goes from 135 to 125 cycles. Overall decode time doesn't
> change much, 2.75 seconds both before and after on first 1000 frames
> of big buck bunny 720p (1 thread).
> 
> mmx probably has a similar speedup. I don't expect C performance to
> change at all.

ok, applied

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

In a rich man's house there is no place to spit but his face.
-- Diogenes of Sinope
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 198 bytes
Desc: Digital signature
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20130312/b13d6562/attachment.asc>


More information about the ffmpeg-devel mailing list