[FFmpeg-cvslog] r16207 - trunk/libavcodec/h264.c

Michael Niedermayer michaelni
Thu Dec 18 04:25:28 CET 2008


On Thu, Dec 18, 2008 at 02:57:17AM +0000, Måns Rullgård wrote:
> michael <subversion at mplayerhq.hu> writes:
> 
> > Author: michael
> > Date: Thu Dec 18 03:53:18 2008
> > New Revision: 16207
> >
> > Log:
> > Use the new idct functions (except chroma as it was slower in benchmarks)
> > cathedral +0.5% speed
> > aladin +0.6% speed [note aladin has been cat-ed 10 times to reduce the influence
> > of init time]
> > Speedup also verified via START/STOP_TIMER (difference was very significant
> > for the changed parts)
> 
> How much does this hurt on architectures that don't yet have the new
> SIMD functions?

There are no genuinely new SIMD functions.
I just moved loops like
for(i=0; i<16; i++)
    dsp->idct4x4_add(blah blah);

into dsputil so they are

for(i=0; i<16; i++)
    idct4x4_add_simdwhatever(blah blah);

That way gcc can inline the function, which avoids up to 15 calls through dsp->
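
To make the difference concrete, here is a minimal, self-contained sketch of the two call patterns. Everything in it is hypothetical and for illustration only: ToyDSP, toy_idct4x4_add, toy_idct4x4_add16 and the decode_mb_* helpers are made-up names, and the "idct" is a trivial stand-in, not the real dsputil code or its signatures. It only shows 16 indirect calls being replaced by one indirect call whose loop sits next to a kernel the compiler is free to inline.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for a 4x4 "idct + add" kernel. It is NOT a real
 * IDCT; it just adds each coefficient to the matching destination pixel. */
static void toy_idct4x4_add(uint8_t *dst, const int16_t *block, int stride)
{
    for (int y = 0; y < 4; y++)
        for (int x = 0; x < 4; x++)
            dst[y * stride + x] = (uint8_t)(dst[y * stride + x] + block[y * 4 + x]);
}

/* Hypothetical dispatch table, loosely modelled on a DSP context. */
typedef struct ToyDSP {
    void (*idct4x4_add)(uint8_t *dst, const int16_t *block, int stride);
    void (*idct4x4_add16)(uint8_t *dst, const int *offsets,
                          const int16_t *blocks, int stride);
} ToyDSP;

/* Before: the caller owns the loop, so there are 16 calls through dsp->. */
static void decode_mb_old(const ToyDSP *dsp, uint8_t *dst, const int *offsets,
                          const int16_t *blocks, int stride)
{
    assert(dsp && dsp->idct4x4_add);
    for (int i = 0; i < 16; i++)
        dsp->idct4x4_add(dst + offsets[i], blocks + i * 16, stride);
}

/* After: the loop lives beside the kernel. The inner call is direct and
 * static, so the compiler may inline it. */
static void toy_idct4x4_add16(uint8_t *dst, const int *offsets,
                              const int16_t *blocks, int stride)
{
    for (int i = 0; i < 16; i++)
        toy_idct4x4_add(dst + offsets[i], blocks + i * 16, stride);
}

/* After: a single call through dsp-> per macroblock. */
static void decode_mb_new(const ToyDSP *dsp, uint8_t *dst, const int *offsets,
                          const int16_t *blocks, int stride)
{
    assert(dsp && dsp->idct4x4_add16);
    dsp->idct4x4_add16(dst, offsets, blocks, stride);
}

/* Runs both variants on a 16x16 buffer; returns 1 if the outputs match. */
static int toy_variants_agree(void)
{
    const ToyDSP dsp = { toy_idct4x4_add, toy_idct4x4_add16 };
    const int stride = 16;
    int16_t blocks[16 * 16];
    int offsets[16];
    uint8_t a[16 * 16], b[16 * 16];

    for (int i = 0; i < 16 * 16; i++)
        blocks[i] = (int16_t)(i % 7);
    for (int i = 0; i < 16; i++)   /* 4x4 grid of 4x4 blocks */
        offsets[i] = (i / 4) * 4 * stride + (i % 4) * 4;
    memset(a, 10, sizeof a);
    memset(b, 10, sizeof b);

    decode_mb_old(&dsp, a, offsets, blocks, stride);
    decode_mb_new(&dsp, b, offsets, blocks, stride);
    return memcmp(a, b, sizeof a) == 0;
}
```

Both variants compute the same thing; the only change is where the loop sits. Whether the inner call actually gets inlined depends on the compiler and optimization flags.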

Adding support for this to your favorite architecture is a matter of copy
& paste and adjusting the function names.
Of course one could write the loop in asm, and I am sure it would be faster,
but I didn't do this ...
Also, this is all a little new and I cannot yet guarantee that the API is
stable. I do not have any plans to change it, but I might stumble
across further possible improvements ...


[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

Observe your enemies, for they first find out your faults. -- Antisthenes


