[Ffmpeg-devel] benchmark of different CABAC routines

Guillaume Poirier gpoirier
Wed Oct 11 17:39:51 CEST 2006


Hi,
Guillaume Poirier wrote:

>Hi,
>
>P4 Northwood 2.40GHz gcc (GCC) 3.3.5 (Debian 1:3.3.5-13) --march=pentium4
>
>I shall test on Prescott/Smithfield later if I have some time
>  
>
svn r6654, Pentium D 3 GHz, x86-64 mode, gcc-4.0, -march=nocona:

BRANCHLESS_CABAC_DECODER, CMOV_IS_FAST 1
7491 dezicycles in decode_residual, 2095551 runs, 1601 skips

BRANCHLESS_CABAC_DECODER, CMOV_IS_FAST 0
7489 dezicycles in decode_residual, 2095555 runs, 1597 skips

undef BRANCHLESS_CABAC_DECODER, CMOV_IS_FAST 1
7409 dezicycles in decode_residual, 2095486 runs, 1666 skips

undef BRANCHLESS_CABAC_DECODER, CMOV_IS_FAST 0
7408 dezicycles in decode_residual, 2095488 runs, 1664 skips

ARCH_X86_DISABLED, BRANCHLESS_CABAC_DECODER
7475 dezicycles in decode_residual, 2095542 runs, 1610 skips

ARCH_X86_DISABLED, undef BRANCHLESS_CABAC_DECODER
7662 dezicycles in decode_residual, 2095480 runs, 1672 skips
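
To make the gaps between these runs easier to compare, here is a small
sketch that turns the dezicycle counts into relative differences. The
helper names (rel_change_pct, print_comparisons) are made up for this
illustration and are not part of FFmpeg; the numbers are copied from the
results above.

```c
#include <assert.h>
#include <stdio.h>

/* Relative change of `value` against `baseline`, in percent.
 * A negative result means `value` needed fewer dezicycles (faster).
 * Hypothetical helper, not an FFmpeg function. */
double rel_change_pct(double baseline, double value)
{
    assert(baseline != 0.0);
    return (value - baseline) / baseline * 100.0;
}

/* Prints a few pairwise comparisons of the measured configurations. */
void print_comparisons(void)
{
    const double asm_branchless = 7491.0; /* BRANCHLESS, CMOV_IS_FAST 1     */
    const double asm_branching  = 7409.0; /* undef BRANCHLESS, CMOV_FAST 1  */
    const double asm_best       = 7408.0; /* undef BRANCHLESS, CMOV_FAST 0  */
    const double c_branchless   = 7475.0; /* ARCH_X86_DISABLED, BRANCHLESS  */
    const double c_branching    = 7662.0; /* ARCH_X86_DISABLED, undef       */

    printf("asm: branchless vs branching: %+.2f%%\n",
           rel_change_pct(asm_branching, asm_branchless));
    printf("branching: asm vs C:          %+.2f%%\n",
           rel_change_pct(c_branching, asm_best));
    printf("C: branchless vs branching:   %+.2f%%\n",
           rel_change_pct(c_branching, c_branchless));
}
```

Calling print_comparisons() from a main() shows differences of roughly
+1.1%, -3.3% and -2.4% respectively, so all configurations lie within
a few percent of each other.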

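So this Pentium 4 still doesn't like branchless code: with the x86
assembly enabled, the branchless decoder is about 80 dezicycles slower
in decode_residual (roughly 7490 vs. 7409). It doesn't seem to gain
anything from cmov either, since CMOV_IS_FAST changes the result by
only 1-2 dezicycles.
However, this time it likes assembly code: the fastest run overall is
the assembly build with the branching decoder (7408, vs. 7662 for the
same decoder with ARCH_X86_DISABLED).

It's really a one-of-a-kind processor.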
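
For readers unfamiliar with the branching vs. branchless distinction,
here is a minimal generic sketch. It is not the CABAC decoder and not
FFmpeg code; the function names are hypothetical, and the exact effect
of the macros named in the results is not shown here. It only
illustrates the two styles of selecting between two values.

```c
#include <assert.h>
#include <stdint.h>

/* Branching variant: written as an if/else, which a compiler may turn
 * into a conditional jump. Its cost then depends on how well the CPU
 * predicts the branch. */
int32_t select_branching(int cond, int32_t a, int32_t b)
{
    if (cond)
        return a;
    return b;
}

/* Branchless variant: build an all-ones or all-zeros mask from the
 * condition and blend the two values with bitwise operations.
 * The final cast back to int32_t assumes the usual two's complement
 * conversion, which common compilers provide. */
int32_t select_branchless(int cond, int32_t a, int32_t b)
{
    uint32_t mask = 0u - (uint32_t)(cond != 0); /* 0xFFFFFFFF or 0 */
    return (int32_t)(((uint32_t)a & mask) | ((uint32_t)b & ~mask));
}

/* Sanity check: both variants must return the same value. */
void check_select_variants_agree(void)
{
    assert(select_branching(1, 7, -3) == select_branchless(1, 7, -3));
    assert(select_branching(0, 7, -3) == select_branchless(0, 7, -3));
    assert(select_branching(5, -1, 0) == select_branchless(5, -1, 0));
}
```

Note that a compiler is free to compile either variant to a jump, a
cmov, or mask arithmetic, so which style wins on a given CPU has to be
measured, as the numbers above do.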

Guillaume



