[FFmpeg-devel] [PATCH] Faster SSE FFT/MDCT

Zuxy Meng zuxy.meng
Sun May 13 11:41:21 CEST 2007


Hi,

2007/5/13, Michael Niedermayer <michaelni at gmx.at>:
> Hi
>
> On Fri, May 11, 2007 at 04:51:21PM +0800, Zuxy Meng wrote:
> > Hi,
> >
> > 2007/5/11, Zuxy Meng <zuxy.meng at gmail.com>:
> > >Hi,
> > >
> > >The patch unrolls some loops, utilizing all 8 xmm registers. fft-test
> > >shows ~10% speed up in (I)FFT and ~5% speed up in (I)MDCT on my
> > >Dothan. Of course with x86-64 we can unroll one more time but I don't
> > >have a test bench....
> > >
> > >Full test passed on x86, and a test on x86-64 would be prudent: I used
> > >xmm8 to save a memory access.
> >
> > I just unrolled another loop in imdct. IMDCT is now ~8% faster
> > than SVN head. Please ignore the last patch and try this one
> > instead :-)
>
> looks ok if it passes regression tests

Yes, it did, same as the original patch :-)
-- 
Zuxy
Beauty is truth,
While truth is beauty.
PGP KeyID: E8555ED6



