[FFmpeg-devel] [PATCH] Faster SSE FFT/MDCT

Michael Niedermayer michaelni
Sat May 12 20:37:26 CEST 2007


Hi

On Fri, May 11, 2007 at 04:51:21PM +0800, Zuxy Meng wrote:
> Hi,
> 
> 2007/5/11, Zuxy Meng <zuxy.meng at gmail.com>:
> >Hi,
> >
> >The patch unrolls some loops, utilizing all 8 xmm registers. fft-test
> >shows ~10% speed up in (I)FFT and ~5% speed up in (I)MDCT on my
> >Dothan. Of course with x86-64 we can unroll one more time but I don't
> >have a test bench....
> >
> >Full test passed on x86, and a test on x86-64 would be prudent: I used
> >xmm8 to save a memory access.
> 
> I just unrolled another loop in imdct. Now IMDCT is about ~8% faster
> compared to SVN head. Please ignore the last patch and try this one
> instead:-)

looks ok if it passes regression tests 

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

There will always be a question for which you do not know the correct awnser.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://lists.mplayerhq.hu/pipermail/ffmpeg-devel/attachments/20070512/49e40ff0/attachment.pgp>



More information about the ffmpeg-devel mailing list