[FFmpeg-devel] [PATCH] NEON add_signed_pixels_clamped

Måns Rullgård mans
Sat Apr 4 15:27:19 CEST 2009


David Conrad <lessen42 at gmail.com> writes:

> Hi,
>
> 3% overall wmv3 decoding speedup.
>
> Also, is it possible to have something like
>
> .macro reg=2
> d\(\reg*2)
>
> evaluate to d4? Or any other ideas to put the repeated sections in a
> macro that isn't ugly?

I don't think there is a simple way, but read on.

The attached version is 1 cycle faster on Cortex-A8, 5 cycles faster
on A9.  Unfortunately it's somewhat more difficult to read than the
original.  That's the price you pay for speed.

-- 
M?ns Rullg?rd
mans at mansr.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: add_pixels_clamped_neon.patch
Type: text/x-patch
Size: 3313 bytes
Desc: not available
URL: <http://lists.mplayerhq.hu/pipermail/ffmpeg-devel/attachments/20090404/c2a3e863/attachment.bin>



More information about the ffmpeg-devel mailing list