[FFmpeg-devel] [PATCH] VP8 V simple loopfilter in MMX/MMX2/SSE2

Ronald S. Bultje rsbultje
Fri Jul 2 23:07:24 CEST 2010


Hi,

On Thu, Jul 1, 2010 at 11:10 AM, Ronald S. Bultje <rsbultje at gmail.com> wrote:
> On Thu, Jul 1, 2010 at 10:46 AM, Ronald S. Bultje <rsbultje at gmail.com> wrote:
>> On Thu, Jul 1, 2010 at 10:32 AM, Ronald S. Bultje <rsbultje at gmail.com> wrote:
>>> see attached, my first try at doing a loopfilter in SIMD.
>>>
>>> C takes about 660 cycles for the main MB one of 1760 for the 3
>>> together in the splitmv case. MMX didn't really measure since it's
>>> only 1 instruction difference as per MMX2 (I just tested that they
>>> gave identical output). MMX2 takes 190/350 cycles, SSE2 takes 180/330
>>> cycles (which is weird, should be faster, but who knows what my crappy
>>> CPU is doing, this machine is 5 yrs old - Intel Core Duo 2GHz on a
>>> MacBook Pro).
>>
>> Now with vp8dsp-init.c changes also.
>
> Now with proper alignment for constants, thanks to Vitor for noticing.

H loop filter changes attached separately.

Ronald
-------------- next part --------------
A non-text attachment was scrubbed...
Name: vp8_h_loopfilter_simple.patch
Type: application/octet-stream
Size: 7518 bytes
Desc: not available
URL: <http://lists.mplayerhq.hu/pipermail/ffmpeg-devel/attachments/20100702/a58f3edb/attachment.obj>



More information about the ffmpeg-devel mailing list