[Ffmpeg-devel] [PATCH] Snow mmx+sse2 asm optimizations

Oded Shimon ods15
Fri Mar 17 06:12:31 CET 2006


On Thu, Mar 16, 2006 at 11:20:22PM -0500, Robert Edele wrote:
> On Thu, 2006-03-16 at 09:55 -0500, Robert Edele wrote:
> > I'll post the next tranche of the patch once it's committed, which shall
> > be switching the obmc arrays to use 8 bits instead of 6 (but keeping 6-
> > bit precision) along with a few bugfixes by pengvado to allow this to
> > happen. The purpose of the patch is that it simplifies every asm
> > implementation that I've seen so far (and the C) by 1 or 2 instructions
> > per innermost loop.
> > 
> Part 2 of my patch is ready. Passes regression test. Would anyone mind
> reviewing it?

> +  0,  0,  0,  0,  4,  4,  4,  4,  4,  4,  4,  4,  8,  8,  8,  8,  8,  8,  8,  8,  4,  4,  4,  4,  4,  4,  4,  4,  0,  0,  0,  0,
> +/*
>   0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0,
>   0, 1, 1, 1, 2, 2, 2, 3, 3, 4, 4, 4, 5, 5, 5, 6, 6, 5, 5, 5, 4, 4, 4, 3, 3, 2, 2, 2, 1, 1, 1, 0,
>   0, 1, 2, 2, 3, 3, 4, 5, 5, 6, 7, 7, 8, 8, 9,10,10, 9, 8, 8, 7, 7, 6, 5, 5, 4, 3, 3, 2, 2, 1, 0,
> @@ -262,9 +295,27 @@
>   0, 1, 2, 2, 3, 3, 4, 5, 5, 6, 7, 7, 8, 8, 9,10,10, 9, 8, 8, 7, 7, 6, 5, 5, 4, 3, 3, 2, 2, 1, 0,
>   0, 1, 1, 1, 2, 2, 2, 3, 3, 4, 4, 4, 5, 5, 5, 6, 6, 5, 5, 5, 4, 4, 4, 3, 3, 2, 2, 2, 1, 1, 1, 0,
>   0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0,
> +*/

IMO just remove the old values.

- ods15





More information about the ffmpeg-devel mailing list