[Ffmpeg-devel] a little optim for a SSE version of H263_LOOP_FILTER

Michael Niedermayer michaelni
Mon Nov 6 15:37:37 CET 2006


Hi

On Mon, Nov 06, 2006 at 09:58:18PM +0800, Zuxy Meng wrote:
> Hi,
> 
> 2006/11/5, skal <skal65535 at orange.fr>:
> >
> > Hi everybody,
> >
> > In case it is of interest, it seems to me an SSE version of
> > H263_LOOP_FILTER is possible by replacing
> >      "psubusb %%mm4, %%mm2           \n\t"\
> >      "movq %%mm2, %%mm3              \n\t"\
> >      "psubusb %%mm4, %%mm3           \n\t"\
> >      "psubb %%mm3, %%mm2             \n\t"\
> > at dsputil_mmx.c:587 (fresh cvs), by:
> >      "psubusb %%mm4, %%mm2           \n\t"\
> >      "pminub %%mm4, %%mm2            \n\t"\
> >
> > plus maybe a little re-org of the loop (mm3 is no longer needed).
> >
> > Well, this is just for the fun of it, since the speed-up
> > (if any) might not be worth a special version...
> 
> What about rewriting it purely in SSE2? I'll try when I have time...

Rather try that with H.264's loop filter please, or some other
H.264 code. H.263's loop filter is not used by many files; actually
H.263 isn't that common.
That said, I am happy about every optimization, no matter which part
of the code, so I am happy about improvements for H.263 too.

[...]
-- 
Michael     GnuPG fingerprint: 9FF2128B147EF6730BADF133611EC787040B0FAB

In the past you could go to a library and read, borrow or copy any book
Today you'd get arrested for mere telling someone where the library is



