[FFmpeg-devel] [PATCH] PPC64: Add versions of functions in libswscale/input.c optimized for POWER8 VSX SIMD.

Dan Parrot dan.parrot at mail.com
Tue Jul 5 07:31:02 EEST 2016


On Mon, 2016-07-04 at 09:20 +0000, Carl Eugen Hoyos wrote:
> Dan Parrot <dan.parrot <at> mail.com> writes:
> 
> > The dataset used was the entire FATE regression suite.
> 
> I don't think this is a particularly useful testcase:
> It takes very long but mostly tests other things.
> 
> Did you test if using ffmpeg -benchmark -f rawvideo -i /dev/zero... 
> showed different results?
> I believe this should be both easier and faster to test.
> 
> > name: rgb24ToY_c_vsx. 
> > no. of calls: 9999. min: 3832 ns. avg: 4709 ns. max: 37550 ns. 
> > total: 47093533 ns. 
> > 
> > name: rgb24ToY_c. 
> > no. of calls: 9999. min: 3809 ns. avg: 4707 ns. max: 29041 ns. 
> > total: 47072923 ns.
> 
> Without any data, I would have thought that this is the most 
> important function (and "no. of calls" seems to confirm this).
> 
> Why is this not faster?
> Can you confirm with START_TIMER / STOP_TIMER that there is no 
> gain?
> 
> Thank you, Carl Eugen

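For reference, per-call figures of the kind quoted above (no. of calls, min, avg, max and total in ns) can be gathered with a small harness along the following lines. This is only a sketch and not the code that produced the numbers in this thread: TimingStats, now_ns, timing_add, timing_report, time_calls and fake_rgb24_to_y are hypothetical names, the conversion is a plain C stand-in rather than libswscale's rgb24ToY_c, and it uses POSIX clock_gettime() instead of FFmpeg's START_TIMER / STOP_TIMER macros.

```c
#define _POSIX_C_SOURCE 199309L  /* for clock_gettime() under strict -std= modes */
#include <assert.h>
#include <inttypes.h>
#include <stdint.h>
#include <stdio.h>
#include <time.h>

#define WIDTH 1920  /* assumed line width in pixels, chosen for illustration */

/* Running statistics for one timed function (hypothetical helper type). */
typedef struct TimingStats {
    const char *name;
    uint64_t calls, min_ns, max_ns, total_ns;
} TimingStats;

/* Monotonic wall-clock time in nanoseconds. */
static uint64_t now_ns(void)
{
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return (uint64_t)ts.tv_sec * 1000000000ULL + (uint64_t)ts.tv_nsec;
}

/* Fold one measured call duration into the running statistics. */
static void timing_add(TimingStats *s, uint64_t ns)
{
    if (s->calls == 0 || ns < s->min_ns)
        s->min_ns = ns;
    if (ns > s->max_ns)
        s->max_ns = ns;
    s->total_ns += ns;
    s->calls++;
}

/* Print the statistics in the same layout as the figures quoted above. */
static void timing_report(const TimingStats *s)
{
    assert(s->calls > 0);
    printf("name: %s.\n", s->name);
    printf("no. of calls: %" PRIu64 ". min: %" PRIu64 " ns. avg: %" PRIu64
           " ns. max: %" PRIu64 " ns.\n",
           s->calls, s->min_ns, s->total_ns / s->calls, s->max_ns);
    printf("total: %" PRIu64 " ns.\n", s->total_ns);
}

/* Stand-in for the function under test: integer luma approximation,
 * NOT the libswscale implementation. Weights sum to 256. */
static void fake_rgb24_to_y(uint8_t *dst, const uint8_t *src, int width)
{
    for (int i = 0; i < width; i++) {
        const uint8_t *p = src + 3 * i;
        dst[i] = (uint8_t)((77 * p[0] + 150 * p[1] + 29 * p[2]) >> 8);
    }
}

/* Time `calls` invocations of the stand-in on one zeroed line of pixels. */
static TimingStats time_calls(const char *name, int calls)
{
    static uint8_t src[3 * WIDTH], dst[WIDTH];
    volatile uint8_t sink = 0;  /* keeps the compiler from dropping the work */
    TimingStats s = { name, 0, 0, 0, 0 };

    for (int i = 0; i < calls; i++) {
        uint64_t t0 = now_ns();
        fake_rgb24_to_y(dst, src, WIDTH);
        timing_add(&s, now_ns() - t0);
        sink ^= dst[i % WIDTH];
    }
    (void)sink;
    return s;
}
```

Calling time_calls("fake_rgb24_to_y", 9999) and passing the result to timing_report() prints one block in the layout shown in the quoted figures. Timing the C and VSX versions on identical input in the same process would make the two blocks directly comparable.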