[FFmpeg-trac] #5568(swscale:open): POWER8 VSX vectorization libswscale/swscale.c
FFmpeg
trac at avcodec.org
Tue Jun 19 23:23:46 EEST 2018
#5568: POWER8 VSX vectorization libswscale/swscale.c
-------------------------------------+-----------------------------------
Reporter: edelsohn | Owner:
Type: enhancement | Status: open
Priority: wish | Component: swscale
Version: git-master | Resolution:
Keywords: bounty vsx | Blocked By:
Blocking: | Reproduced by developer: 0
Analyzed by developer: 0 |
-------------------------------------+-----------------------------------
Comment (by cehoyos):
Replying to [comment:11 bookmoons]:
> Hi guys. I have something toward a vectorized hScale8To15_c from
swscale.c. Profiling with callgrind
(You are expected to test with FFmpeg's TIMER macros unless this does not
work for some reason.)
> shows it's a little faster than the unoptimized version, but not as fast
as the extant altivec version. Hope to figure out what's causing the
difference and improve it.
>
> Unoptimized - 11,140,574
> VSX optimized - 9,670,008
> Altivec optimized - 3,511,966
>
> Does this seem like the right direction?
The direction may be all right but to be accepted in the codebase the
speed has to be improved significantly.
If you are interested in the bounty, make sure to first optimize one
function to learn about our requirements: A patch that unfortunately
contained a lot of work was rejected because it only offered minimal speed
improvements.
--
Ticket URL: <https://trac.ffmpeg.org/ticket/5568#comment:12>
FFmpeg <https://ffmpeg.org>
FFmpeg issue tracker
More information about the FFmpeg-trac
mailing list