[FFmpeg-devel] [PATCH] lavc/vp8dsp: R-V V put_vp8_pixels
Rémi Denis-Courmont
remi at remlab.net
Wed Feb 21 20:01:48 EET 2024
Hello,
Le maanantaina 19. helmikuuta 2024, 13.13.43 EET flow gg a écrit :
> The reason for using m1+le8 instead of stride load + larger group
> multipliers is the same as in "[FFmpeg-devel] [PATCH 1/7] lavc/me_cmp: R-V
> V pix_abs."
>
> In the test, there is
>
> #define src (buf + 2 * SRC_BUF_STRIDE + 2 + 1)
>
> Therefore, not using e8 will result : (fatal signal 7: Bus error).
Yes, you could also just say that alignment is insufficient :)
It is still possible to load rectangles of up to 8 columns using vlseg8e8, but
it might be slower than just repeating the 8 regular loads, and it won't work
if you need calculations between rows.
I may be missing something but I don't understand what purpose the header file
serves here?
--
Rémi Denis-Courmont
http://www.remlab.net/
More information about the ffmpeg-devel
mailing list