[FFmpeg-devel] [PATCH 1/7] lavc/me_cmp: R-V V pix_abs
Rémi Denis-Courmont
remi at remlab.net
Sat Feb 10 11:14:11 EET 2024
Le perjantaina 9. helmikuuta 2024, 17.34.40 EET flow gg a écrit :
> The issue here is that any load greater than e8 will fail the test(Bus
> error), so it cannot use vlse64 or similar methods...
AFAICT, data is aligned on 16 bytes here, so using larger element sizes should
not be a problem. That being the case, you can load pretty much any power-of-
two byte quantity per row up to 512 bits, as 8 segments of 64-bit elements.
That is more than enough to deal with 16-byte rows.
Of course, that results in a tiled data layout, so it only works if individual
elements are all treated equally with no cross-row calculations. This might
require trickery or not work at all for those functions that subtract adjacent
values. But your patchset seems to leave those out anyway.
--
Rémi Denis-Courmont
http://www.remlab.net/
More information about the ffmpeg-devel
mailing list