[FFmpeg-devel] [PATCH 0/3] Provide neon implementations
Hubert Mazur
hum at semihalf.com
Tue Sep 20 14:01:55 EEST 2022
This fixes issues addressed in previous patchset:
- move sub instruction in vsad8_intra,
- remove unnecessary mov instructions,
- remove single lane extraction in loop and place it at the end.
Removing mov instructions from pix_median_abs functions significantly
increased peformance for both.
Hubert Mazur (3):
lavc/aarch64: Add neon implementation for pix_median_abs16
lavc/aarch64: Add neon implementation for vsad8_intra
lavc/aarch64: Add neon implementation for pix_median_abs8
libavcodec/aarch64/me_cmp_init_aarch64.c | 10 ++
libavcodec/aarch64/me_cmp_neon.S | 182 +++++++++++++++++++++++
libavcodec/me_cmp.c | 5 +-
3 files changed, 195 insertions(+), 2 deletions(-)
--
2.34.1
More information about the ffmpeg-devel
mailing list