[FFmpeg-devel] [PATCH] enable auto vectorization for gcc 7 and higher
Soft Works
softworkz at hotmail.com
Wed Jul 27 21:54:36 EEST 2022
> -----Original Message-----
> From: ffmpeg-devel <ffmpeg-devel-bounces at ffmpeg.org> On Behalf Of
> Swinney, Jonathan
> Sent: Wednesday, July 27, 2022 7:35 PM
> To: ffmpeg-devel at ffmpeg.org
> Subject: [FFmpeg-devel] [PATCH] enable auto vectorization for gcc 7
> and higher
>
> I recognize that this patch is going to be somewhat controversial.
> I'm submitting it mostly to see what the opinions are and evaluate
> options. I am working on improving performance for aarch64. On that
> architecture, there are fewer hand-written assembly implementations
> of hot functions than there are for x86_64, and allowing gcc to
> auto-vectorize yields noticeable improvements.
>
> Gcc vectorization has improved recently and it hasn't been evaluated
> on the mailing list for a few years. This is the latest discussion I
> found in my searches:
> http://ffmpeg.org/pipermail/ffmpeg-devel/2016-May/193977.html
>
> If the community is not comfortable accepting a patch like this
> outright, would you be willing to accept a new option to the
> configure script, something like --enable-auto-vectorization?
>
> Thanks!
>
> Signed-off-by: Jonathan Swinney <jswinney at amazon.com>
> ---
> configure | 4 +++-
> 1 file changed, 3 insertions(+), 1 deletion(-)
>
> diff --git a/configure b/configure
> index 6629d14099..c63c9348ad 100755
> --- a/configure
> +++ b/configure
> @@ -7173,7 +7173,9 @@ if enabled icc; then
> disable aligned_stack
> fi
> elif enabled gcc; then
> - check_optflags -fno-tree-vectorize
> + case $gcc_basever in
> + 2|2.*|3.*|4.*|5.*|6.*) check_optflags -fno-tree-vectorize ;;
> + esac
> check_cflags -Werror=format-security
> check_cflags -Werror=implicit-function-declaration
> check_cflags -Werror=missing-prototypes
> --
LGTM, basically. I removed that flag about two years ago and have never
seen an issue since (Windows, Linux, BSD across x86_64, armv7, aarch64).
But that has always been with fairly recent versions of gcc, so I can't say
whether it is already safe with 7.x.
The one exception I've seen was an Android NDK build in gcc compatibility
mode, where I got a clang compilation error. But I don't think that is
anything to worry about.
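
For anyone who wants to check which versions the case pattern in the patch
actually catches, here is a minimal standalone sketch. The function name
vectorize_flag_for and the list of sample version strings are made up for
illustration and are not part of configure; the sketch only assumes that
the version string looks like the usual "major.minor.patch" form.

```shell
#!/bin/sh
# Hypothetical helper (not part of configure): prints the flag that the
# patched case statement would hand to check_optflags for a given gcc
# version string, or nothing when vectorization is left at its default.
vectorize_flag_for() {
    case $1 in
        2|2.*|3.*|4.*|5.*|6.*) echo "-fno-tree-vectorize" ;;
    esac
}

# Sample version strings, chosen only for illustration.
for v in 4.8.5 6.5.0 7.5.0 10.2.1 12.1.0; do
    flag=$(vectorize_flag_for "$v")
    printf '%s: %s\n' "$v" "${flag:-(vectorization left enabled)}"
done
```

Since these are glob patterns and not regular expressions, "2.*" only
matches a literal "2." prefix, so two-digit major versions such as 10.x or
12.x fall through and keep vectorization enabled.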
sw