[FFmpeg-devel] [PATCH] avfilter/avf_showcqt: cqt_calc optimization on x86

Muhammad Faiz mfcc64 at gmail.com
Tue Jun 7 09:51:46 CEST 2016


On Tue, Jun 7, 2016 at 9:49 AM, Michael Niedermayer
<michael at niedermayer.cc> wrote:
> On Tue, Jun 07, 2016 at 08:07:45AM +0700, Muhammad Faiz wrote:
>> On Sat, Jun 4, 2016 at 2:36 PM, Muhammad Faiz <mfcc64 at gmail.com> wrote:
>> > benchmark on x86_64
>> > cqt_time:
>> > plain = 3.292 s
>> > SSE   = 1.640 s
>> > SSE3  = 1.631 s
>> > AVX   = 1.395 s
>> > FMA3  = 1.271 s
>> > FMA4  = not available
>> >
>> > untested on x86_32
>> >
>> > Signed-off-by: Muhammad Faiz <mfcc64 at gmail.com>
>> > ---
>> >  libavfilter/avf_showcqt.c          |   7 ++
>> >  libavfilter/avf_showcqt.h          |   3 +
>> >  libavfilter/x86/Makefile           |   2 +
>> >  libavfilter/x86/avf_showcqt.asm    | 206 +++++++++++++++++++++++++++++++++++++
>> >  libavfilter/x86/avf_showcqt_init.c |  63 ++++++++++++
>> >  5 files changed, 281 insertions(+)
>> >  create mode 100644 libavfilter/x86/avf_showcqt.asm
>> >  create mode 100644 libavfilter/x86/avf_showcqt_init.c
>> >
>>
>> I want to apply this patch but I'm not sure if it works on
>> x86_32 or not (I didn't test it on x86_32)
>
> i had tested fate with it applied on x86_32 earlier today but i think
> this filter isnt tested with fate ...
>
> testing x86_32 is easy:
> --arch=x86_32 --target-os=linux --extra-cflags=-m32 --extra-ldflags=-m32  --enable-cross-compile
>
unfortunately, my linux is pure 64-bit (Fedora 23, it has no 32-bit libraries),
so it cannot be done

thanks


More information about the ffmpeg-devel mailing list