[FFmpeg-devel] [PATCH] avfilter/avf_showcqt: cqt_calc optimization on x86

Muhammad Faiz mfcc64 at gmail.com
Wed Jun 8 11:08:30 CEST 2016


On Tue, Jun 7, 2016 at 2:51 PM, Muhammad Faiz <mfcc64 at gmail.com> wrote:
> On Tue, Jun 7, 2016 at 9:49 AM, Michael Niedermayer
> <michael at niedermayer.cc> wrote:
>> On Tue, Jun 07, 2016 at 08:07:45AM +0700, Muhammad Faiz wrote:
>>> On Sat, Jun 4, 2016 at 2:36 PM, Muhammad Faiz <mfcc64 at gmail.com> wrote:
>>> > benchmark on x86_64
>>> > cqt_time:
>>> > plain = 3.292 s
>>> > SSE   = 1.640 s
>>> > SSE3  = 1.631 s
>>> > AVX   = 1.395 s
>>> > FMA3  = 1.271 s
>>> > FMA4  = not available
>>> >
>>> > untested on x86_32
>>> >
>>> > Signed-off-by: Muhammad Faiz <mfcc64 at gmail.com>
>>> > ---
>>> >  libavfilter/avf_showcqt.c          |   7 ++
>>> >  libavfilter/avf_showcqt.h          |   3 +
>>> >  libavfilter/x86/Makefile           |   2 +
>>> >  libavfilter/x86/avf_showcqt.asm    | 206 +++++++++++++++++++++++++++++++++++++
>>> >  libavfilter/x86/avf_showcqt_init.c |  63 ++++++++++++
>>> >  5 files changed, 281 insertions(+)
>>> >  create mode 100644 libavfilter/x86/avf_showcqt.asm
>>> >  create mode 100644 libavfilter/x86/avf_showcqt_init.c
>>> >
>>>
>>> I want to apply this patch but I'm not sure if it works on
>>> x86_32 or not (I didn't test it on x86_32)
>>
>> i had tested fate with it applied on x86_32 earlier today but i think
>> this filter isnt tested with fate ...
>>
>> testing x86_32 is easy:
>> --arch=x86_32 --target-os=linux --extra-cflags=-m32 --extra-ldflags=-m32  --enable-cross-compile
>>
> unfortunately, my linux is pure 64-bit (Fedora 23, it has no 32-bit libraries),
> so it cannot be done
>
> thanks

I update my linux to support 32-bit (at least libc),
and now testingx86_32 works

thank's for the suggestion


More information about the ffmpeg-devel mailing list