[FFmpeg-devel] [PATCH 2/2]v7 Opus Pyramid Vector Quantization Search in x86 SIMD asm
ikalvachev at gmail.com
Sun Aug 6 21:52:20 EEST 2017
This patch requires "Add macros used in opus_pvq_search to x86util.asm"
as 4 of the macros are moved there.
1. Cosmetics is completely redone.
2. I've left the align code as it is.
I found a really old nasm-2.07 version (from 19 Jan 2010) and made a test build.
I got nasm-2.09.04 (from Jan 11 2011) too, just to be sure.
They all passed without issues.
The x264 x86inc.asm also uses smartalign without
checking version number.
Also I had to do a bit more extensive benchmarks,
because it's hard to tell which version is better
(with or without align).
So far it looks like the align might be faster
with 2-6 cycles at best.
So until somebody finds some concrete issue
I'd like to keep the code as it is.
(maybe try avx2 without align:)
I hope I haven't forgotten to do something.
And I do hope I haven't messed up something new.
-------------- next part --------------
A non-text attachment was scrubbed...
Size: 18750 bytes
Desc: not available
More information about the ffmpeg-devel