[FFmpeg-devel] [PATCH] Fix SSE code to not use SSE2.

Reimar Döffinger Reimar.Doeffinger at gmx.de
Wed Mar 7 08:27:05 CET 2012


On 6 Mar 2012, at 22:49, Jason Garrett-Glaser <jason at x264.com> wrote:
> On Tue, Mar 6, 2012 at 1:11 PM, Reimar Döffinger
> <Reimar.Doeffinger at gmx.de> wrote:
>> movq from SSE register _to_ memory is an SSE2 instruction.
>> Use the SSE movlps instruction instead, which does the same thing.
>> 
>> Signed-off-by: Reimar Döffinger <Reimar.Doeffinger at gmx.de>
>> ---
>>  libavcodec/x86/sbrdsp.asm |    2 +-
>>  1 files changed, 1 insertions(+), 1 deletions(-)
>> 
>> diff --git a/libavcodec/x86/sbrdsp.asm b/libavcodec/x86/sbrdsp.asm
>> index c165c52..c3b559b 100644
>> --- a/libavcodec/x86/sbrdsp.asm
>> +++ b/libavcodec/x86/sbrdsp.asm
>> @@ -104,7 +104,7 @@ cglobal sbr_hf_g_filt, 5, 6, 5
>>     movq        m2, [r1]
>>     punpckldq   m0, m0
> 
> These look pretty SSE2 to me, too.

Unfortunately that depends on the specific opcode chosen; they all have SSE equivalents, after all.
According to the gdb backtrace, only the one I fixed is an issue.
Admittedly gdb could have been off by one instruction, but at least the movq seems fine...
We really need someone to test.

