[FFmpeg-devel] [PATCH 2/3] avcodec/x86: disable hevc 12b luma deblock
Martin Storsjö
martin at martin.st
Sat Feb 24 12:46:44 EET 2024
On Sat, 24 Feb 2024, J. Dekker wrote:
>
> Nuo Mi <nuomi2021 at gmail.com> writes:
>
>> On Wed, Feb 21, 2024 at 7:10 PM J. Dekker <jdek at itanimul.li> wrote:
>>
>>> Over/underflow in some cases.
>>>
>>> Signed-off-by: J. Dekker <jdek at itanimul.li>
>>> ---
>>> libavcodec/x86/hevcdsp_init.c | 9 +++++----
>>> 1 file changed, 5 insertions(+), 4 deletions(-)
>>>
>>> diff --git a/libavcodec/x86/hevcdsp_init.c b/libavcodec/x86/hevcdsp_init.c
>>> index 31e81eb11f..11cb1b3bfd 100644
>>> --- a/libavcodec/x86/hevcdsp_init.c
>>> +++ b/libavcodec/x86/hevcdsp_init.c
>>> @@ -1205,10 +1205,11 @@ void ff_hevc_dsp_init_x86(HEVCDSPContext *c, const
>>> int bit_depth)
>>> if (EXTERNAL_SSE2(cpu_flags)) {
>>> c->hevc_v_loop_filter_chroma =
>>> ff_hevc_v_loop_filter_chroma_12_sse2;
>>> c->hevc_h_loop_filter_chroma =
>>> ff_hevc_h_loop_filter_chroma_12_sse2;
>>> - if (ARCH_X86_64) {
>>> - c->hevc_v_loop_filter_luma =
>>> ff_hevc_v_loop_filter_luma_12_sse2;
>>> - c->hevc_h_loop_filter_luma =
>>> ff_hevc_h_loop_filter_luma_12_sse2;
>>> - }
>>> + // FIXME: 12-bit luma deblock over/underflows in some cases
>>> + // if (ARCH_X86_64) {
>>> + // c->hevc_v_loop_filter_luma =
>>> ff_hevc_v_loop_filter_luma_12_sse2;
>>> + // c->hevc_h_loop_filter_luma =
>>> ff_hevc_h_loop_filter_luma_12_sse2;
>>> + // }
>>> SAO_BAND_INIT(12, sse2);
>>> SAO_EDGE_INIT(12, sse2);
>>>
>> Hi Dekker,
>> VVC will utilize this function as well.
>> Could you please share the HEVC clip or data that caused the overflow?
>> We'll make efforts to address it during the VVC porting
>>
>
> You can just run ./tests/checkasm/checkasm --test=hevc_deblock to
> find a failing case.
To clarify, this is with the new checkasm test added in this patchset, not
currently in git master - otherwise fate would be failing for everybody on
x86.
> My guess is that delta0 overflows before the right
> shift, see the ARM64 asm which specfically widens this calculation on 12
> bit variant but I'm not 100%, I don't know x86 asm.
Are you sure the input is within valid range? It's always possible that
checkasm produces inputs that the real decoder wouldn't - but it's also
possible that this is a real decoder bug that just hasn't been triggered
by any other test yet.
// Martin
More information about the ffmpeg-devel
mailing list