[FFmpeg-devel] [PATCH] avcodec/mlpdec: Add decoding of object audio data
Marton Balint
cus at passwd.hu
Sun Mar 23 23:50:35 EET 2025
On Sun, 23 Mar 2025, James Almer wrote:
> On 3/23/2025 4:33 PM, Massimo Eynard wrote:
>> On 23/03/2025 20:01, James Almer wrote:
>>> On 3/22/2025 2:49 PM, Massimo Eynard wrote:
>>>> This patch adds support for decoding the fourth MLP substream
>>>> which contains the 16-channel presentation used for Atmos
>>>> audio objects.
>>>>
>>>> By default only the first three substreams are decoded
>>>> unless the new extract_objects flag is enabled as the resulting
>>>> presentation contains audio object feeds instead of classic
>>>> loudspeaker feeds.
>>>>
>>>> As this introduces interpolation of primitive matrices, precision
>>>> has been increased to 2.18 fixed point. Therefore this requires
>>>> DSP code upgrade which has been done for C and x86 implementations
>>>> but not the ARM implementation.
>>>>
>>>> Adds two FATE tests using existing atmos.thd sample to reflect
>>>> changes.
>>>>
>>>> Signed-off-by: Massimo Eynard <eynard.massimo at gmail.com>
>>>> ---
>>>> libavcodec/arm/mlpdsp_armv5te.S | 2 +-
>>>> libavcodec/arm/mlpdsp_init_arm.c | 3 +-
>>>> libavcodec/mlp.h | 10 +-
>>>> libavcodec/mlp_parse.c | 31 ++-
>>>> libavcodec/mlp_parse.h | 1 +
>>>> libavcodec/mlp_parser.c | 11 +-
>>>> libavcodec/mlpdec.c | 389
>>>> +++++++++++++++++++++++++++----
>>>> libavcodec/mlpdsp.c | 50 +++-
>>>> libavcodec/mlpdsp.h | 25 ++
>>>> libavcodec/x86/mlpdsp.asm | 19 +-
>>>> tests/fate/truehd.mak | 10 +
>>>> 11 files changed, 476 insertions(+), 75 deletions(-)
>>>
>>> With atmos.thd i get:
>>>
>>>> [aist#0:0/truehd @ 00000209caf3ee00] Guessed Channel Layout: 7.1.4
>>>> Input #0, truehd, from '../samples/truehd/atmos.thd':
>>>> Duration: N/A, start: 0.000000, bitrate: N/A
>>>> Stream #0:0: Audio: truehd (Dolby TrueHD + Dolby Atmos), 48000 Hz,
>>>> 7.1.4, s32 (24 bit)
>>>
>>> Which is unlikely to be correct. The file has 11 (or 12) objects, which
>>> is exported as 12 channels in an unspecified layout, and automatically
>>> assumed to be a 7.1.4 fixed layout.
>>>
>>
>> This is caused by `guess_input_channel_layout` (in `ffmpeg_demux.c`) which
>> tries to assume a layout.
I wonder if the default value for guess_layout_max should be set to 8. An
unspecified layout with more than 8 channels seems unlikely. (unless it is
indeed unspecified/unknown)
>> Would using `AV_CHANNEL_ORDER_CUSTOM` with all channels set to
>> `AV_CHAN_UNKNOWN` (for unknown position, except LFE if present) be a
>> better solution?
>
AV_CHANNEL_ORDER_CUSTOM with all channels being AV_CHAN_UNKNOWN is
equvivalent to AV_CHANNEL_ORDER_UNSPEC. So unless there is a channel which
is not AV_CHAN_UNKNOWN, there is no reason to use AV_CHANNEL_ORDER_CUSTOM.
Regards,
Marton
More information about the ffmpeg-devel
mailing list