[FFmpeg-trac] #3363(ffprobe:new): ffprobe silently drops non-ASCII metadata in VQF files
FFmpeg
trac at avcodec.org
Fri Jan 31 04:49:20 CET 2014
#3363: ffprobe silently drops non-ASCII metadata in VQF files
---------------------------------+---------------------------------------
Reporter: trejkaz | Type: defect
Status: new | Priority: normal
Component: ffprobe | Version: unspecified
Keywords: | Blocked By:
Blocking: | Reproduced by developer: 0
Analyzed by developer: 0 |
---------------------------------+---------------------------------------
Summary of the bug:
How to reproduce:
{{{
% ffprobe -show_format -show_streams -print_format json test.vqf
% ffprobe -version
ffprobe version N-60503-g28975cb-tessus
built on Jan 28 2014 18:43:59 with llvm-gcc 4.2.1 (LLVM build 2336.1.00)
configuration: --prefix=/Users/tessus/data/ext/ffmpeg/sw --as=yasm
--extra-version=tessus --disable-shared --enable-static --disable-ffplay
--enable-gpl --enable-pthreads --enable-postproc --enable-libmp3lame
--enable-libtheora --enable-libvorbis --enable-libx264 --enable-libxvid
--enable-libspeex --enable-bzlib --enable-zlib --enable-libopencore-amrnb
--enable-libopencore-amrwb --enable-libxavs --enable-version3 --enable-
libvo-aacenc --enable-libvo-amrwbenc --enable-libvpx --enable-libgsm
--enable-libopus --enable-libmodplug --enable-fontconfig --enable-
libfreetype --enable-libass --enable-libbluray --enable-filters --enable-
runtime-cpudetect
libavutil 52. 63.100 / 52. 63.100
libavcodec 55. 49.101 / 55. 49.101
libavformat 55. 28.100 / 55. 28.100
libavdevice 55. 7.100 / 55. 7.100
libavfilter 4. 1.101 / 4. 1.101
libswscale 2. 5.101 / 2. 5.101
libswresample 0. 17.104 / 0. 17.104
libpostproc 52. 3.100 / 52. 3.100
}}}
[json @ 0x103000000] 1 invalid UTF-8 sequence(s) found in string
'Bl?mchen', replaced with ''
The value ffprobe emits is "Blchen".
The value it emitted before fixing #2502 was "Bl�mchen" (invalid character
intentional) - which although containing an invalid character, at least
retained all the valid characters. The current builds drop the "m" as well
as the invalid character.
The value I would like to see, however, is "Blümchen".
If the issue is that the VQF module is doing something wrong to convert to
Unicode, it would be good to get that fixed.
If the issue is that VQF is one of those legacy formats where the encoding
isn't known, would it be possible to have some way to specify the system
encoding? I can't just change the encoding of the entire system, because
doing that in a cross-platform way is not really practical.
There is a sample exhibiting the issue in the mplayer samples:
http://samples.mplayerhq.hu/vqf/handinha.vqf
--
Ticket URL: <https://trac.ffmpeg.org/ticket/3363>
FFmpeg <http://ffmpeg.org>
FFmpeg issue tracker
More information about the FFmpeg-trac
mailing list