[FFmpeg-trac] #2431(avcodec:new): Detect if subtitle streams do not contain valid utf-8

FFmpeg trac at avcodec.org
Sat Apr 6 20:45:00 CEST 2013

#2431: Detect if subtitle streams do not contain valid utf-8
             Reporter:  Nick         |                    Owner:
                 Type:  enhancement  |                   Status:  new
             Priority:  wish         |                Component:  avcodec
              Version:  git-master   |               Resolution:
             Keywords:  sub srt      |               Blocked By:
             Blocking:               |  Reproduced by developer:  0
Analyzed by developer:  0            |

Comment (by Nick):

 I asked on zeranoe.com to build a version of ffmpeg with '''--enable-
 iconv''' to support option '''-sub_charenc'''
 A new Windows build of ffmpeg built with --enable-iconv is now available:
 Thanks to zeranoe.com!

 I tested this version and now -sub_charenc is working ''(more or less)''!
 The ''(first)'' characters are converted correctly from Windows ANSI to
 UTF-8 but now I found another problem. If -sub_charenc option is used to
 convert an srt file then the last character of every subtitle paragraph is
 missing or not correct converted! (see screenshot)
 It seems a problem of the "subtitles character encoding conversion"
 This problem seems independent whether the last character is an "ordinary"
 character or a special character. Furthermore it is independent whether
 the imported srt file is a Windows/DOS or Unix text file.

 Could somebody please check this problem using option "-sub_charenc" with
 an Linux build of ffmpeg and/or avcodec created with --enable-iconv?

 Command line:
 ffmpeg -i input.mp4 -sub_charenc CP1252 -i subtitle_test_2.srt -map 0:v
 -map 0:a -c copy -map 1 -c:s:0 srt -metadata:s:s:0 language=ger
 ''(test files see attachment)''

Ticket URL: <https://ffmpeg.org/trac/ffmpeg/ticket/2431#comment:18>
FFmpeg <http://ffmpeg.org>
FFmpeg issue tracker

More information about the FFmpeg-trac mailing list