[FFmpeg-devel] [PATCH 2/3] textdec: Rename all generic parts from srt to text.

Clément Bœsch ubitux at gmail.com
Thu Aug 2 11:28:33 CEST 2012


On Thu, Aug 02, 2012 at 11:16:49AM +0200, Nicolas George wrote:
[...]
> > So why I am talking about this here? Well, I think ideally the subtitles
> > decoder should *not* receive the timing information. This way, we would
> > have:
> > 
> >  - Matroska demuxer outputting SubRip packets the same way the SubRip
> >    demuxer would output SubRip packets (both without any timing
> >    information). Both would use CODEC_ID_SRT, and the codec will just
> >    honor the markup
> 
> Yes on the principle, but we will need to deal with the fact that the markup
> is wild. Matroska marks text subtitles as S_TEXT, it does not say anything
> about markup, and it is IMHO an abuse from the players to honor it. But this
> abuse is so common that we have to deal with it.
> 
[...]
> >  - Now, CODEC_ID_TEXT will be available for any formats using text
> >    *without* SubRip markup (contrary to Matroska): this might be needed
> >    for some video codec, but it means this codec will also be available
> >    for various subtitles format where we don't need any markup. Right now
> >    MicroDVD, JacoSUB, SAMI, SubViewer etc all have their own decoder for
> >    special markups, but I'm sure various subtitles format don't have any
> >    markup system, and here CODEC_ID_TEXT would make sense: that can not
> >    work with CODEC_ID_TEXT is considered to be "html"/SubRip markup.
> 
> Except the "contrary to Matroska" part, I agree with that.
> 

According to #matroska at freenode a while ago:

22:35 < ubitux> are "S_TEXT/UTF8" entries always text subtitles in SubRip markup?
22:59 < Mosu> yes

and actually the specifications:
http://www.matroska.org/technical/specs/subtitles/srt.html (there is no
markup information but it clearly stats the input clearly is a SubRip
file).

It really looks like to me Matroska is assuming SubRip markup for S_TEXT.

Also, if the matroska developers decide to explictely standardize this
(this might not be trivial since there is no real SubRip markup spec),
they will likely say that S_TEXT is for SubRip markup to not break compat
with all the samples in the wild, and eventually add a "raw no tag soup"
one.

[...]

-- 
Clément B.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 490 bytes
Desc: not available
URL: <http://ffmpeg.org/pipermail/ffmpeg-devel/attachments/20120802/88c84d80/attachment.asc>


More information about the ffmpeg-devel mailing list