[Ffmpeg-devel] retrieving asf textual info in other languages

Måns Rullgård mru
Tue May 17 12:02:52 CEST 2005


Hauke Duden <H.NS.Duden at gmx.net> writes:

> M?ns Rullg?rd wrote:
>
>>Hauke Duden <H.NS.Duden at gmx.net> writes:
>>
>>
>>
>>>Sorry to intrude here, but UTF-8 is very simple. Why not simply
>>>convert it yourself? Below is a simple striaghtforward encode routine
>>>from unicode char to UTF-8, if you need one. Since UCS-2 is a subset
>>>of Unicode this should work for it as well  Use it as you like.
>>
>>This doesn't deal with nasty things like surrogate pairs.
>
> UCS-2 has no surrogate pairs. It is a fixed 2-byte encoding (which
> does not cover the whole Unicode charset). I guess you're thinking
> about UTF-16.

Right.  I keep confusing all the different terms used.

> If you wanted to handle that then the easiest approach would be do
> first decode the UTF-16 chars into plain Unicode and then re-encode
> them as UTF-8.

Yes, converting between the different utf encodings is simpler than
doing arbitrary conversions (sjis, anyone?).

-- 
M?ns Rullg?rd
mru at inprovide.com





More information about the ffmpeg-devel mailing list