string - What is the difference between UTF-32 and UCS-4? -
what difference between utf-32 , ucs-4 ? isn't utf-32 supposed fixed-width encoding ?
utf-32
has started subset of ucs-4
. identical except utf-32 standard has additional unicode semantics. see details on wikipedia:
the original iso 10646 standard defines 31-bit encoding form called ucs-4, in each encoded character in universal character set (ucs) represented 32-bit friendly code value in code space of integers between 0 , hexadecimal 7fffffff.
because 17 planes in use, current code points between 0 , 0x10ffff. utf-32 subset of ucs-4 uses range. since principles , procedures document of jtc1/sc2/wg2 states future assignments of characters constrained bmp or first 14 supplementary planes, utf-32 able represent unicode characters. accordingly, ucs-4 , utf-32 identical except utf-32 standard has additional unicode semantics.
however, not excactly sure, additional unicode semantics
means. maybe can provide better answer.
Comments
Post a Comment