Code page 949

from Wikipedia, the free encyclopedia
Windows code pages
0874 Thai
0932 Japanese
0936 Simplified Chinese
0949 Korean
0950 Traditional Chinese
1250 Central European
1251 Cyrillic
1252 Western European
1253 Greek
1254 Turkish
1255 Hebrew
1256 Arabic
1257 Baltic
1258 Vietnamese

The code page 949 (Korean 코드 페이지 949 , alternatively referred to United Hangul Code (Korean 통합형 한글 코드 , English Unified Hangul Code (UHC)), partially. Also Advanced Wansung ( English Extended Wansung )) is a character encoding of Windows -Betriebssystems that with Windows 95 was introduced. It is downward compatible with EUC-KR and combines the two Korean character sets Wansung (KS X 1001) and Johab .

There were two different character sets for Korea: the Wansung character set (Korean 완성 ), which encodes only the Hangul syllable blocks used in the Korean language (2,350), and the Johab character set, which encodes all possible Hangul syllable blocks, including those that do not appear in the Korean language (11,172 in total). Code page 949 occupies 8822 free code points of the Wansung character set with the additional Hangul syllable blocks of the Johab character set.

Like EUC-KR, code page 949 is a variable 16-bit coding, ie a character can be either one or two bytes in size. The characters in the range 0x00-0x7F are identical to KS X 1003 and consist of only one byte. The characters in the range 0x81-0xFE, on the other hand, consist of two bytes, in contrast to EUC-KR, however, the following byte can also be in the range 0x41-0x7A.

Since Windows XP, the original EUC-KR encoding has also been offered as code page 51949.

Web links

Individual evidence

  1. ^ A b Brief History of Character Codes in North America, Europe, and East Asia