KS X 1001
KS X 1001 (also called Wansung (완성)) is a South Korean standard that defines a character encoding for the Korean script . It was introduced in 1974 as KS C 5601 and revised in 1982, 1987, 1992, 1998, 2002 and most recently in 2004.
KS X 1001 is a 16-bit coding. All characters are encoded in the range 0x21–0x7E in a 94 × 94 matrix, which allows a maximum of 8,836 characters to be encoded - a system that is also used by neighboring encodings such as JIS X 0208 (Japan) or GB2312 (China). It encodes 2,350 prefabricated Hangeul syllables, 4,888 Hanja , the Latin, Greek and Cyrillic script, as well as various symbols, a total of 8,227 characters.
KS X 1001 is the only Korean character set that was widely used. It is implemented in EUC-KR and ISO-2022-KR and used on most websites and operating systems. Under Windows, KS X 1001 is mapped as code page 51949. However, Korean versions of Windows use an extension of this character set, code page 949 , also known as Unified Hangul Code (UHC).
The character set only encodes the 2,350 most common Hangeul syllables, which causes problems especially with foreign words and proper names, as the syllables used for this are usually not included in KS X 1001.
In North Korea , KPS 9566 is used instead of this character set .
Change history
- 1998
- Two characters have been added:
- U + 00AE ® REGISTERED SIGN (0xA2E7 in EUC-KR and 0x2267 in ISO-2022-KR)
- U + 20AC € EURO SIGN (0xA2E6 in EUC-KR and 0x2266 in ISO-2022-KR)
- 2002
- A character was added:
- U + 327E ㉾ CIRCLED HANGUL IEUNG U (0xA2E8 in EUC-KR and 0x2268 in ISO-2022-KR)
Web links
- PDF table of the character set (scanned version from 1987; 2.88 MB)
- Unicode table (ASCII version from 1987)
- PDF table on one page 61 cm × 61 cm (version from 1992; 3.44 MB)