Unicode block Unified CJK ideograms

from Wikipedia, the free encyclopedia
CJK ideogram 次 in abbreviations , traditional characters , Kanji and Hanja

The Unicode block Unified CJK Ideograms (CJK Unified Ideographs, U + 4E00 to U + 9FFF) contains the basic Chinese characters that are not only used in the Chinese script , but also a component of Japanese and Hanja under the name Kanji make up a small part of the Korean script used in South Korea . Many of the characters in this block are used in all three writing systems , but many are only used in one or two of the three. As Chữ nôm , Chinese characters were also used for Vietnamese earlier . The sorting of the original 20,902 characters follows the sorting in the Kangxi dictionary according to radicals, with the characters with the least number of dashes listed first.

The terms “ideographs” in the original or “ideograms” in the translation are misleading, as the scripts represented are strictly speaking not conceptual scripts (see ideography ).

The block is the result mainly in the Far East controversial Han Unification ( English han unification ) encoded Chinese, Japanese, and Korean characters in the same place and makes the representation of each character of the selected font dependent. However, as the only Unicode block with CJK characters, this block fell under the source separation rule , which states that characters that were encoded separately in an earlier character set are also encoded separately in Unicode.

Since the Unicode version 5.1 it is possible with variant selectors to display certain variants of CJK ideograms.

table

The characters occupy the code points U + 4E00 to U + 9FFC in a closed sequence. All characters have the general category “Other Letter” and the bidirectional class “Left to Right”. They are named according to the scheme “CJK UNIFIED IDEOGRAPH-XXXX”, where “XXXX” is the hexadecimal code point.

Unicode number Characters
(400%)
Official name description
U + 4E00 (19968) CJK UNIFIED IDEOGRAPH-4E00 CJK ideogram 4E00
U + 4E01 (19969) CJK UNIFIED IDEOGRAPH-4E01 CJK ideogram 4E01
... ... ... ...
U + 9FFB (40955) CJK UNIFIED IDEOGRAPH-9FFB CJK ideogram 9FFB
U + 9FFC (40956) CJK UNIFIED IDEOGRAPH-9FFC CJK ideogram 9FFC

Character map

Due to the size of this block, the character table is divided into:

Graphic boards

Due to the size of this block, the graphic boards are located under the Unicode block Unified CJK ideograms / graphic boards

Individual references and sources

  1. ^ The secret life of Unicode
  2. The Unicode Standard 4.0, Appendix A - Han Unification History (PDF; 65 kB)
  3. The Unicode Standard, 4.0, Chapter 11 - East Asian Scripts (PDF; 618 kB)

Web links