Code point

from Wikipedia, the free encyclopedia

In the context of character sets (e.g. ASCII or Unicode), a code point is the numerical value assigned to a character within the code space. A character set associates abstract characters with code points. Each such pairing of an abstract character with a code point is called an encoded character (also "coded character").

A code point is generally represented by a non-negative integer. The code space is the set of all integers that denote a valid character in the character set; in some character sets there are numerical values (bit sequences) that have no corresponding character and are therefore "invalid".
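The notion of an "invalid" value can be illustrated with a minimal Python sketch, assuming ASCII as the character set. The helper names `in_ascii_code_space` and `decode_ascii_byte` are hypothetical and chosen only for this illustration; a single byte can hold 256 values, but only 128 of them lie in the ASCII code space.

```python
from typing import Optional


def in_ascii_code_space(value: int) -> bool:
    """Return True if value lies in the ASCII code space (0 through 127)."""
    return 0 <= value <= 0x7F


def decode_ascii_byte(value: int) -> Optional[str]:
    """Decode one byte value as ASCII; return None if it has no character."""
    try:
        return bytes([value]).decode("ascii")
    except (UnicodeDecodeError, ValueError):
        # UnicodeDecodeError: byte value 128-255, outside the ASCII code space.
        # ValueError: value does not fit in a single byte at all.
        return None


print(decode_ascii_byte(0x41))  # a valid ASCII code point
print(decode_ascii_byte(0x80))  # a bit sequence with no ASCII character
```

Here the byte value 0x80 is a perfectly good bit sequence, yet the ASCII codec rejects it because no character is assigned to it.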

The code space of the ASCII character set covers decimal 0–127 (hexadecimal 0–7F); the code space of the Unicode character set covers decimal 0–1114111 (hexadecimal 0–10FFFF). The character A (the Latin capital letter A) has the code point 65 (hexadecimal 41) in both character sets.
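The figures above can be checked with a short Python sketch. It relies on the built-in `ord` and `chr` functions, which convert between a one-character string and its Unicode code point. The constants `ASCII_MAX` and `UNICODE_MAX` and the helper `format_code_point` are names introduced here for illustration only.

```python
ASCII_MAX = 0x7F        # upper bound of the ASCII code space (decimal 127)
UNICODE_MAX = 0x10FFFF  # upper bound of the Unicode code space (decimal 1114111)


def format_code_point(cp: int) -> str:
    """Format a code point in the conventional U+XXXX notation."""
    if not 0 <= cp <= UNICODE_MAX:
        raise ValueError(f"{cp} is outside the Unicode code space")
    return f"U+{cp:04X}"


code_point = ord("A")               # Unicode code point of the character A
ascii_value = "A".encode("ascii")[0]  # value of A when encoded as ASCII

print(code_point, ascii_value, format_code_point(code_point))
```

Both `code_point` and `ascii_value` come out as 65, matching the statement that A has the same code point in ASCII and in Unicode. Calling `chr` with a value above `UNICODE_MAX` raises a `ValueError`, since such a number lies outside the code space.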

The list of Unicode blocks shows the assignments for the Unicode character set.

Literature

  • The Unicode Standard. The terms introduced here are defined in Chapter 3, Section D.