ISO 8859-5

from Wikipedia, the free encyclopedia
ISO 8859
-1 Latin-1 , Western European
-2 Latin-2 , Central European
-3 Latin-3 , Southern European
-4 Latin-4 , Northern European
-5 Cyrillic
-6 Arabic
-7 Greek
-8th Hebrew
-9 Latin-5 , Turkish
-10 Latin-6 , Nordic
-11 Thai
-12 (does not exist)
-13 Latin-7 , Baltic
-14 Latin-8 , Celtic
-15 Latin-9 , Western European
-16 Latin-10 , Southeast European

ISO 8859-5 , more precisely ISO / IEC 8859-5 also known as Cyrillic , is a standard for information technology, last updated by ISO in 1998 , for character encoding with eight bits and the fifth part of the ISO / IEC 8859 family of standards .

The characters that can be coded with seven bits correspond to US- ASCII with a leading zero bit. In addition to the 95 representable ASCII characters (20 16 –7E 16 ), ISO 8859-5 encodes 96 more (A0 16 –FF 16 ), so a total of 191 of the theoretically possible 256 (= 2 8 ). Positions 00 16 –1F 16 and 7F 16 –9F 16 are not assigned any characters in ISO / IEC 8859 and therefore ISO / IEC 8859-5. This area was deliberately kept free in order to be able to use the corresponding bytes for device control or to ensure that these do not conflict with such control characters if the coding is insufficiently specified. The designation ISO-8859-5 (with hyphen) defined by the IANA stands for the combination of the characters of this standard with non-displayable control characters according to ISO / IEC 6429.

ISO 8859-5 tries to cover all characters in the Cyrillic alphabets . For various reasons, however, KOI8-R ( Russian ), KOI8-U ( Ukrainian ) and Windows-1251 are more common. In June 2014, 2.4% of all websites use Windows-1251, ISO-8859-5 is used by less than 0.1%.

table

ISO-8859-5 including the control characters not contained in ISO / IEC 8859-5
code … 0 …1 … 2 … 3 … 4 … 5 … 6 … 7 …8th … 9 … A … B ... C … D … E ... F
0 ... NUL SOH STX ETX EOT ENQ ACK BEL BS HT LF VT FF CR SO SI
1… DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN EM SUB ESC FS GS RS US
2… SP ! " # $ % & ' ( ) * + , - . /
3… 0 1 2 3 4th 5 6th 7th 8th 9 : ; < = > ?
4… @ A. B. C. D. E. F. G H I. J K L. M. N O
5… P Q R. S. T U V W. X Y Z [ \ ] ^ _
6… ` a b c d e f G H i j k l m n O
7… p q r s t u v w x y z { | } ~ DEL
8th… PAD HOP BPH NBH IND NEL SSA ESA HTS HTJ VTS PLD PLU RI SS2 SS3
9 ... DCS PU1 PU2 STS CCH MW SPA EPA SOS SGCI SCI CSI ST OSC PM APC
A ... NBSP Ё Ђ Ѓ Є Ѕ І Ї Ј Љ Њ Ћ Ќ SHY Ў Џ
B ... А Б В Г Д Е Ж З И Й К Л М Н О П
C ... Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я
D ... а б в г д е ж з и й к л м н о п
E ... р с т у ф х ц ч ш щ ъ ы ь э ю я
F ... ё ђ ѓ є ѕ і ї ј љ њ ћ ќ § ў џ

SP ( space ) at position 20 16 is the space , NBSP ( no-break space , including non-breaking space ) to position A0 16 is the non-breaking spaces and SHY ( soft hyphen ) at position AD 16 is the conditional hyphen , which normally is only visible at the end of the line.

See also

Web links

Individual evidence

  1. http://w3techs.com/technologies/comparison/en-iso885905,en-windows1251