JIS X 0201

from Wikipedia, the free encyclopedia

The JIS X 0201 developed in 1969, is the first Japanese character encoding , the wider application has found. JIS stands for Japan Industrial Standard , the counterpart to DIN . The character set is an extension of the ASCII code from seven to eight bits . This means that 128 more characters are available. However, this is not enough to accommodate the entire Japanese writing system with two syllabary fonts and at least 2,000 Chinese characters . Instead, only one syllabary was implemented, the katakana , which was the only font used in telegrams until then . Just a few years later, in 1978, JIS C 6226 was the first font based on 16 bits or 2  bytes to be developed which, with a maximum of 65,535 characters, made it possible to implement Kanji . Most of the fonts commonly used today, such as Shift-JIS, were developed on the basis of JIS C 6226 .

Problems

In JIS X 0201, the 7-bit coding JIS-Roman was adopted at code positions 0 to 127 , in which the two characters backslash (\) and tilde (~) have been replaced by the yen symbol (¥) and the overline (¯) were. This has led to on Japanese computers paths under DOS / Windows than C:¥Program Files¥appear.

A second late sequence of the brief are the half-width katakana (half-width katakana). In JIS X 0201, the katakana are represented by a single byte, and in the first implementations the katakana were not - as was common in Japanese script up to then - represented in an em , but rather like Latin letters in half-width. A new full-width katakana block was introduced in the following character sets, in which the katakana firstly occupied two bytes and secondly had the normal width. Most modern character encodings contain both sentences.

Table of the ASCII extension range

Hex A1 A2 A3 A4 A5 A6 A7 A8 A9 AA FROM AC AD AE AF
Dec 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175
character
Hex B0 B1 B2 B3 B4 B5 B6 B7 B8 B9 BA BB BC BD BE BF
Dec 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191
character
Hex C0 C1 C2 C3 C4 C5 C6 C7 C8 C9 CA CB CC CD CE CF
Dec 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207
character
Hex D0 D1 D2 D3 D4 D5 D6 D7 D8 D9 THERE DB DC DD DE DF
Dec 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223
character

Characters beyond 127 (7F hex ) not mentioned in the table are not used.