In Unicode , special areas are designated as Private Use Areas ( PUA , English for "Unicode areas for personal use"). The code points in these areas are never assigned characters standardized in Unicode itself. This means that these can be used for privately defined characters that have to be agreed individually between the creators and users of the texts they contain. Such agreements can e.g. For example, to share a font file in which such characters are provided for use.
Areas of code
The Unicode standard identifies the three areas described below as intended for the users' own use.
Private use zone
The private use zone is on level 0 ( BMP, Basic Multilingual Plane ) and covers the area from U + E000 to U + F8FF. That's 6400 code points.
Which characters a font has defined in the private use zone can be determined with the template: Private-Use-Area-Test .
Private Use Planes
Unicode levels 15 and 16 only contain the two blocks Supplementary Private Use Area-A and -B . Instead of PUA-A and PUA-B , private use plans (PUP) are sometimes also used in summary .
Supplementary Private Use Area-A
The Supplementary Private Use Area-A covers the entire level 15, i.e. the area from U + F0000 to U + FFFFD. That is 65534 code points.
Which characters a font has defined in the Supplementary Private Use Area-A can be determined with the template: Supplementary-Private-Use-Area-A-Test .
Supplementary Private Use Area-B
The Supplementary Private Use Area-B covers the entire level 16, i.e. the area from U + 100000 to U + 10FFFD. That is 65534 code points.
Which characters a font has defined in the Supplementary Private Use Area-B can be determined with the template: Supplementary-Private-Use-Area-B-Test .
use
The assignment of characters to code points is not regulated by the Unicode consortium in these areas. However, there are various organizations and initiatives that coordinate the allocation of character codes in these areas.
Medieval Unicode Font Initiative
The Medieval Unicode Font Initiative (MUFI) coordinates the coding of historical characters, character variants and ligatures and assigns code points from the private use zone, mainly from the U + E000 to U + EFFF area.
Use on Linux
Under Linux, the private use zone was divided into two areas:
U + E000… U + EFFF: "End User Zone"
U + F000… U + F8FF: "Linux Zone"
U + F000… U + F7FF: 1: 1 mapping to the characters of the current console font
U + F800… U + F8FF: characters defined throughout Linux that are required / desired under Linux, but are not yet included in Unicode.
The end user zone is freely available to the end user. The Linux zone is reserved for internal operating system purposes. The range from U + F000 to U + F7FF is used to cover a 1: 1 mapping of the screen font used for the console. This enables programs such as consolecharsto display all characters of the currently used screen font without knowing their character encodings. Since the Linux text console supports a maximum of 512 characters in a screen font, this range is more than sufficient. The range from U + F800 to U + F8FF is used for characters that are required or desired under Linux, but which are not (yet) included in the Unicode character set:
Codepoint
character
comment
U + F800
DEC VT GRAPHICS HORIZONTAL LINE SCAN 1
With the inclusion of these characters in Unicode 3.2, these 4 code positions are out of date ("deprecated").
U + F801
DEC VT GRAPHICS HORIZONTAL LINE SCAN 3
U + F803
DEC VT GRAPHICS HORIZONTAL LINE SCAN 7
U + F804
DEC VT GRAPHICS HORIZONTAL LINE SCAN 9
U + F810
KEYBOARD SYMBOL FLYING FLAG
Button symbol "waving flag" = Windows button
U + F811
KEYBOARD SYMBOL PULLDOWN MENU
Menu button icon
U + F812
KEYBOARD SYMBOL OPEN APPLE
Button symbol " empty apple "
U + F813
KEYBOARD SYMBOL SOLID APPLE
"Filled apple" button symbol
U + F8D0
Letters and numerals of the fictional Klingon language
U + F8FF
The allocation of code points in the "Linux Zone" is coordinated by the Linux Assigned Names and Numbers Authority (LANANA).
ConScript Unicode Registry
This volunteer project coordinates the ingestion of fictional writings used in novels or films, such as the Middle-earth Fantasy Languages by JRR Tolkien . It assigns code points in all 3 private Unicode blocks and coordinates with LANANA, but not with MUFI.
Other uses
See also
Andreas Stötzner: LINCUA - A Unicode PUA harmonization plan. June 20, 2012. Retrieved August 26, 2012 .
Individual evidence
↑ Michael Everson et al .: Roadmap to the BMP - revision 6.1.0. The Unicode Consortium, February 1, 2012, accessed on August 26, 2012 (The term “Private Use Zone” is not used in the text of the Unicode standard, but can be found on this website officially provided by the Unicode Committee.).
↑ Unicode 6.3 Chapter 2.8, page 34, first paragraph (since the core specification for version 6.3 has not been changed and has not been published again, the files from version 6.2 for 6.3 continue to apply unchanged.)
^ Medieval Unicode Font Initiative. Retrieved August 21, 2012 .
^ H. Peter Anvin (ed.): Linux Zone Unicode Assignments. (TXT) The "Linux Assigned Names And Numbers Authority" (LANANA) project, January 17, 2005, accessed September 12, 2012 .
↑ ConScript Unicode Registry. Retrieved August 21, 2012 .
^ Peter Constable and Lorna A. Priest: SIL Corporate PUA Assignments. April 17, 2012. Retrieved August 21, 2012 .
↑ Chris Harvey: Languagegeek Fonts. June 29, 2012. Retrieved August 21, 2012 .
Level 0
Basic Latin |
Latin-1, addition |
Latin, adult-A |
Latin, adult B |
IPA extensions |
Spacing Modifier Letters |
COMBINING DIACRITICS |
Greek and Coptic |
Cyrillic |
Cyrillic, addition |
Armenian |
Hebrew |
Arabic |
Syriac |
Arabic, addition |
Thaana |
N'Ko |
Samaritan |
Mandaean |
Syriac, addition |
Arabic, adult-A |
Devanagari |
Bengali |
Gurmukhi |
Gujarati |
Oriya |
Tamil |
Telugu |
Kannada |
Malayalam |
Sinhala |
Thai |
Lao |
Tibetan |
Burmese |
Georgian |
Hangeul Jamo |
Ethiopian |
Ethiopian, addition |
Cherokee |
Consolidation Syllable. canada. Original |
Ogam |
Runes |
Tagalog |
Hanunóo |
Buid |
Tagbanuwa |
Khmer |
Mongolian |
Consolidation Syllable. canada. Native, adult |
Limbu |
Tai Le |
New Tai Lue |
Khmer symbols |
Buginese |
Lanna |
Combining diacritical marks, ext. |
Balinese |
Sundanese |
Batak |
Lepcha |
Ol Chiki |
Cyrillic, adult-C |
Georgian, extended |
Sundanese, addition |
Vedic extensions |
Phonetic extensions |
Phonetic extensions, additions |
COMBINING DIACRITICS, ADDITION |
Latin, further addition |
Greek, addition |
General punctuation |
Superscripts and subscripts |
Currency symbol |
Combining diacritical marks for symbols |
Letter-like symbols |
Number sign |
Arrows |
Mathematical operators |
Various technical signs |
Control character symbols |
Optical character recognition |
Enclosed alphanum. Sign |
Frame drawing |
Block elements |
Geometric shapes |
Different symbols |
Dingbats |
Different mathem. Symbols-A |
Additional arrows-A |
Braille characters |
Additional arrows-B |
Different mathem. Symbols-B |
Additional mathem. Operators |
Different symbols and arrows |
Glagolitic |
Latin, adult-C |
Coptic |
Georgian, addition |
Tifinagh |
Ethiopian, extended |
Cyrillic, adult-A |
Additional punctuation |
CJK Radicals, Supplement |
Kangxi radicals |
Ideographic descriptive symbols |
CJK symbols and punctuation |
Hiragana |
Katakana |
Bopomofo |
Hangeul-Jamo, Compatibility |
Kanbun |
Bopomofo, extended |
CJK bars |
Katakana, Phonetic Extensions |
Enclosed CJK signs and months |
CJK compatibility |
Consolidation CJK ideograms, adult A |
I-Ching hexagrams |
Consolidation CJK ideograms |
Yi syllable |
Yi radicals |
Lisu |
Vai |
Cyrillic, adult B |
Bamum |
Modifying Tones |
Latin, adult-D |
Syloti Nagri |
Common Indian numerals |
Phagspa |
Saurashtra |
Devanagari, ext. |
Kayah Li |
Rejang |
Hangeul-Jamo, ext.-A |
Javanese |
Burmese, adult B |
Cham |
Burmese, adult-A |
Tai Viet |
Meitei-Mayek, adult |
Ethiopian, adult-A |
Latin, adult-E |
Cherokee, addition |
Meitei-Mayek |
Hangeul syllable |
Hangeul-Jamo, adult B |
Private Use Zone |
CJK ideograms, compatibility |
Alphabetical forms of presentation |
Arabic forms of presentation-A |
Variant selectors |
Vertical shapes |
COMBINING Half Diacritics |
CJK Compatibility Forms |
Small variations in shape |
Arabic forms of presentation-B |
Half-width and full-width shapes |
Special
level 1
Linear B-syllable |
Linear B ideograms |
Aegean numerals |
Ancient Greek numerals |
Ancient symbols |
Phaistos Disc |
Lycian |
Carish |
Coptic numerals |
Old Italian |
Gothic |
Old Permian |
Ugaritic |
Old Persian |
Mormon alphabet |
Shaw alphabet |
Osmaniya |
Osage |
Albanian |
Alwan |
Linear A |
Cypriot |
Aramaic |
Palmyrenian |
Nabatean |
Hatra script |
Phoenician |
Lydian |
Meroitic hieroglyphics |
Meroitic-demotic |
Kharoshthi |
Old South Arabic |
Old North Arabic |
Manichean |
Avestisch |
Parthian |
Inscriptions-Pahlavi |
Psalter-Pahlavi |
Old Turkish |
Old Hungarian |
Hanifi Rohingya |
Rumi digits |
Yazidi |
Old Sogdian |
Sogdian |
Choral mix |
Elymaic |
Brahmi |
Kaithi |
Sorang-Sompeng |
Chakma |
Mahajani |
Sharada |
Sinhala numerals |
Khojki |
Multanic |
Khudabadi |
Grantha |
Neva |
Tirhuta |
Siddham |
Modes |
Mongolian, supplement |
Takri |
Ahom |
Dogra |
Varang Kshiti |
Dives Akuru |
Nandinagari |
Dsanabadsar square script |
Soyombo |
Pau Cin Hau |
Bhaiksuki |
Fairy tales |
Masaram Gondi |
Gunjala Gondi |
Makassar |
Lisu, addition |
Tamil, addition |
Cuneiform writing |
Cuneiform numerals and punctuation |
Early cuneiform |
Egyptian hieroglyphics |
Egypt. Hieroglyphic Control Characters |
Anatolian hieroglyphics |
Bamum, addition |
Mro |
Bassa Vah |
Pahawh Hmong |
Medefaidrin |
Pollard font |
Ideographic symbols and punctuation |
Xixia |
Xixia components |
Little Kitan script |
Xixia, addition |
Kana, addition |
Kana, exp.-A |
Small kana, extended |
Women's script |
Duployé shorthand |
Shorthand control characters |
Byzantine notes |
Notation |
Ancient Greek notes |
Maya numerals |
Tai Xuan Jing symbols |
Counter digits |
Mathem. alphanum. Symbols |
SignWriting |
Glagolitic, addition |
Nyiakeng Puachue Hmong |
Wancho |
Mende font |
Adlam |
Indian Siyaq numerals |
Ottoman Siyaq Numbers |
Arab. mathem. alphanum. Symbols |
Mahjongg stones |
Dominoes |
Playing cards |
Additional enclosed alphanum. Sign |
Additional enclosed CJK characters |
Various pictographic symbols |
Smileys |
Decorative symbols |
Traffic and map symbols |
Alchemical symbols |
Geometric shapes, exp. |
Additional arrows-C |
Additional pictographic symbols |
Chess symbols |
Pictographic symbols, exp.-A |
Retro computer icons
Levels 2 and 3
Consolidation CJK ideograms, adult B |
Consolidation CJK ideograms, adult C |
Consolidation CJK ideograms, adult D |
Consolidation CJK ideograms, adult E |
Consolidation CJK ideograms, adult F |
CJK ideograms, compatibility, addition |
Consolidation CJK ideograms, adult G
Levels 14 to 16
Tags |
Variant selectors, supplement |
Additional private use area – A |
Additional private use area – B
Alphabetical order
Adlam |
Aegean numerals |
Egyptian hieroglyphics |
Egypt. Hieroglyphic Control Characters |
Ahom |
Albanian |
Alchemical symbols |
Common Indian numerals |
General punctuation |
Alphabetical forms of presentation |
Ancient symbols |
Ancient Greek notes |
Ancient Greek numerals |
Old Italian |
Old North Arabic |
Old Permian |
Old Persian |
Old Sogdian |
Old South Arabic |
Old Turkish |
Old Hungarian |
Alwan |
Anatolian hieroglyphics |
Arabic |
Arabic, addition |
Arabic, adult-A |
Arab. mathem. alphanum. Symbols |
Arabic forms of presentation-A |
Arabic forms of presentation-B |
Aramaic |
Armenian |
Ethiopian |
Ethiopian, extended |
Ethiopian, adult-A |
Ethiopian, addition |
Avestisch |
Balinese |
Bamum |
Bamum, addition |
Basic Latin |
Bassa Vah |
Batak |
Bengali |
Bhaiksuki |
Burmese |
Burmese, adult-A |
Burmese, adult B |
Block elements |
Bopomofo |
Bopomofo, extended |
Brahmi |
Braille characters |
Letter-like symbols |
Buginese |
Buid |
Byzantine notes |
Chakma |
Cham |
Cherokee |
Cherokee, addition |
Choral mix |
CJK ideograms, compatibility |
CJK ideograms, compatibility, addition |
CJK compatibility |
CJK Compatibility Forms |
CJK Radicals, Supplement |
CJK bars |
CJK symbols and punctuation |
Devanagari |
Devanagari, ext. |
Dingbats |
Phaistos Disc |
Dives Akuru |
Dogra |
Dominoes |
Dsanabadsar square script |
Duployé shorthand |
Elymaic |
Women's script |
Early cuneiform |
Geometric shapes |
Geometric shapes, exp. |
Georgian |
Georgian, addition |
Georgian, extended |
Glagolitic |
Glagolitic, addition |
Gothic |
Grantha |
Greek and Coptic |
Greek, addition |
Gujarati |
Gunjala Gondi |
Gurmukhi |
Half-width and full-width shapes |
Hangeul Jamo |
Hangeul-Jamo, ext.-A |
Hangeul-Jamo, adult B |
Hangeul-Jamo, Compatibility |
Hangeul syllable |
Hanifi Rohingya |
Hanunóo |
Hatra script |
Hebrew |
Hiragana |
Superscripts and subscripts |
Ideographic descriptive symbols |
Ideographic symbols and punctuation |
I-Ching hexagrams |
Indian Siyaq numerals |
Inscriptions-Pahlavi |
IPA extensions |
Javanese |
Yazidi |
Kaithi |
Kana, addition |
Kana, exp.-A |
Kanbun |
Kangxi radicals |
Kannada |
Carish |
Katakana |
Katakana, Phonetic Extensions |
Kayah Li |
Cuneiform writing |
Cuneiform numerals and punctuation |
Kharoshthi |
Khmer |
Khmer symbols |
Khojki |
Khudabadi |
Small variations in shape |
Small kana, extended |
Little Kitan script |
Combining diacritical marks for symbols |
COMBINING DIACRITICS |
COMBINING DIACRITICS, ADDITION |
Combining diacritical marks, ext. |
COMBINING Half Diacritics |
Coptic |
Coptic numerals |
Shorthand control characters |
Cypriot |
Cyrillic |
Cyrillic, addition |
Cyrillic, adult-A |
Cyrillic, adult B |
Cyrillic, adult-C |
Lanna |
Lao |
Latin, adult-A |
Latin, adult B |
Latin, adult-C |
Latin, adult-D |
Latin, adult-E |
Latin, further addition |
Latin-1, addition |
Lepcha |
Limbu |
Linear A |
Linear B ideograms |
Linear B-syllable |
Lisu |
Lisu, addition |
Lydian |
Lycian |
Mahajani |
Mahjongg stones |
Makassar |
Malayalam |
Mandaean |
Manichean |
Fairy tales |
Masaram Gondi |
Mathem. alphanum. Symbols |
Mathematical operators |
Maya numerals |
Medefaidrin |
Meitei-Mayek |
Meitei-Mayek, adult |
Mende font |
Meroitic-demotic |
Meroitic hieroglyphics |
Modes |
Modifying Tones |
Mongolian |
Mongolian, supplement |
Mormon alphabet |
Mro |
Multanic |
Nabatean |
Nandinagari |
New Tai Lue |
Neva |
N'Ko |
Notation |
Nyiakeng Puachue Hmong |
Ogam |
Ol Chiki |
Optical character recognition |
Oriya |
Osage |
Ottoman Siyaq Numbers |
Osmaniya |
Pahawh Hmong |
Palmyrenian |
Parthian |
Pau Cin Hau |
Arrows |
Phagspa |
Phonetic extensions |
Phonetic extensions, additions |
Phoenician |
Pictographic symbols, exp.-A |
Pollard font |
Private use area |
Additional private use area-A |
Additional private use area-B |
Psalter-Pahlavi |
Frame drawing |
Rejang |
Rumi digits |
Runes |
Samaritan |
Saurashtra |
Chess symbols |
Sharada |
Shaw alphabet |
Siddham |
Sinhala |
Sinhala numerals |
Smileys |
Sogdian |
Soyombo |
Sorang-Sompeng |
Spacing Modifier Letters |
Special |
Playing cards |
Sundanese |
Sundanese, addition |
SignWriting |
Syloti Nagri |
Retro computer icons |
Control character symbols |
Syriac |
Syriac, addition |
Tagalog |
Tagbanuwa |
Tags |
Tai Le |
Tai Viet |
Tai Xuan Jing symbols |
Takri |
Tamil |
Tamil, addition |
Telugu |
Thaana |
Thai |
Tibetan |
Tifinagh |
Tirhuta |
Ugaritic |
Enclosed alphanum. Sign |
Enclosed CJK signs and months |
Vai |
Varang Kshiti |
Variant selectors |
Variant selectors, supplement |
Vedic extensions |
Consolidation CJK ideograms |
Consolidation CJK ideograms, adult A |
Consolidation CJK ideograms, adult B |
Consolidation CJK ideograms, adult C |
Consolidation CJK ideograms, adult D |
Consolidation CJK ideograms, adult E |
Consolidation CJK ideograms, adult F |
Consolidation CJK ideograms, adult G |
Consolidation Syllable. canada. Original |
Consolidation Syllable. canada. Native, adult |
Traffic and map symbols |
Different mathem. Symbols-A |
Different mathem. Symbols-B |
Various pictographic symbols |
Different symbols and arrows |
Different symbols |
Various technical signs |
Vertical shapes |
Currency symbol |
Wancho |
Xixia |
Xixia, addition |
Xixia components |
Yi radicals |
Yi syllable |
Counter digits |
Number sign |
Decorative symbols |
Additional punctuation |
Additional mathem. Operators |
Additional arrows-A |
Additional arrows-B |
Additional arrows-C |
Additional pictographic symbols |
Additional enclosed alphanum. Sign |
Additional enclosed CJK characters
<img src="https://de.wikipedia.org//de.wikipedia.org/wiki/Special:CentralAutoLogin/start?type=1x1" alt="" title="" width="1" height="1" style="border: none; position: absolute;">