Spaces
The spaces (including blank , blank space , esp. Switzerland. Leerschlag , Eng. Space [ speɪs ] blank [ blæŋk ]) is the demarcation of words or other information within a text. When inserting spaces, there are spaces between words and, in turn , spaces between words (width of spaces between words). By pressing the Leertasteon the computer keyboard a space is entered or set . In the case of a typewriter with which the space bar has been inserted, only the carriage is moved forward by one character position, which results in the desired space between the words. In the lead sentence were Spatien used for the spaces between words.
width
The standard word spacing (width of the inter-word space ) is a quarter of an square , so a "normal" space is usually a quarter of an square, although this depends on the font used. However, there are also spaces of other widths. For example, a narrow space , for example in abbreviations such as »z. B. « or to group numbers . In the case of normal spaces, the word spacing can be changed with various additional options, for example with justification or with marking by spacing .
development
Separation of words was not common in ancient texts ( scriptio continua ). The dense arrangement of the text forced the reader to read it carefully and in most cases aloud. The word division was only introduced in the 7th century. The clear spacing between words made it easier to recognize and understand the texts quickly.
Johannes Gutenberg only knew a word distance. He achieved the margin adjustment of the lines in his 42-line Bible with the help of letters of different widths. The word spacing corresponded to the interior ( hallmark ) of the small "n". From then on, typesetters were guided by him.
The invention of the typewriter brought another word spacing. It always corresponded to the same letter width.
use
If you examine German texts to see how many letters, punctuation marks, word marks and spaces they contain, it turns out that the spaces have the greatest frequency, before E as the most common letter.
Printing
Spaces are firstly needed to make a line for justification to stretch. They are also used in abbreviations such as B. used. The spelling of numbers is also used. a. reached with the help of the spaces (e.g. 123 456 789 ). For most fonts, the space between words corresponds roughly to the space that a narrow character (letter or punctuation mark) would take up. The size of the inter-word spaces remains constant in the flutter sentence , while it can vary in the justified sentence.
Three typical errors in the setting of spaces have been given separate names:
- Plenk : superfluous space before punctuation marks
- Klemp : omitted space after punctuation marks
- "Deppenleerzeichen" ( spaces in compound nouns ): incorrect spaces within compound nouns
In German, punctuation marks are generally never set with a leading space, but always followed by a space, certain dash characters (such as a line dash ) also have a leading space, others ( hyphen ) but generally without: A character set without spaces is called kompress .
Word processing, web typography and data processing
In modern word processing , classic typography meets web design , which creates additional complications:
Spaces belong to whitespace and are therefore to be regarded as control characters : They generally influence the parsing of any code in all respects by delimiting code blocks. In this sense, this also applies to the pure display of text : The words represent the information units to be read individually for the automatic line break that is common on the computer . Therefore, spaces are always break marks in the text, unless they are protected against them . In addition to the different widths of spaces, there are also protected spaces in web typography, where there is automatic break , at whose position at the end of the line a break is not allowed.
In some cases, word processing programs or operating systems automatically recognize suitable or inappropriate spaces ( autocorrection ), such as spaces at the beginning of paragraphs or in individual words (such as leading spaces in file names ).
In many programming languages and encodings, they are the termination sign of the individual commands - in others, however, they are generally ignored as whitespace: C, for example, uses them, Java does not, HTML ignores them (except in the text), but not the URI code, see below .
presentation
Lead type
In the lead sentence , the word spacing were exclusion generated: Typographic white or blank space was using reactive material ( "non-printing" quarter em wide space bands ) between the printed letters inserted (letters of words, etc.). There were spaces in different widths, e.g. B. also for digit grouping . However, this blank material did not count for the former translator of the " character ", it therefore was not a "blank character " in the modern sense.
typewriter
Even a typewriter does not actually have a space. Here was originally developed by blank space or blank firing spoken since there is no character is inserted, but only the so-called car one character position before being moved.
Desktop publishing and web typography
In the presentation of the data processing, the space generally is a white space ( whitespace ) .
For the purposes of the line break there are two types of spaces:
ASCII characters and other encodings
The space has the ASCII code 32
, Unicode U+0020
. There are special spaces for certain purposes, for example the non-breaking or non-breaking space , in ISO 8859-1 with the character code 160, HTML entity
:
code | Upheaval | HTML | Surname | Code block | example | description |
---|---|---|---|---|---|---|
U + 0020 | Yes | & # 32; | Space | Basic Latin | ▕ ▏ | Normal space, comparable to the ASCII character 0x20 |
U + 00A0 | No | & nbsp; | No break space | Latin-1 supplement | ▕ ▏ | Like U + 0020, but the line must not be broken here. |
U + 1680 | Yes | & # 5760; | Ogham Space Mark | Ogham | ▕ ▏ | Used for spaces between words in Ogham texts. Requires an Ogham font. |
U + 2002 | Yes | & ensp; | En space -or- groove | General Punctuation | ▕ ▏ | As wide as an en (half as wide as an em ) |
U + 2003 | Yes | & emsp; | Em Space -or- Mutton | General Punctuation | ▕ ▏ | As wide as an em |
U + 2004 | Yes | & # 8196; | Thick Space −or− Three-Per-Em Space |
General Punctuation | ▕ ▏ | As wide as 1 ⁄ 3 of em |
U + 2005 | Yes | & # 8197; | Mid space −or− four-per-em space |
General Punctuation | ▕ ▏ | As wide as 1 ⁄ 4 of em |
U + 2006 | Yes | & # 8198; | Six-Per-Em Space | General Punctuation | ▕ ▏ | As wide as 1 ⁄ 6 of em . In computer-aided typography, this is often the same as U + 2009. |
U + 2007 | No | & # 8199; | Figure space | General Punctuation | ▕ ▏ | In fonts with fixed-width digits ( table digits ), Figure Space corresponds to the width of a digit. |
U + 2008 | Yes | & # 8200; | Punctuation Space | General Punctuation | ▕ ▏ | As wide as a narrow distance to the next character following a punctuation mark like a comma. |
U + 2009 | Yes | & thinsp; | Thin space | General Punctuation | ▕ ▏ | Narrow space : as wide as 1 ⁄ 5 of the em (sometimes the same as U + 2006) |
U + 200A | Yes | & # 8202; | Hair space | General Punctuation | ▕ ▏ | Even narrower than U + 2009 |
U + 200B | Yes | & # 8203; | Zero-width space | General Punctuation | ▕ ▏ | Space without width : actually no space, it does not count as a whitespace character, but is an invisible space. Used for computer systems to indicate word boundaries when the words are not spelled separately. Therefore it does not count as a whitespace sign. In justified justification , however, spaces can arise between the words. In HTML you can use this character as an optional break in very long words, comparable to the <wbr> tag. However, this HTML tag is not supported by all browsers. |
U + 202F | No | & # 8239; | Narrow no-break space | General Punctuation | ▕ ▏ | Narrow space : does not break like U + 00A0, but narrower |
U + 205F | Yes | & # 8287; | Medium Mathematical Space | General Punctuation | ▕ ▏ | For math formulas |
U + 2060 | No | & # 8288; | Word Joiner | General Punctuation | ▕▏ | Like U + 200B, and therefore no whitespace character either, but without a line break. This character was introduced in Unicode 3.2 to take over the function of U + FEFF (see there) as a "word connector". |
U + 3000 | Yes | & # 12288; | Ideographic space |
CJK Symbols and Punctuation |
▕ ▏ | As wide as a Chinese, Japanese or Korean character. |
U + FEFF | No | & # 65279; | Zero Width No-Break Space = Byte Order Mark (BOM) |
Arabic Presentation Forms-B |
▕ ▏ | Mainly used to represent the byte order mark . Therefore it does not count as a whitespace sign. The use as a zero-width space without break is no longer recommended since Unicode 3.2. See also U + 2060. |
Representation of spaces
The Unicode Symbols for Control Characters block also defines three visual symbols that represent spaces, but are not themselves.
code | Glyph | Surname | description |
---|---|---|---|
U + 2420 | ␠ | symbol for space | In most supporting fonts, a glyph with the letters S P running diagonally from top left to bottom right for space . |
U + 2422 | ␢ | blank symbol | The small b with a dash stands for blank (English for "space"). It was introduced by IBM in the 1960s and is no longer used. |
U + 2423 | ␣ | open box | The open box is used, for example, in software documentation to graphically represent a space. It is also used for this in most cell phone keyboards. |
Use in HTML and relatives with CSS
Multiple consecutive spaces in the source text of HTML and similar documents are output as a single space in the page by a web browser if it is not explicitly prevented. This is done with the CSS property white-space and the value pre-wrap ( white-space:pre-wrap
).
The typographical white space can be defined here by spacing or blocking or scaling a space using CSS. The CSS property exists for word spacing word-spacing
and character spacing letter-spacing
; they accept numerical values followed by a dimension such as px, ex, em, rem, pt, ...
Example word spacing:
<div style="word-spacing:2em">Dieser Satz zeigt großen Wortabstand.</div>
- generated:
Dieser Satz zeigt großen Wortabstand.
Example character spacing:
<div style="letter-spacing:1ex">Dieser Satz zeigt großen horizontalen Zeichenabstand.</div>
- generated:
Dieser Satz zeigt großen horizontalen Zeichenabstand.
Web browsers create line breaks if necessary after spaces, if this is not explicitly prevented. With the help of the CSS property white-space and the value nowrap ( white-space:nowrap
), the break after a non-breakable space may be possible. a. be prevented. In HTML 3.2, the nowrap attribute was provided for this, but only for table cells.
Several different spaces are possible in HTML / XML:
HTML / XML entity | Description / comment | |
---|---|---|
by name / named |
numeric (decimal) |
|
 
|
Normal space; several consecutive ones in the source text are output as a single one | |
|
 
|
Unbroken spaces; Break occurs at the preceding normal space |
 
|
 
|
Space the width of the letter n |
 
|
 
|
Space the width of the letter m |
 
|
 
|
narrow space |
 
|
narrow break-protected space |
Coding in TeX
TeX provides the following commands for manually setting the spacing between characters :
Intermediate space to be represented | syntax | length | presentation |
---|---|---|---|
2 quad |
a\qquad b
|
2 square | |
1 quad |
a\quad b
|
1 square | |
normal text spacing |
a\ b
|
6 ⁄ 18 em | |
large gap |
a\;b
|
5 ⁄ 18 em | |
small gap |
a\,b
|
3 ⁄ 18 em | |
no gap |
ab
|
0 square | |
small negative space |
a\!b
|
- 3 ⁄ 18 em |
The large gap a\;b
and the small negative gap are a\!b
only available in math mode, but not in text mode. There, the space must be a\hspace{-1pt}b
specified using the horizontal spacing (e.g. ).
Spaces in addresses and path information
Spaces in URIs
Since a space cannot appear within URIs and terminates the URI (end of the character string), spaces in e.g. B. Folder or file names recoded in UTF8-compliant manner. The coding consists of the percent sign %
and a two-digit hexadecimal value (with multi-byte characters for each byte).
A space is also %20
specified. The older encoding application / x-www-form-urlencoded uses a plus sign +
.
Example:
The “Please read.pdf” file is on a web server in a folder called “Important documents” .
https://www.example.com/wichtige%20dokumente/bitte%20lesen.pdf
https://www.example.com/wichtige+dokumente/bitte+lesen.pdf
When the file is downloaded, the correct space is inserted again in its name.
Spaces for local files and folders
Especially in Unix and Unix systems , a backslash can be used in \
front of the space to inform the system that a space is part of the path (folder / file name) and does not end the character string.
Example:
~/public/wichtige\ dokumente/bitte\ lesen.pdf
Another option is to enclose paths with spaces in a pair of double or single quotation marks ( "
or '
).
Example:
"~/public/wichtige dokumente/bitte lesen.pdf"
~/public/'wichtige dokumente/bitte lesen.pdf'
Space in 8.3 filenames
In the FAT file system , spaces are actually not permitted in file names, but are written and accepted by some software (example OS / 2 on FAT volumes: EA DATA. SF. This file name for the extended attributes of a volume fulfills the 8.3 length convention, but contains spaces to protect the file from being accessed and changed. With more modern text editors such as Edit.com since MS-DOS version 7.0, such file names can also be read and the files can therefore be edited.). File names shorter than eight letters (and file name extensions shorter than three letters) are padded with spaces from left to right in the directory entries, but these do not belong to the file name.
In the ISO-9660 file system, spaces in file names can be replaced by underscores ("_").
French / English spacing
Since the invention of movable type printing, various typographical conventions have developed in languages using the Latin alphabet. French spacing is a convention that regulates the use of white space around punctuation marks. Spaces are inserted around most punctuation marks, while a single space follows the colon, semicolon and at the end of the sentence. Thus, there is no difference in the space between sentences and the words in the sentence.
A different convention is the so-called English spacing . Here, spaces around the punctuation marks are avoided, but an enlarged space is used after colon, semicolon and whole sentences. Particularly the horizontal distance between the block end and -early (Engl. Sentence spacing ) is in English and many others, on the representation in the Latin alphabet languages based not in use and should not with the line spacing (Engl. Line Spacing ) be confused, of the defines vertical space.
This convention has a long tradition. Before the 20th century, it was common in typesetting to use a single but widened space after the end of the sentence. With the introduction of the typewriter, two consecutive spaces were used between sentences to mimic this style of traditional typesetters. This design had emerged as the norm for a long time, especially in the English-speaking world, especially America.
Since then, various design guidelines and typographers have more or less clearly opposed this redundant punctuation.
In detail, there can be perfectly understandable reasons for two spaces at the end of a sentence:
- The use of some non-proportional fonts can make it easier to read.
- An improvement in legibility can also be achieved in the case of a narrow spacing, i.e. the general letter spacing.
- If certain abbreviations, which are written with a period, appear at the end of a sentence, the intelligibility can suffer.
But gradually the consistent use of just one space is also gaining ground in America. But only the editions from the years 2000 and 2008 of the style manual published by the Government Printing Office (GPO) were clearly helpful and recommended not using double spaces at the end of the sentence.
A major reason for this development is that the use of non-proportional fonts in normal correspondence has become rather uncommon. But there are niches. When writing code in a programming language, a font of the same thickness has great advantages over proportional font . Cambridge University employees have devoted themselves to creating a typographical standard for the C # language , following the formatting guidelines of English spacing .
Ultimately, however, the support of one or the other convention remains a matter of taste and is subject to changing fashions. In America in particular, the discussion about “better writing” is far from over, as numerous publications, forums and blogs on the subject show. The following link takes you to a French page with a summary of the differences in typographical conventions between French and English: Mémento de typographie anglaise à l'usage de rédacteurs francophones. Retrieved May 11, 2016 .
In the word processing program LaTeX , the text design can be influenced using the commands \ frenchspacing or \ nonfrenchspacing .
\ documentclass {scrartcl} \ usepackage [ngerman] {babel} \ begin {document} Dr. No ran. Didn't turn around. Mr. Bond caught up. \ nonfrenchspacing Dr. No ran. Didn't turn around. Mr. Bond caught up. \ end {document} |
Dr. No ran. Didn't turn around. Mr. Bond caught up. Dr. No ran. Didn't turn around. Mr. Bond caught up. |
In the example, the different treatment of the empty space after the end points becomes particularly clear. It is also noticeable that the point after "Dr" is recognized as the end of an abbreviation, while "Mr." is not understood as an abbreviation for the English word "Mister" and the point is treated accordingly as an end-of-sentence character with a wider distance from the next capital letter.
(Text display according to the settings of the online LaTeX editor from overleaf.com)
Web links
Individual evidence
- ^ Karl-Heinz Best : On the frequency of letters, spaces and other characters in German texts. In: Glottometrics 11, 2005, pp. 9-31. (PDF full text ) The article contains information on the frequency of spaces in 5 German-language texts, including 4 literary texts and a longer press release.
- ↑ Unicode Technical Note # 27
- ↑ Attributes without value assignment in the SELFHTML Wiki.
- ↑ typical value, cf. TeXbook, p. 433.
- ↑ Lexique des règles typographiques en usage à l'Imprimerie nationale, 3e édition, Imprimerie nationale, 1993.
- ^ Nelson, Julius (1949) Stylebook for Typists New York: Gregg Publishing Company.
- ^ University of Chicago Press 1911 The Chicago Manual of Style. P. 101.
- ^ Robert Bringhurst., The Elements of Typographic Style (Version 3.1), p. 28.
- ↑ James Felici, The Complete Manual of Typography, p. 80.
- ^ The Chicago Manual of Style (16th ed.), Rule 2.9.
- ↑ Style Manual of the Government Printing Office (GPO), paragraph 2, Leading and spacing, 2.48. Retrieved May 11, 2016 .
- ^ The Elements of C # Style. Retrieved May 11, 2016 .
- ^ Fogarty, Mignon (2008). Grammar Girl's Quick and Dirty Tips for Better Writing. New York: Get paperbacks. p. 85. ISBN 978-0-8050-8831-1 .; Kristi Leonard; et al. (February 1, 2003).
- ^ The Effects of Computer-based Text Spacing on Reading Comprehension and Reading Rate. Retrieved May 11, 2016 .