code

from Wikipedia, the free encyclopedia

A code or code (German pronunciation [ koːt ]) is a mapping rule that uniquely assigns a character or a character sequence from a possibly different set of characters (set of images) to each character in a set of characters . For example, Morse code creates a relationship between letters and a sequence of short and long sound signals (and vice versa).

In communication science, a code denotes a language in the broadest sense . All communication is based on the exchange of information which is generated by the sender according to a specific code and which the receiver interprets according to the same code.

Electronically transmitted or stored data consist, for example, of a long series of zeros and ones ( bits ), the combination of which is defined as digits, characters and / or letters (e.g. 11000001 = 'A'). In another code the same bit combination could e.g. B. mean the number 193. Also for other types of data , e.g. B. Measured values ​​or the nucleobases in DNA strands is defined in codes, which manifestations mean what.

In coding theory , the elements that make up a code are called “code words”, and the symbols that make up the code words form an “alphabet”.

General meaning

In general, a code is an agreement about a set (set) of characters such as B. index, icons or symbols (meaning carriers or references) for the purpose of exchanging information. Information does not exist in “pure” form; it is always formulated in some way. In general terms, a code is a formulation of information in which the characters are subject to syntactic, semantic and pragmatic rules. This requires the following elements:

  1. at least one information-formulating entity (recorder / transmitter)
  2. At least one information-receiving entity (reader / receiver) - can in exceptional cases also be identical to (1)
  3. a concrete or abstract content to be transmitted, the information
  4. an agreement for the purpose of formulating information and, if necessary, transmitting information. This contains a set of meaning carriers or symbols that are known to both instances (1. and 2.) and, if applicable, rules for using the symbols

Human language, animal sounds, but also the semiochemical molecules in bacterial communication and plant communication (auxin) are codes of varying complexity and capacity. In the case of animal sounds, bacteria, fungus and plant communication, the "agreement" has arisen through natural evolution.

definition

A code above the alphabets and is an injective mapping (= coding)

,

maps the symbols of the alphabet to non-empty words above the alphabet . The elements of the image set are called code words of the code. The mapping defined by is called an extension of a code

,

where a word is above the alphabet . A code is said to be immediately decipherable if it does not contain a code word as the beginning of another code word. It is said to be clearly decipherable if its extension is injective. Every immediately decipherable code can also be clearly deciphered, but there are also unambiguously decipherable codes that cannot be deciphered immediately.

Efficiency

In the more specific, usual meaning, a code is an agreement on a set of symbols (references), for example between recorders (senders) and readers (receivers), in order to establish an efficient way of conveying frequent messages. It is used for low-cost (energy- and time-efficient) and secure, reliable recording and transmission of information. This is implemented differently depending on the type of information and the area of ​​application. Efficiency is achieved by reducing complexity , which also decreases the capacity of the code. For this purpose, simpler elements and combinations of these are used instead of the information in its present form. These combinations refer to the original information. The more often something is used, the less effort it should take to process an element.

Codes involve two aspects of efficiency :

  • On the one hand, they replace the original with a pure reference to it (what is designated is replaced by the code word) and therefore only need a store of references that guarantees recognizability and distinctness (freedom from errors).
  • Second, they mostly use a modular system in which more frequent references are provided by short (few-element) combinations and less frequent references are provided by longer (multi-element) combinations. This means that the simplest elements (components, characters ) are used or reused as often as possible (namely in almost all references).

On a larger scale, codes - regardless of their own sophistication - only become truly efficient through repeated use as an aid to transferring information . Because the comprehensibility of the reading system requires some preparatory organization (e.g. learning) in this too. Codes therefore become part of the culture of systems (e.g. national language, technical standards) and develop a certain persistence against changes. The energy minimization makes codes attractive for the transmission of information under special conditions: long distances, long periods of time, large number of recipients, transmission with minimal effort in emergency situations, transmission with the help of other systems that work in this energy and structural dynamics area.

The efficiency can be increased by:

  • The use of references, also in other areas, such as simulation .
  • Multiple consecutive references. An effect can also be achieved indirectly via a long chain.
  • The use of a modular system, if this encourages the use of more frequent simple elements.

Niklas Luhmann uses the term technicus codes as a designation for standardized differences ( leading difference ).

In communications engineering, one speaks of source coding .

redundancy

A code can be structured in such a way that the result into which it encodes information contains superfluous, i.e. redundant parts. That is, the generated sequence of symbols contains data that does not lead to the goal and, if decoded incorrectly, gives a contradicting result.

In the simplest case, the same thing is repeated again. But every increased predictability of characters is redundancy : For example, the u after q in written German are all redundant, since a u is always followed by a q . So no information would be lost if all u following a q were removed. However, the redundancy helps one to still be able to decipher and use important information despite a partial loss of data. So it protects against loss of information . It also enables falsified information to be recognized as such.

In communications engineering, one speaks of channel coding .

Examples of codes

Error-detecting codes

Secret codes

Computer science

See also

literature

Web links

Wiktionary: Code  - explanations of meanings, word origins, synonyms, translations
Commons : Codes  - collection of images, videos and audio files

Individual evidence

  1. Eva-Maria Krech et al .: German pronunciation dictionary. Walter de Gruyter, Berlin / New York 2009, ISBN 978-3-11-018202-6 . Max Mangold: The pronunciation dictionary. Duden Volume 6. Dudenverlag, Mannheim / Zurich 2005, ISBN 978-3-411-04066-7 .
  2. ^ DUDEN Informatik , 1993, ISBN 3-411-05232-5
  3. Introduction to coding theory (PDF; 246 kB) University of Paderborn
  4. ^ Jacobs, Jungnickel: Introduction to combinatorics . S. 132 .