Combinatorial game theory

from Wikipedia, the free encyclopedia

Combinatorial game theory is a branch of mathematics founded by John Horton Conway around 1970 , which deals with a special class of two-person games .

The characteristics of these games, also known as combinatorial games , are:

  • No coincidence .
  • There is no hidden information for a single player (as with playing cards ). ie there is perfect information .
  • It is drawn alternately.
  • The player who manages to make the last move wins.
  • Each game ends after a finite number of moves.

Such games, to which Nim and (after minor rule transformations) Go and Chess belong, open up interesting possibilities for mathematical analysis especially when they break down into components in which there is no mutual influence of the possible moves. Examples are nim heaps and some late endgame positions in go; Even in chess, some forced move positions in pawn endings can be interpreted in this way. The assembly of positions is also known as addition.

The mathematical meaning of combinatorial game theory results from the fact that the games of a subclass can be interpreted as numbers . It is possible to construct whole as well as real and even transfinite (i.e. infinitely large and infinitely small) numbers, the entirety of which is also called surreal numbers . Conversely, the games of combinatorial game theory appear as a generalization of surreal numbers .

Example: domineering

The game domineering is one of the games that can be dealt with in the context of combinatorial game theory. It is intended to serve as an illustration and provide concrete examples.

Rules of the game

In domineering, two players take turns placing dominoes on a chessboard-like playing field , with one stone covering exactly two fields that both must be free when placing. As is common in combinatorial game theory, the two players are referred to as left and right . The player on the left always places the dominoes shown in blue in the following figures vertically and the player on the right always places his stones shown in red horizontally on the board. As with all games of combinatorial game theory, the player who can no longer draw loses the game.

If you want to analyze a position, it obviously only depends on the number of empty fields, but not how the fields that are already occupied have been covered.

Sample game

A game on a 3 × 3 board could e.g. B. run as shown in the following figure: Links opened by placing the stone marked with "1". On the right, place the stone marked with a “2”, whereupon the game on the left decides the game by placing the stone marked with a “3”, because the right then no longer has space for a red stone to be placed horizontally.

A. B. C.
D. E. F.
G H I.
2 3
D. 1
G I.

Regarding the game given as an example, it remains to be noted that Links would even have had a possibility to move in the end position reached with the third move. In addition, the number of free fields, which determines all further possible moves, can be broken down into components, provided that these components are not connected by the side edges of fields. For example, the end position could be the decomposition

D.
G
I.

be made. This possibility of decomposing a position or the reverse process of juxtaposing positions is a very important property of combinatorial games (see sum of games ).

Mathematical formalization

For a mathematical modeling , every possible game rule within combinatorial game theory is described by mathematical objects , namely in particular by sets . For this purpose, the moves of the two players must be mathematically and formally characterized for each position.

This characterization of the possible moves could be represented especially in the game domineering for each position by the amount of free fields. General, d. H. for any combinatorial game, the possible moves of a position can be characterized by two sets and , each of which contains the positions that the player in question can reach by moving. In the sense of the mathematical model, each position becomes a combinatorial game, which is defined recursively by those games that start with the positions that can be reached by a train.

General notation

The notation has changed from z. B. naturalized for the couple if the player on the left has the two options to move to and only the option to move to the right :

A.
B.
C. D.
C. D.
A.
D.
A.
B.

Based on this notation and the respective identification of a position with the game that starts from this position, one arrives at the following definition

Definition of a game

A game (in the sense of playing position ., Eng game ) is the following self-referential defined structure:

Construction rule
If and are two sets of games , then there is a game .

The quantities and are also called the left and right options of the game and represent the game positions that the left or right player can reach with one move from the current game position.

As already explained for the initial example, to shorten the notation, the options are usually listed directly and the quantity brackets are omitted:

Elementary games

The foundation of this recursive definition is game 0, in which neither player can move anymore (the set of left and right options is empty). The corresponding position in domineering is the 1x1 “board”.

A.

The next step is to find more games:

A.
B.
A. B.
A. B.
C.

Prize tiers

With regard to the profit prospects that can be achieved with good strategies, each position belongs to exactly one of the following four classes:

  • Links can win with optimal play, regardless of who starts ( positive )
  • Right can win the game regardless of who starts ( negative )
  • The player making the first move has a winning strategy ( fuzzy )
  • The following player has a winning strategy ( zero )
Left begins
Left wins Right wins
Right begins Left wins Positive
(left always wins)
Zero
(the player following wins)
Right wins Fuzzy
(the attractive one wins)
Negative
(right always wins)

The elementary games already defined can be classified as follows: 1 is positive, −1 is negative, 0 is a zero game and * is fuzzy.

Many of the games examined break down in the course of the game into independent individual components (endgames in Go, rows in Nim). These sub-games are often simple enough to be fully analyzed. Therefore, a central question of combinatorial game theory is how the winning prospects of the overall game can be derived from the information on the partial games. The temperature theory provides an algorithm with which one can play approximately optimally. The maximum deviation from the theoretically best strategy can be limited.

In some cases you can already draw conclusions from the prize categories:

  • If the two components of a game are positive, the entire game is positive. The same goes for negative games.
  • Null games have the property that as a component they have no influence on the outcome of the game and leave the prize category unchanged. Two games that differ by only one zero game are therefore considered equivalent.

Sum of games

The sum of two games is a central concept of combinatorial game theory, with which the juxtaposition of positions known from the Nim is formalized and generalized: A Nim position consists of several piles of game pieces, and with each move exactly one pile is selected where the current one Train is carried out. The move options only depend on the selected pile. In addition, each move leaves the unselected piles unchanged.

Similarly, one can together two games by adding that they are next to each other and the sets of moves defined such that the moving player select one of the two terms has to be there to perform a permissible train. In particular, a player who cannot move in any of the summands has no more opportunity to move, so that he has lost.

Formally it is the sum of the games and recursively defined as

Comparison of two games

In order to be able to compare two games G and H with one another, the prize class of the difference game is determined . The inverse corresponds to a reversal of the roles of left and right. In domineering, this can be achieved by turning the board 90 degrees. Is formal

This definition is also recursive.

If the difference game is two games and a zero game, the total has the same prize class as the total for each game . Therefore, and are considered equivalent. Under identification of equivalence class and representative one writes:

if it is a null game.

The following is also defined:

if is positive.
if is negative.
if is fuzzy.

such as

if or
if or

According to this symbolism, a "bigger" game is always advantageous for the left player.

Alternatively, the order relations can also be introduced in a more formal way, without reference to the concept of the optimal style of play, which is not detailed.

With these operators and relations, the games (modulo equivalence) fulfill the properties of a commutative group with partial order . Strictly speaking, the totality of all games can not be a group as a real class , since the definition of a group only extends to sets. However, essential relationships that can be derived from the group property in the case of sets can also be transferred to classes.

Simplification of games and normal form

A game can be simplified with the following two rules without changing the value of the game (i.e. without leaving the equivalence class).

Dominated options
Are two left options comparable in the game , e.g. B. So is a dominated option. Correspondingly, a right option is dominated by, if .
Dominated options can be left out without changing the value of a game:

Seen clearly, this corresponds to a game situation in which a player has a move that is clearly worse than an alternative move. Under no circumstances will the player choose the worse move - so it makes no difference to the outcome of the game whether the bad move exists or not.

Reversible trains
In a game , the right player's move to is called reversible if there is a left option of with . Ie is at least as good as the original position for links. (Similarly, a move by the left player to is reversible if there is a right option of with .)
For a reversible move, the value of the game does not change if one assumes that the move is always answered immediately by the opponent:
Is in the game the right option on reversible so true
.
If the left option is reversible via , then applies
.

Reversible moves can make sense. The player should continue to play in this partial position after the reversal, otherwise the exchange would be a pure loss.

For finite games (those that contain a finite number of positions) a repeated application of these two rules leads to the normal form of the game, which is unique for each equivalence class.

In the general case, the following statement can at least make: Have two games and neither dominated options, yet reversible trains, it follows from that and have the same left and right options. Ie for every left / right option of there is a left / right option of with .

Examples of higher level games

Four fields can be used to form games of the next level in domineering:

A.
B.
C.
D.
A.
D.
C.
D.
A.
D.
C.
D.

(The last simplification can be made because and thus is a dominated option.)

On the other hand is too

One defines

, etc.

d. H.

, etc.

The games can be interpreted as a kind of score: The left or right player has not only won, but could even make a certain number of further moves. In addition to the integer games, there are also those that can be interpreted as a score :

A.
B.
C. D.
C. D.
A.
D.
A.
B.
C. D.
A.
D.
A.
B.

One defines

.

This assignment is motivated by the following properties:

,

where the second relationship can be calculated as follows:

.

Furthermore, there are also games that do not behave like a played out score. They are "hot" in the sense that any player can improve their situation by moving in the appropriate position:

A.
B. C.
D.
C.
D.
A.
D.
C.
D.
A.
D.

On average has the same value as :

In contrast to , however, it cannot be classified precisely, but is fuzzy compared to values ​​in the defined interval:

The number line can therefore only be located in the range from 0 to 1 indefinitely.

Surreal numbers

Among the games there are those that can be identified with real numbers because they follow the same arithmetic. More precisely, the games called surreal numbers can be characterized as follows:

A surreal number is a game where
  • all left and right options are surreal numbers
  • applies to every left option and right option .

In the context of combinatorial game theory surreal numbers are also abbreviated number (Engl. Number ), respectively. Of the games already defined, and are numbers, but not and .

Numbers are characterized by the fact that they are totally ordered, i. H. for two numbers and either , or . Surreal numbers are a rich class with the properties of an ordered field , which not only contains the whole, rational and real numbers, but also has infinitesimal and infinite representatives. More on this in the main article on surreal numbers .

As a component in a game sum, numbers can be very easily classified strategically according to the following sentence:

Number Avoidance Sentence
If there is a surreal number and a game, but not a surreal number, in a sums game it is always an advantage to draw in the game if possible.

In other words, in a game made up of several components, players only draw in one component until a number (practically a final score) is reached. At the end, the remaining numbers are added up: If the sum is positive, left wins, if it is negative, right wins; if it is 0, the player who follows wins.

A direct consequence of the theorem is the following calculation rule:

Translation principle
If a number and not a number, then applies
.

It is assumed here that there is at least one left or right option. Then follows the identity, since the remaining options of the sum are dominated by the number-avoidance theorem.

Special case: neutral games

Games in which, in addition to the properties listed above, the options for moves are always identical for both players are called neutral games (the original English term impartial is sometimes also translated as objective ). In terms of profit prospects, each position in a neutral game belongs to one of the first two classes on the list above.

A complete analysis of a neutral game is always possible by assigning to each position an equivalent position of the normal Nim game consisting of just one pile . This possibility of reduction was found independently by Roland Sprague in 1935 and by Patrick Michael Grundy in 1940 and is therefore also known as the Sprague-Grundy theorem. The world chess champion and mathematician Emanuel Lasker had already discussed approaches to reduction .

The size of the Nim heap, which is assigned to a position of a neutral game, is also known as its Grundy number. The Grundy number of a position can be relatively easily recursive, i.e. H. from the Grundy numbers of the positions that can be reached in one move: It is equal to the smallest natural number that is not the Grundy number of a successor position.

Grundy numbers have the following two properties:

  • The attractive player has a winning strategy if and only if the Grundy number is not equal to 0.
  • The Grundy number of a sum, i.e. H. a position composed of components is equal to the XOR sum of the Grundy numbers of its components, which simplifies the calculation in such cases in terms of complexity .

The two properties generalize the winning strategy found in 1901 by Charles Leonard Bouton for the Nim game , according to which one should always draw so that the XOR sum of the pile sizes is equal to 0.

Application: Endgames at Go

Even if Go is actually not a game in which the last player to draw wins, games can be constructed in the sense of combinatorial game theory, which reproduce the usual Go rules and effectively lead to the same game result. The easiest way to do this is with the historical stone evaluation , in which both players fill the board with stones (except for two eyes per living group) and the player with the most stones on the board is declared the winner. With the additional rule of returning prisoners , the winner is also the person who can move last. Returning prisoners means that the players are not allowed to pass, but instead of taking a regular move they can return a previously caught stone to the opponent (the stone is thus out of the game).

The stone rating differs from the go rules that are common worldwide today in that a “tax” of two points is levied on each group. But the modern rule variants can also be mapped accordingly.

In addition to the conventions of classic combinatorial game theory according to John Conway, there are also alternative formalisms in which the scores and their properties in relation to the components of an (endgame) position are examined directly. This is pursued in the works from the 1950s by Milnor (1953), Hanner (1959), which already contain essential results from temperature theory, and continued by Berlekamp (1996).

In the book Mathematical Go (1994) by Berlekamp and Wolfe, the results of combinatorial game theory are applied to endgames in Go and the resulting game strategies are worked out. It contains findings and guidelines that were not previously part of the canon of knowledge of professional players and is considered one of the few Go publications by western authors who have received attention in Asia.

Remarks

  1. Proof : Let be a zero game and a game for which player A has a winning strategy. This player then also has a winning strategy for the composite game : For every move that the opponent makes, player A replies with the winning strategy of the player who followed. Otherwise he draws in , according to his winning strategy for .
  2. To this end, the following is defined:
    • no right option of exists with and no left option of exists with
    such as
    • ( and )
    • ( and not )
    • (neither nor )
    This recursive definition is consistent with the principle of the optimal style of play, because the following statements are equivalent for a game :
    • is positive or zero
    • When the right begins, the left can win
    • Right has no "winning move"
    • There is no right option of for which the following applies: if left begins, right can win.
    • There is no right option of being negative or zero
    Ie symbolically no right option of exists with .

See also

literature

Web links

Individual evidence

  1. ^ R. Sprague: About mathematical fighting games , Tôhoku Mathematical Journal, 41 (1935), pp. 438-444 ( online version ).
  2. ^ PM Grundy: Mathematics and games , Eureka. 27 (1940), pp. 9-11 ( online version ( Memento of September 27, 2007 in the Internet Archive )).
  3. Emanuel Lasker: Board games of the peoples. Berlin 1931, p. 177 ff.
  4. Quoted in excerpts in: Jörg Bewersdorff: Glück, Logic and Bluff: Mathematics in the game - methods, results and limits , 5th edition, Wiesbaden 2010, pp. 119–121
  5. CL Bouton: Nim, a game with a complete mathematical theory , Annals of Mathematics, (2) 3 (1901), pp. 35–39 ( online at JSTOR )
  6. Elwyn Berlekamp: Introductory overview of mathematical Go endgames , Combinatorial games, Lecture Notes AMS Short Course, Columbus / OH (USA) 1990, pp. 73-100 ( abstract in Zentralblatt MATH )
  7. John W. Milnor: Sums of positional games , Contrib. Theory of Games, II, Ann. Math. Stud., 28 (1953), pp. 291–301, doi : 10.1515 / 9781400881970-017 ( abstract in Zentralblatt MATH )
  8. Olof Hanner: Mean play of sums of positional games , Pacific Journal of Mathematics, 9 (1959), pp. 81-99 ( online version ).
  9. Elwyn Berlekamp: The economist's view of combinatorial games , Games of No Chance, 1996, pp. 365-405 ( online version )