Cluster coefficient

from Wikipedia, the free encyclopedia

The clustering coefficient (engl. Clustering coefficient ) is in Graph theory , a measure of the formation of cliques and transitivity in a network . If all neighbors of a node are connected in pairs, i.e. each with each other, then they form a clique . This is synonymous with the concept of transitivity, because within a clique the following applies: If A is connected to B and A to C, then B and C are also connected. A distinction is made between the global cluster coefficient , which characterizes the entire network, and the local cluster coefficient , which characterizes an individual node.

Global cluster coefficient

Three nodes are connected to form a triangle at the top, three nodes form a “connected triple” at the bottom. The triangle has a global cluster coefficient of 1, since you count 1 triangle and 3 "connected triples"

The global cluster coefficient , also known as transitivity , is defined as the ratio of the number of triangles to “connected triples” in a network (see adjacent figure).

.

A triangle are three nodes that are all connected to each other. In contrast, a connected triple is a node A and a disordered pair of two nodes B and C, where A has edges to B and C. Each triangle thus forms 3 connected triples. The factor 3 in the numerator of the formula compensates for this. A network consisting of a single triangle receives the cluster coefficient only with a factor of 3 , which corresponds to a perfect clique.

Local cluster coefficient

The local cluster coefficient of a node in a graph , defined by Duncan Watts and Steven Strogatz , denotes in graph theory the quotient of the number of edges that actually run between its neighbors ( ) and the number of edges that could maximally run between these neighbors ( undirected graph :) :

This formula applies to an undirected graph; the factor 2 does not apply to a directed graph, since twice as many edges are possible between the neighbors as in an undirected graph.

Six node graph

In the graph opposite, node 1 has neighbors 2 and 5. Between these neighbors, an edge is possible and also present, so that the cluster coefficient is. The node 2 has 3 neighbors, between which 3 edges are possible, but only the neighboring nodes 1 and 5 are connected by an edge. The cluster coefficient is therefore . According to the calculation, the cluster coefficient of node 6, i.e. all nodes of degree 1, results in division zero by zero. In some implementations this is implemented with the value 1; with many nodes of this type an unnaturally high global cluster coefficient results. Other authors define the local cluster coefficient for isolated nodes and nodes of degree 1 . With the last-mentioned convention, the following local cluster coefficients up to :

.

The local cluster coefficient can also be equivalent as

To be defined.

The mean of all local cluster coefficients is often referred to as the global cluster coefficient:

.

This definition does not return the same value as the first definition of the global cluster coefficient ; the order of calculation of the triangle-to-triple ratio on the one hand and the averaging on the other hand is reversed. The difference is effectively the reverse of the operations of calculating the ratio of triangles and triples and averaging: the latter definition is the mean of the triangle-to-triple ratio, the former in some way calculates the ratio of the mean number of triangles to the mean Number of triples. Both definitions and can deliver very different results: For the network shown, and .

is easier to calculate on the computer, so it is widely used in numerical studies.

Small world networks - regardless of the definition chosen - usually have high cluster coefficients, whereas random graphs tend to have low ones .

See also

Web links

Commons : Clustering coefficient  - collection of images, videos and audio files

swell

  1. a b c d e M. EJ Newman: The Structure and Function of Complex Networks , SIAM Review 45 , 167 (2003), p. 183
  2. a b p Boccaletti, V. Latora, Y. Moreno, M. Chavez, and DU Hwang: Complex Networks: Structure and Dynamics, Physics Reports 424 , 175 (2006)
  3. ^ DJ Watts and Steven Strogatz : Collective dynamics of 'small-world' networks . In: Nature . 393, No. 6684, June 1998, pp. 440-442. bibcode : 1998Natur.393..440W . doi : 10.1038 / 30918 . PMID 9623998 .