Structural formula of an AT base pair with two dashed blue hydrogen bonds.
Structural formula of a GC base pair with three dashed blue hydrogen bonds.

A base pair is a pair of opposing nucleobases in the double strand of a double-stranded nucleic acid (DNA or RNA, then referred to as dsDNA or dsRNA, for double-stranded) that are complementary to each other and held together by hydrogen bonds.

While the length of single-stranded nucleic acids (ssRNA or ssDNA, for single-stranded) is given as the number of nucleotides (nt) or bases (b), and for synthetic molecules sometimes with the general suffix -mer for the number of units in a chain molecule, the size of double-stranded DNA sections is usually given in base pairs, abbreviated bp:

  • 1 nt = one nucleotide
  • 1 bp = one base pair
  • 1 kbp or kb = 1000 base pairs (kilobase pairs)
  • 1 Mbp or Mb = 1,000,000 base pairs (megabase pairs)
  • 1 Gbp or Gb = 1,000,000,000 base pairs (gigabase pairs)
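
The units above can be applied mechanically. The following is a minimal sketch, assuming a hypothetical helper named format_bp that picks the largest unit from the list that fits a given base-pair count; the name and the rounding behaviour are illustrative choices, not a standard.

```python
def format_bp(n_bp: int) -> str:
    """Format a base-pair count with the units bp, kbp, Mbp or Gbp."""
    # Largest unit first, so the first factor that fits is used.
    units = [(1_000_000_000, "Gbp"), (1_000_000, "Mbp"), (1_000, "kbp")]
    for factor, unit in units:
        if n_bp >= factor:
            # ":g" keeps up to six significant digits and drops trailing zeros.
            return f"{n_bp / factor:g} {unit}"
    return f"{n_bp} bp"


print(format_bp(16_600))         # size of a mitochondrial genome
print(format_bp(3_200_000_000))  # size of the haploid human genome
```

With these inputs the helper prints "16.6 kbp" and "3.2 Gbp".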

The unreplicated haploid human genome in the nucleus of a germ cell comprises over 3 billion base pairs, about 3.2 Gbp, distributed over 23 chromosomes (1n; 1c). A somatic cell of the human body usually contains a diploid (double) set of nuclear chromosomes, or about 6.4 Gbp in 46 chromosomes (2n; 2c). This set is duplicated before a cell division, so that each of the 46 chromosomes consists of two chromatids, identical copies with the same genetic information, before nuclear division begins as mitosis; the nucleus then holds about 13 Gbp (2n; 4c). In addition to this nuclear DNA (nDNA), most human cells, as in all eukaryotes, contain a further genome (mitogenome) in each mitochondrion, each about 16.6 kbp (mitochondrial DNA, mtDNA). An exception are the mature red blood cells, which in humans, as in all mammals, have neither a nucleus nor mitochondria. Plant cells additionally contain the plastid genome (plastome) of their chloroplasts (abbreviated ctDNA or cpDNA).
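
The figures for the nuclear genome follow from simple doubling. As a rough sketch, assuming the rounded haploid size of 3.2 Gbp from the text and a hypothetical function nuclear_bp, the three states (1n; 1c), (2n; 2c) and (2n; 4c) can be computed as follows:

```python
# Approximate haploid nuclear genome size (1n; 1c), rounded as in the text.
HAPLOID_BP = 3_200_000_000


def nuclear_bp(ploidy: int, replicated: bool) -> int:
    """Approximate nuclear DNA content in base pairs.

    ploidy     -- number of chromosome sets (1 = haploid, 2 = diploid)
    replicated -- True if every chromosome already consists of two chromatids
    """
    chromatids_per_chromosome = 2 if replicated else 1
    return HAPLOID_BP * ploidy * chromatids_per_chromosome


germ_cell = nuclear_bp(ploidy=1, replicated=False)     # 3.2 Gbp
somatic_cell = nuclear_bp(ploidy=2, replicated=False)  # 6.4 Gbp
before_mitosis = nuclear_bp(ploidy=2, replicated=True) # 12.8 Gbp, about 13 Gbp
```

The mitochondrial genomes are left out here because their number per cell is not given in the text.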

The number of base pairs is also an important measure of the amount of information stored in a gene. Since each base pair represents a choice among 4 possible forms, 1 bp corresponds to an information content of 2 bits, that is, twice that of a single binary digit. However, only a small part of the DNA in the human genome carries the genetic information for the construction of proteins; over 95% is non-coding and often consists of repetitive elements.
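
The 2-bit figure is the base-2 logarithm of the number of possible forms. The sketch below, with the hypothetical function information_bits, works this out for a whole genome. It assumes that all four forms are equally likely and independent of each other, so the result is an upper bound rather than a measured value.

```python
import math


def information_bits(n_bp: int, alphabet_size: int = 4) -> float:
    """Upper bound on the information content of n_bp base pairs, in bits."""
    # log2(4) = 2 bits per base pair.
    return n_bp * math.log2(alphabet_size)


haploid_genome_bits = information_bits(3_200_000_000)
haploid_genome_bytes = haploid_genome_bits / 8  # 8 bits per byte

print(haploid_genome_bits)   # 6400000000.0 bits
print(haploid_genome_bytes)  # 800000000.0 bytes, i.e. about 800 MB
```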


Base pairing plays an essential role in DNA replication, in transcription and translation during protein biosynthesis, and in the various forms of secondary and tertiary structure of nucleic acids.

Pairing rules

Base pairs in a double strand of DNA

A base pair is formed by hydrogen bonds between two nucleobases. One of the purine bases guanine or adenine pairs with one of the pyrimidine bases cytosine, thymine or uracil. In complementary base pairing between two strand sections of nucleic acids, guanine pairs with cytosine, and adenine pairs with thymine (in DNA) or uracil (in RNA). This can result in the following pairings:
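
The complementary pairing rule for DNA can be written as a lookup table. The following is a minimal sketch with the hypothetical names DNA_COMPLEMENT, complement and reverse_complement. It relies on one fact not stated above: the two strands of a double helix run in opposite directions, so the partner strand read in its own 5' to 3' direction is the reversed complement.

```python
# Pairing rule for DNA: A pairs with T, G pairs with C.
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(strand: str) -> str:
    """Return the base opposite each position of the given DNA strand."""
    return "".join(DNA_COMPLEMENT[base] for base in strand.upper())


def reverse_complement(strand: str) -> str:
    """Return the partner strand read in its own 5' to 3' direction."""
    return complement(strand)[::-1]


print(complement("ATGC"))          # TACG
print(reverse_complement("ATGC"))  # GCAT
```

Any character other than A, T, G or C raises a KeyError, which is a deliberate choice in this sketch so that invalid input does not pass silently.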


Watson-Crick pairings

As early as 1949, the Austrian biochemist Erwin Chargaff established that the bases adenine (A) and thymine (T) always occur in DNA in a ratio of 1:1, as do the bases guanine (G) and cytosine (C). In contrast, the ratio A:G or C:T varies greatly (Chargaff's rules).
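
The 1:1 ratios follow directly once every A faces a T and every G faces a C. As an illustration only, the sketch below counts the bases over both strands of a made-up sequence; the function name base_counts_double_strand and the example sequence are hypothetical.

```python
from collections import Counter

# Translation table for the pairing partner of each DNA base.
PARTNER = str.maketrans("ATGC", "TACG")


def base_counts_double_strand(strand: str) -> Counter:
    """Count the bases over a strand and its complementary strand together."""
    strand = strand.upper()
    return Counter(strand + strand.translate(PARTNER))


counts = base_counts_double_strand("AAGCTA")
print(counts["A"], counts["T"])   # 4 4, ratio A:T = 1:1
print(counts["G"], counts["C"])   # 2 2, ratio G:C = 1:1
print(counts["A"] / counts["G"])  # 2.0, the A:G ratio depends on the sequence
```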

From this, James D. Watson and Francis Harry Compton Crick concluded that AT and GC each form complementary base pairs.

Base pairings also occur in tRNA and rRNA when the nucleotide strand forms loops, so that complementary base sequences come to lie opposite each other. Since RNA incorporates uracil instead of thymine, the pairings are AU and GC.
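
Whether two sections of the same RNA strand can pair after the strand folds back on itself can be checked position by position. The following is a simplified sketch, assuming the hypothetical function can_form_stem and a made-up hairpin sequence. It considers only the AU and GC pairings and ignores loop length and thermodynamic stability.

```python
# Watson-Crick pairings in RNA, in both orientations.
RNA_PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def can_form_stem(left: str, right: str) -> bool:
    """True if `left` pairs base by base with `right` read backwards.

    When a strand folds back on itself, the first base of the left
    section faces the last base of the right section, and so on.
    """
    if len(left) != len(right):
        return False
    return all((a, b) in RNA_PAIRS for a, b in zip(left, reversed(right)))


# Made-up hairpin: left section GGAC, loop UUCG, right section GUCC.
print(can_form_stem("GGAC", "GUCC"))  # True
print(can_form_stem("GGAC", "GGAC"))  # False
```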

Unusual pairings

Unusual pairings occur mainly in tRNAs and in triple helices. Although they pair the same bases as the Watson-Crick scheme, they form different hydrogen bonds. Examples are reverse Watson-Crick pairings, Hoogsteen pairings (named after Karst Hoogsteen, born 1923) and reverse Hoogsteen pairings.

reverse GC pairing
reverse AU pairing
AU Hoogsteen pairing
reverse AU Hoogsteen pairing

Wobble pairings

The name refers to the wobble hypothesis proposed by Francis Crick (1966). Wobble pairings are the non-Watson-Crick pairings GU (or GT) and AC:
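
The distinction between Watson-Crick and wobble pairings can be summarised as two small sets. The sketch below uses the hypothetical function classify_pair and looks only at which two bases are involved. It cannot tell a Hoogsteen or reverse pairing from a standard one, since those differ in their hydrogen bonds and not in the bases.

```python
# Unordered pairs, so that ("G", "U") and ("U", "G") are treated alike.
WATSON_CRICK = {frozenset("AT"), frozenset("AU"), frozenset("GC")}
WOBBLE = {frozenset("GU"), frozenset("GT"), frozenset("AC")}


def classify_pair(base1: str, base2: str) -> str:
    """Classify two bases as 'Watson-Crick', 'wobble' or 'other'."""
    pair = frozenset((base1.upper(), base2.upper()))
    if pair in WATSON_CRICK:
        return "Watson-Crick"
    if pair in WOBBLE:
        return "wobble"
    return "other"


print(classify_pair("G", "C"))  # Watson-Crick
print(classify_pair("U", "G"))  # wobble
print(classify_pair("A", "A"))  # other
```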

GU wobble pairing
reverse GU wobble pairing
AC wobble pairing
reverse AC wobble pairing

