SIR model

from Wikipedia, the free encyclopedia
Graphs of the three groups S, I and R with the starting values and infection rate and rate for the group R . The rates are in the unit day.
If neither medication nor a vaccination are available, the only way to reduce the number of infections is through appropriate measures. This animation shows the effect of reducing the infection rate by 76% (from to ) (the other parameters are as in the graphic above and ). The information on is adapted to the article text, the values ​​in the figure correspond with N = 1000.

In mathematical epidemiology , a branch of theoretical biology , the SIR model (susceptible-infected-removed model) is a classic approach to describing the spread of infectious diseases with immunity development, which is an extension of the SI model . It is named after the grouping of the population into susceptible (S), infected (I) and people removed from the infection process (R), as explained below. The expansion of the SIR model to include those exposed, i.e. people who are infected but not yet contagious, is described with the SEIR model . Usually, a deterministic model, formulated by common differential equations linked to one another, is considered, in which the variables are continuous and correspond to large populations, but other, in particular stochastic models are also referred to as SIR, which have the grouping in common with the deterministic SIR model.

The model comes from William Ogilvy Kermack and Anderson Gray McKendrick (1927) and is sometimes named after both (Kermack-McKendrick model). Despite the simplicity of the model, the authors were able to model the data of a plague epidemic in Bombay in 1905/06 well.

Differential equations

When SIR model, three groups are distinguished individuals: At the time indicated the number of the disease non-immune healthy ( s usceptible worth individuals), the number of infectious infected ( i nfectious worth individuals) and the number of "remote" from the disease process people ( r emoved individuals). The latter occurs either through recovery with acquired immunity to the disease or through death. In another reading it is the resistant people. Furthermore, let the total number of individuals be, that is:

To simplify the notation, the time dependency is omitted in the following. It applies to any time , , , .

A number of assumptions are made in the SIR model:

  • Each individual can be infected by a pathogen only once and then either becomes immune or dies.
  • Infected people are immediately contagious, an assumption that is not made in the SEIR model .
  • The respective rates are constant.
  • Those who died as a result of the infection are included in R, like those who were immunized.

Then the equations of the SIR model are:

With the rates :

Rate at which infected people recover or die in the unit of time (since the dead are also included in R)
general death rate per person in a population (ie "per capita")
Birth rate per person in a population (ie "per capita")
, the rate that indicates the number of new infections caused by a first infectious case per unit of time. is also referred to as the transmission rate or transmission coefficient.

can be further broken down :, with the contact rate and the likelihood of infection transmission on contact.

The infection rate (English: "force of infection") , that is, the probability per unit of time that a susceptible person will be infected is

where represents the proportion of infected people in the total population and thus the probability of contact with an infected person. People are infected per unit of time.

If one neglects the birth and death rates (N is then constant) the equations result:

The equations are similar to the Lotka-Volterra equations in predator-prey systems and coupled balance equations in many other areas ( replicator equations ).

In the literature, a variant of the equations is sometimes used in which the transmission coefficient is included and is often also referred to as although it has a different value : If, for example, and our coefficient is above , then the value must be inserted into the variant of the differential equation . If we use a different identifier for the sake of clarity , the first differential equation is written in the variant:

Base reproduction number and course of an epidemic

The base reproduction number is

The usual abbreviation for the basic reproduction number is used here (it should not be confused with the initial value of the number of resistant people at the time , which is sometimes also referred to as).

The base reproduction number indicates the number of further infections caused by an infected person (during the entire duration of their infectious period) in the beginning of the epidemic. In addition , there is also the factor that indicates the duration of the infectious period. In the beginning of an epidemic, birth and death rates can be roughly neglected, i.e. set; then one obtains the form of the SIR equations given at the end of the last section. For the beginning of an epidemic it must be and consequently (since applies at the beginning ) according to the SIR equations and thus (see also the following section about the discretized form of the equations)

As the process progresses, the number of infected people increases according to the SIR equations , if , so and with it

On the left is the product of the basic reproduction number and the proportion of the susceptible in the population. The latter is at the same time the probability of meeting an infectable person during contact. The inequality is synonymous with

The growth of the infected decreases (subsidence or end of the epidemic) if it falls below the value . With a value of the basic reproduction number this would be half of the population and with a third, so that in the former case half and in the latter case 2/3 of the population are infected or resistant, i.e. H. are no longer susceptible to infection; one then speaks of “ herd immunity ”.

In influenza , for example, the base reproduction numbers are usually between 2 and 3.

Discretized form of differential equations

The discretized form of the differential equations with time step is:

corresponds to the number of newly infected persons in the observation period , i.e. what appears in the official statistics as the number of newly infected people , whereby in practice corrections are made for delayed reporting and other things. A day is often selected and set as the time unit and observation period for the report .

For the derivation of the basic reproduction number, consider the discretized form (step ) of the differential equation for :

With the duration of the infectious period used for and according to the definition of the base reproduction number or net reproduction number :

where at the beginning of the epidemic, as is known, results

So is

and for , at the beginning of the epidemic , it results:

Mathematical treatment

With the help of the SIR model, we can determine for given initial values whether the course of the disease will lead to an epidemic . This question is equivalent to the question of whether the number of infected people is increasing at the time . Consider the derivation:

.

Here we call the threshold of an epidemic because of for all time the inequality for all this and for abating the epidemic:

.

for everyone .

In the SIR model, an epidemic occurs exactly when is. This is an essential statement of the model, also known as the threshold theorem. In order to start an epidemic, there must be a minimum density of infectables. Conversely, if the number of those who can be infected is pushed below this threshold during the course of the epidemic, the epidemic will expire.

Maximum number of infected people

From the above differential equations for and it follows:

.

Integration by separating the variables provides:

with the logarithm function . The function is a first integral of the system and is constant on the trajectories of the system in the phase space given by I and S. The maximum number of infected people is obviously for and with . With the above equation assuming :

If you place and and you get:

The equation for ("final size equation") also results from the first integrals :

from the values ​​for (with ) and (with ). The equation can be used to determine . In particular, it results for the solution , that is, there is no breakout.

Number of "survivors"

The question also arises as to whether the epidemic will be “survived” at all, that is, whether there will be susceptible substances left in the end. For this we calculate , i.e. with time towards infinity ( ). Analogously results from the above differential equations

, whose solution is, with the exponential function .

This obviously means that the entire population will not be infected. From it also follows . It turns out that at the end of an epidemic there is less of a lack of susceptibles than of infected people!

Approximations: Reduce the number of parameters

The square of the hyperbolic secant (red) occurring in the solutions of the SIR model has an even steeper slope than the hyperbolic secant itself

If we know the initial values , we can quickly determine the dynamics of a disease using the differential equations above. Often, however, it is precisely these constants that are difficult to determine, which is why we want to approximate the above equations in the following.

It follows immediately from the differential equations discussed

The equation simplifies to a riccatischen differential equation if through the first 3 summands of the Taylor series to is approximated:

so

where were introduced:

The function is the hyperbolic secant and the hyperbolic tangent , its inverse function.

This means that the differential equation for can be expressed with only three parameters:

So these three parameters are (with the initially exponential growth corresponding to the doubling time), the phase and . Depending on the data, the differential equation or the implicit equation for can be used.

If you place and you get and thus:

With you get an approximate value for the "extent of the epidemic" :

This gives the second part of the threshold theorem. Be with me at the beginning , then the "extent of the epidemic" is:

and . In the end, the number of susceptible people is reduced by compared to the level before the epidemic.

According to the last differential equation in the SIR model, the number of infected people is:

The curve of has the shape of a bell curve with an initially exponential increase. For example, Kermack and McKendrick found for the plague epidemic in Bombay 1905/06 (with almost always fatal outcome, so that a week was taken as the unit of time for the rates) with:

The good agreement made this a frequently cited example in mathematical epidemiology, but it has also been criticized.

David George Kendall found exact solutions for and the SIR model in 1956 , but the differential equations are mostly solved numerically.

Extension of the model

This SIRD model shows the time course of the four groups S, I, R and D for the starting values and the infection rate (identical to the values ​​in the graphic for the
SIR model above), a rate for group R and the mortality rate

If you want to consider the dead separately (instead of adding them to group R ), you can expand it to the SIRD model (Susceptible-Infected-Recovered-Deceased-Model) . Here, the group includes R only the individuals who have survived the disease and have become immune, and the dead form their own group D .

The following system of differential equations has to be solved:

At the time indicates the number of people who have recovered and the number of people who have died from the disease. Also means the rate at which infected people recover and the mortality rate at which infected people die. , and transmission rate have the same meaning as in the SIR model .

Another modification takes into account vaccination of newborns with a proportion :

There are also variants in which two (or more) population groups are considered, for example the interaction of a core group, which is particularly active in promoting an infection, with the rest of the population.

Stochastic SIR models are used to study smaller populations that cannot be treated well with deterministic models. Only integer values ​​of the population proportions and statistical distributions for the transition rates such as . The course of epidemics is not determined deterministically here, i.e. H. an epidemic can also be applied to stop, if by chance in the infection period (infectious period, d. e. the period in which a Infizierter can transmit the infection) there are no contacts. Usually simulations (usually with the Monte Carlo method ) are carried out several times with the same parameters and the results are then statistically evaluated.

A variant that takes quarantine measures and isolation measures such as social distancing into account was used to explain subexponential growth, i.e. growth of the infected in accordance with a power law in time, for Covid-19 in China from the end of January 2020 (SIR-X model). The differential equations are (with adaptation to the form of the SIR equations used here):

A newly introduced group of symptomatic infected people is in quarantine. The general measures to reduce contact are also described (social distancing, etc.) and affect infected and non-infected people equally, the special quarantine measures for infected people with the coefficient . The respective measures are no longer applicable . A new effective one results with an effective infection period . This new effective is smaller than . Another method to simulate the influence of isolating measures is to use approaches that vary over time.

See also

literature

  • NF Britton: Essential Mathematical Biology . 1st edition. Springer, Berlin 2003, ISBN 1-85233-536-X .
  • Michael Li: An introduction to mathematical modeling of infectious diseases, Springer, 2018

Web links

Individual evidence

  1. Kermack, McKendrick, A contribution to the mathematical theory of epidemics, Proc. Roy. Soc. A, Vol. 115, 1927, pp. 700-721
  2. ^ Sebastian Möhler: Spread of infectious diseases . ( tu-freiberg.de [PDF; accessed on March 12, 2020]).
  3. Regina Dolgoarshinnykh: Introduction to Epidemic Modeling . ( columbia.edu [PDF; accessed March 12, 2020]).
  4. a b c d e Eichner, Kretzschmar, Mathematische Modelle in der Infektionsepidemiologie, in A. Krämer, R. Reintjes (Eds.), Infektionsepidemiologie, Springer 2003, pp. 81–94
  5. Michael Li, An introduction to mathematical modeling of infectious diseases, Springer, 2018, Section 2.1, Kermick-McKendrick model.
  6. a b Viola Priesemann et al., Inferring change points in the spread of COVID-19 reveals the effectiveness of interventions , Science, May 15, 2020
  7. ^ Norman Bailey, The mathematical theory of infectious diseases, Griffin and Company 1975, p. 11, threshold theorem
  8. Odo Diekmann, Hans Heesterbeek, Tom Britton, Mathematical tools for understanding infectious disease dynamics, Princeton UP 2013, p. 15
  9. The following derivation with the associated formulas can be found e.g. B. in Michael Li, An introduction to mathematical modeling of infectious diseases, Springer, 2018, p. 45
  10. Kermack, McKendrick, A contribution to the mathematical theory of epidemics, Proc. Royal Soc. A, Volume 115, 1927, p. 714
  11. Nicholas Bacaer: The model of Kermack and McKendrick for the plague epidemic in Bombay and the type reproduction number with seasonality, Journal of Mathematical Biology, Volume 64, 2012, pp. 403-422. According to this, the constant parameters obtained are unrealistic and the plague epidemic occurred seasonally in Bombay from 1897 to at least 1911, coupled to the rat population, so that a more complex model is necessary
  12. ^ DG Kendall, Deterministic and stochastic epidemics in closed populations, Proc. Third Berkeley Symposium Math. Stat. & Prob., Vol. 4, 1956, University of California Press, pp. 149-165, Project Euclid
  13. Amenaghawon Osemwinyen, Aboubakary Diakhaby: Mathematical Modeling of the Transmission Dynamics of Ebola Virus . July 2015 ( researchgate.net [accessed March 12, 2020]).
  14. Benjamin Maier, Dirk Brockmann, Effective containment explains subexponential growth in recent confirmed COVID-19 cases in China, Science, April 8, 2020, online
  15. For example Q. Lin et al., A conceptual model for the coronavirus disease 2019 (COVID-19) outbreak in Wuhan, China with individual reaction and governmental action, Int. J. Infectious Diseases, Volume 93, 2020, pp. 211-216

Remarks

  1. This form (which does not occur) is used by Kermack and McKendrick for their SIR model. On p. 713 you use the functions for and for infection rate and the rate for group R and write
  2. For the sake of comparability, the differential equations given in the source have been brought into the same form as in the SIR model above. There is no uniform use of the parameters in the literature; next to the above DGL is z. B. also (infection rate named here for the sake of clarity ) common. The identity then applies . In the figures used above , and the descriptions on Wikimedia Commons refer to the latter variant of the DGL, so that the infection rates mentioned there are only the values ​​according to the nomenclature in the article (the captions have been adapted for the article ).