Hypoexponential distribution

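Hypoexponential
Parameters: $\lambda_1,\dots,\lambda_k>0$ rates (real)
Support: $x\in[0,\infty)$
PDF: Expressed as a phase-type distribution, $-\boldsymbol{\alpha}e^{x\Theta}\Theta\mathbf{1}$. Has no other simple form; see article for details
CDF: Expressed as a phase-type distribution, $1-\boldsymbol{\alpha}e^{x\Theta}\mathbf{1}$
Mean: $\sum_{i=1}^{k}1/\lambda_i$
Median: General closed form does not exist[1]
Mode: $(k-1)/\lambda$ if $\lambda_i=\lambda$ for all $i$
Variance: $\sum_{i=1}^{k}1/\lambda_i^{2}$
Skewness: $2\left(\sum_{i=1}^{k}1/\lambda_i^{3}\right)/\left(\sum_{i=1}^{k}1/\lambda_i^{2}\right)^{3/2}$
Excess kurtosis: no simple closed form
MGF: $\boldsymbol{\alpha}(tI+\Theta)^{-1}\Theta\mathbf{1}$
CF: $\boldsymbol{\alpha}(itI+\Theta)^{-1}\Theta\mathbf{1}$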

In probability theory, the hypoexponential distribution, or generalized Erlang distribution, is a continuous distribution that has found use in the same fields as the Erlang distribution, such as queueing theory, teletraffic engineering and, more generally, stochastic processes. It is called the hypoexponential distribution because its coefficient of variation is less than one, in contrast to the hyper-exponential distribution, whose coefficient of variation is greater than one, and the exponential distribution, whose coefficient of variation equals one.

Overview

The Erlang distribution is the distribution of a sum of $k$ independent exponential random variables, all with the same rate $\lambda$. The hypoexponential generalizes this to a sum of $k$ independent exponential random variables, each with its own rate $\lambda_i$, the rate of the $i$th exponential distribution. If $X_1,\dots,X_k$ are independent exponential random variables with rates $\lambda_1,\dots,\lambda_k$, then the random variable

$$X=\sum_{i=1}^{k}X_i$$

is hypoexponentially distributed. The hypoexponential has a minimum coefficient of variation of $1/\sqrt{k}$, attained when all rates are equal (the Erlang case).
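The construction above can be checked by simulation. The following is a minimal sketch using only the Python standard library; the function names and the example rates are illustrative assumptions and do not come from the cited sources.

```python
import math
import random


def sample_hypoexponential(rates, rng):
    """Draw one sample of X = X_1 + ... + X_k, with X_i ~ Exponential(rates[i])."""
    return sum(rng.expovariate(lam) for lam in rates)


def theoretical_mean(rates):
    """Mean of the hypoexponential: the sum of 1/lambda_i."""
    return sum(1.0 / lam for lam in rates)


def theoretical_cv(rates):
    """Coefficient of variation: standard deviation divided by mean."""
    variance = sum(1.0 / lam**2 for lam in rates)
    return math.sqrt(variance) / theoretical_mean(rates)


rates = [1.0, 2.0, 5.0]   # example rates, assumed for illustration
rng = random.Random(0)    # fixed seed so the run is repeatable
samples = [sample_hypoexponential(rates, rng) for _ in range(100_000)]
sample_mean = sum(samples) / len(samples)

print(sample_mean, theoretical_mean(rates))  # the two values should be close
print(theoretical_cv(rates))                 # below 1 whenever k > 1
```

With a large number of samples the empirical mean should settle near the sum of the reciprocal rates.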

Relation to the phase-type distribution

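As a result of the definition it is easier to consider this distribution as a special case of the phase-type distribution.[2] The phase-type distribution is the time to absorption of a finite-state Markov process. If we have a $k+1$ state process, where the first $k$ states are transient and state $k+1$ is an absorbing state, then the time from the start of the process until the absorbing state is reached is phase-type distributed. This becomes the hypoexponential if we start in state 1 and move skip-free from state $i$ to state $i+1$ with rate $\lambda_i$, until state $k$ transitions with rate $\lambda_k$ to the absorbing state $k+1$. This can be written in the form of a subgenerator matrix,

$$\begin{bmatrix}
-\lambda_1 & \lambda_1 & 0 & \dots & 0 & 0 \\
0 & -\lambda_2 & \lambda_2 & \ddots & 0 & 0 \\
\vdots & \ddots & \ddots & \ddots & \ddots & \vdots \\
0 & 0 & \ddots & -\lambda_{k-2} & \lambda_{k-2} & 0 \\
0 & 0 & \dots & 0 & -\lambda_{k-1} & \lambda_{k-1} \\
0 & 0 & \dots & 0 & 0 & -\lambda_k
\end{bmatrix}.$$

For simplicity denote the above matrix $\Theta\equiv\Theta(\lambda_1,\dots,\lambda_k)$. If the probability of starting in each of the $k$ states is

$$\boldsymbol{\alpha}=(1,0,\dots,0)$$

then $\operatorname{Hypo}(\lambda_1,\dots,\lambda_k)=\operatorname{PH}(\boldsymbol{\alpha},\Theta)$.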
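The phase-type representation described above can be assembled directly from the list of rates. The sketch below builds the subgenerator matrix and the initial vector as plain Python lists; the helper names `subgenerator`, `initial_vector` and `exit_rates` are hypothetical, chosen only for this illustration.

```python
def subgenerator(rates):
    """Return the k x k subgenerator matrix Theta as a list of rows.

    Row i has -rates[i] on the diagonal and +rates[i] immediately to its
    right. The last row has only the diagonal entry, because state k jumps
    to the absorbing state, which is not represented in Theta.
    """
    k = len(rates)
    theta = [[0.0] * k for _ in range(k)]
    for i, lam in enumerate(rates):
        theta[i][i] = -lam
        if i + 1 < k:
            theta[i][i + 1] = lam
    return theta


def initial_vector(k):
    """alpha = (1, 0, ..., 0): the process always starts in state 1."""
    return [1.0] + [0.0] * (k - 1)


def exit_rates(theta):
    """Rates into the absorbing state: minus the row sums of Theta."""
    return [0.0 - sum(row) for row in theta]


rates = [1.0, 2.0, 5.0]   # example rates, assumed for illustration
theta = subgenerator(rates)
alpha = initial_vector(len(rates))

for row in theta:
    print(row)
print(alpha)
print(exit_rates(theta))  # only the last transient state can be absorbed
```

Every row of the matrix except the last sums to zero, which reflects the skip-free structure: only state $k$ has a direct transition to the absorbing state.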

Two parameter case

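Where the distribution has two parameters ($\lambda_1\neq\lambda_2$) the explicit forms of the probability functions and the associated statistics are:[3]

CDF: $F(x)=1-\frac{\lambda_2}{\lambda_2-\lambda_1}e^{-\lambda_1x}-\frac{\lambda_1}{\lambda_1-\lambda_2}e^{-\lambda_2x}$

PDF: $f(x)=\frac{\lambda_1\lambda_2}{\lambda_1-\lambda_2}\left(e^{-x\lambda_2}-e^{-x\lambda_1}\right)$

Mean: $\frac{1}{\lambda_1}+\frac{1}{\lambda_2}$

Variance: $\frac{1}{\lambda_1^{2}}+\frac{1}{\lambda_2^{2}}$

Coefficient of variation: $\frac{\sqrt{\lambda_1^{2}+\lambda_2^{2}}}{\lambda_1+\lambda_2}$

The coefficient of variation is always less than 1. Given the sample mean ($\bar{x}$) and sample coefficient of variation ($c$), the parameters $\lambda_1$ and $\lambda_2$ can be estimated as follows:

$$\lambda_1=\frac{2}{\bar{x}}\left[1+\sqrt{1+2(c^{2}-1)}\right]^{-1}$$

$$\lambda_2=\frac{2}{\bar{x}}\left[1-\sqrt{1+2(c^{2}-1)}\right]^{-1}$$

These estimators can be derived from the method of moments by setting $\frac{1}{\lambda_1}+\frac{1}{\lambda_2}=\bar{x}$ and $\frac{\sqrt{\lambda_1^{2}+\lambda_2^{2}}}{\lambda_1+\lambda_2}=c$. The resulting parameters $\lambda_1$ and $\lambda_2$ are real values if $c^{2}\in[0.5,1]$.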
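The method-of-moments estimators above translate directly into code. The sketch below is one possible implementation; the function names are hypothetical, and the code additionally excludes the boundary value of a squared coefficient of variation equal to 1, an assumption made here because the second rate is then unbounded.

```python
import math


def fit_two_rate_hypoexponential(sample_mean, cv):
    """Method-of-moments estimates (lambda1, lambda2) from a mean and a CV.

    Requires 0.5 <= cv**2 < 1; outside that range the estimates are not
    real and finite, so a ValueError is raised.
    """
    if not 0.5 <= cv**2 < 1.0:
        raise ValueError("need 0.5 <= cv^2 < 1 for a two-rate fit")
    root = math.sqrt(1.0 + 2.0 * (cv**2 - 1.0))
    lam1 = 2.0 / (sample_mean * (1.0 + root))
    lam2 = 2.0 / (sample_mean * (1.0 - root))
    return lam1, lam2


def mean_and_cv(lam1, lam2):
    """Mean and coefficient of variation of Hypo(lam1, lam2)."""
    mean = 1.0 / lam1 + 1.0 / lam2
    cv = math.sqrt(lam1**2 + lam2**2) / (lam1 + lam2)
    return mean, cv


# Round trip: start from known rates, compute the moments, recover the rates.
true_mean, true_cv = mean_and_cv(1.0, 3.0)
lam1, lam2 = fit_two_rate_hypoexponential(true_mean, true_cv)
print(lam1, lam2)  # should be close to the starting rates 1 and 3
```

The estimator always returns the smaller rate first, since the term in square brackets is larger for $\lambda_1$ than for $\lambda_2$.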

Characterization

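A random variable $X\sim\operatorname{Hypo}(\lambda_1,\dots,\lambda_k)$ has cumulative distribution function given by,

$$F(x)=1-\boldsymbol{\alpha}e^{x\Theta}\mathbf{1}$$

and density function,

$$f(x)=-\boldsymbol{\alpha}e^{x\Theta}\Theta\mathbf{1},$$

where $\mathbf{1}$ is a column vector of ones of size $k$ and $e^{A}$ is the matrix exponential of $A$. When $\lambda_i\neq\lambda_j$ for all $i\neq j$, the density function can be written as

$$f(x)=\sum_{i=1}^{k}\lambda_ie^{-x\lambda_i}\left(\prod_{j=1,j\neq i}^{k}\frac{\lambda_j}{\lambda_j-\lambda_i}\right)=\sum_{i=1}^{k}\ell_i(0)\lambda_ie^{-x\lambda_i}$$

where $\ell_1(x),\dots,\ell_k(x)$ are the Lagrange basis polynomials associated with the points $\lambda_1,\dots,\lambda_k$. The distribution has Laplace transform

$$\mathcal{L}\{f(x)\}=-\boldsymbol{\alpha}(sI-\Theta)^{-1}\Theta\mathbf{1},$$

which can be used to find moments,

$$E[X^{n}]=(-1)^{n}n!\,\boldsymbol{\alpha}\Theta^{-n}\mathbf{1}.$$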
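For pairwise distinct rates, the closed-form density above can be evaluated without any matrix exponential. The sketch below computes the Lagrange weights $\ell_i(0)=\prod_{j\neq i}\lambda_j/(\lambda_j-\lambda_i)$ and uses them for the density, for the distribution function (obtained here by integrating the density term by term), and for the raw moments (each exponential term contributes $n!/\lambda_i^{n}$). The function names are hypothetical, and the approach assumes the rates are well separated; nearly equal rates make the weights large and the sums numerically unstable.

```python
import math


def lagrange_weights(rates):
    """Weights l_i(0) = prod_{j != i} rates[j] / (rates[j] - rates[i]).

    Only valid when all rates are pairwise distinct.
    """
    weights = []
    for i, lam_i in enumerate(rates):
        w = 1.0
        for j, lam_j in enumerate(rates):
            if j != i:
                w *= lam_j / (lam_j - lam_i)
        weights.append(w)
    return weights


def pdf(x, rates):
    """Density f(x) as a weighted sum of exponential densities."""
    w = lagrange_weights(rates)
    return sum(wi * lam * math.exp(-lam * x) for wi, lam in zip(w, rates))


def cdf(x, rates):
    """F(x), using the fact that the Lagrange weights sum to one."""
    w = lagrange_weights(rates)
    return 1.0 - sum(wi * math.exp(-lam * x) for wi, lam in zip(w, rates))


def raw_moment(n, rates):
    """E[X^n]: each exponential term contributes n! / lambda^n."""
    w = lagrange_weights(rates)
    return sum(wi * math.factorial(n) / lam**n for wi, lam in zip(w, rates))


rates = [1.0, 2.0, 5.0]   # example rates, assumed for illustration
print(pdf(1.0, rates), cdf(1.0, rates))
print(raw_moment(1, rates))                              # the mean
print(raw_moment(2, rates) - raw_moment(1, rates) ** 2)  # the variance
```

The first moment and the variance computed this way should agree with the sums of $1/\lambda_i$ and $1/\lambda_i^{2}$ respectively.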

General case

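In the general case there are $a$ distinct sums of exponential distributions with rates $\lambda_1,\lambda_2,\dots,\lambda_a$, and the number of terms in each sum equals $r_1,r_2,\dots,r_a$ respectively. The cumulative distribution function for $t\geq0$ is given by

$$F(t)=1-\left(\prod_{j=1}^{a}\lambda_j^{r_j}\right)\sum_{k=1}^{a}\sum_{l=1}^{r_k}\frac{\Psi_{k,l}(-\lambda_k)\,t^{r_k-l}\exp(-\lambda_kt)}{(r_k-l)!\,(l-1)!},$$

with

$$\Psi_{k,l}(x)=-\frac{\partial^{l-1}}{\partial x^{l-1}}\left(\prod_{j=0,j\neq k}^{a}(\lambda_j+x)^{-r_j}\right),$$

and the additional convention $\lambda_0=0$, $r_0=1$.[4]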
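As a sanity check on the formula above, consider a single rate $\lambda$ repeated twice ($a=1$, $r_1=2$), which should reproduce the Erlang distribution with shape 2. The working below takes $\Psi_{k,l}(x)=-\partial^{l-1}/\partial x^{l-1}\prod_{j\neq k}(\lambda_j+x)^{-r_j}$ with $\lambda_0=0$, $r_0=1$, so that only the $j=0$ factor, $x^{-1}$, remains in the product. It is a worked illustration, not taken from the cited reference.

```latex
\Psi_{1,1}(x) = -x^{-1}
  \quad\Rightarrow\quad \Psi_{1,1}(-\lambda) = \frac{1}{\lambda},
\qquad
\Psi_{1,2}(x) = -\frac{\partial}{\partial x}\,x^{-1} = x^{-2}
  \quad\Rightarrow\quad \Psi_{1,2}(-\lambda) = \frac{1}{\lambda^{2}},

F(t) = 1 - \lambda^{2}\left[
         \frac{\Psi_{1,1}(-\lambda)\,t\,e^{-\lambda t}}{1!\,0!}
       + \frac{\Psi_{1,2}(-\lambda)\,e^{-\lambda t}}{0!\,1!}
       \right]
     = 1 - \lambda^{2}\left[\frac{t}{\lambda} + \frac{1}{\lambda^{2}}\right]e^{-\lambda t}
     = 1 - (1 + \lambda t)\,e^{-\lambda t}.
```

The result is the familiar distribution function of the sum of two independent exponential random variables with a common rate $\lambda$.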

Uses

This distribution has been used in population genetics,[5] cell biology,[6][7] and queueing theory.[8][9]

References

  1. "HypoexponentialDistribution". Wolfram Language & System Documentation Center. Wolfram. 2012. Retrieved 27 February 2024.
  2. Legros, Benjamin; Jouini, Oualid (2015). "A linear algebraic approach for the computation of sums of Erlang random variables". Applied Mathematical Modelling. 39 (16): 4971–4977. doi:10.1016/j.apm.2015.04.013.
  3. Bolch, Gunter; Greiner, Stefan; de Meer, Hermann; Trivedi, Kishor S. (2006). Queuing Networks and Markov Chains: Modeling and Performance Evaluation with Computer Science Applications (2nd ed.). Wiley. pp. 24–25. doi:10.1002/0471791571. ISBN 978-0-471-79157-7.
  4. Amari, Suprasad V.; Misra, Ravindra B. (1997). "Closed-form expressions for distribution of sum of exponential random variables". IEEE Transactions on Reliability. 46 (4): 519–522. doi:10.1109/24.693785.
  5. Strimmer, Korbinian; Pybus, Oliver G. (2001). "Exploring the demographic history of DNA sequences using the generalized skyline plot". Molecular Biology and Evolution. 18 (12): 2298–2305. doi:10.1093/oxfordjournals.molbev.a003776. PMID 11719579.
  6. Yates, Christian A.; Ford, Matthew J.; Mort, Richard L. (2017). "A multi-stage representation of cell proliferation as a Markov process". Bulletin of Mathematical Biology. 79 (12): 2905–2928. arXiv:1705.09718. doi:10.1007/s11538-017-0356-4. PMC 5709504. PMID 29030804.
  7. Gavagnin, Enrico; Ford, Matthew J.; Mort, Richard L.; Rogers, Tim; Yates, Christian A. (2019). "The invasion speed of cell migration models with realistic cell cycle time distributions". Journal of Theoretical Biology. 481: 91–99. arXiv:1806.03140. doi:10.1016/j.jtbi.2018.09.010. PMID 30219568.
  8. Călinescu, Malenia (August 2009). "Forecasting and capacity planning for ambulance services" (PDF). Faculty of Sciences. Vrije Universiteit Amsterdam. Archived from the original (PDF) on 15 February 2010.
  9. Bekker, René; Koeleman, Paulien M. (2011). "Scheduling admissions and reducing variability in bed demand". Health Care Management Science. 14 (3): 237–249. doi:10.1007/s10729-011-9163-x. PMC 3158339. PMID 21667090.

Further reading

  • M. F. Neuts. (1981) Matrix-Geometric Solutions in Stochastic Models: an Algorithmic Approach, Chapter 2: Probability Distributions of Phase Type; Dover Publications Inc.
  • G. Latouche, V. Ramaswami. (1999) Introduction to Matrix Analytic Methods in Stochastic Modelling, 1st edition. Chapter 2: PH Distributions; ASA SIAM.
  • Colm A. O'Cinneide (1999). Phase-type distribution: open problems and a few properties, Communications in Statistics: Stochastic Models, 15(4), 731–757.
  • L. Leemis and J. McQueston (2008). Univariate distribution relationships, The American Statistician, 62(1), 45–53.
  • S. Ross. (2007) Introduction to Probability Models, 9th edition, New York: Academic Press
