In probability theory and statistics, the '''split normal distribution''' also known as the '''two-piece normal distribution''' results from joining at the mode the corresponding halves of two normal distributions with the same mode but different variances. It is claimed by Johnson et al.<ref name="Johnson1994" /> that this distribution was introduced by Gibbons and Mylroie<ref name=Gibbons/> and by John.<ref name="John1982" /> But these are two of several independent rediscoveries of the Zweiseitige Gauss'sche Gesetz introduced in the posthumously published ''Kollektivmasslehre'' (1897)<ref>Fechner, G.T. (ed. Lipps, G.F.) (1897). ''Kollectivmasslehre''. Engelmann, Leipzig.</ref> of Gustav Theodor Fechner (1801-1887), see Wallis (2014).<ref>Wallis, K.F. (2014). The two-piece normal, binormal, or double Gaussian distribution: its origin and rediscoveries. ''Statistical Science'', vol. 29, no. 1, pp.106-112. doi:10.1214/13-STS417.</ref> Another rediscovery has appeared more recently in a finance journal.<ref>de Roon, F. and Karehnke, P. (2016). A simple skewed distribution with asset pricing applications. ''Review of Finance'', 2016, 1-29.</ref>
{{Probability distribution | name = Split-normal | type = density <!-- | pdf_image = 350px|Probability density function for the normal distribution<br /><small>The red curve is the ''standard normal distribution''</small> | cdf_image = 350px|Cumulative distribution function for the normal distribution<br /> --> | notation = <math>\mathcal{SN}(\mu,\,\sigma_1,\sigma_2)</math> | parameters = <math>\mu \in \Re </math> — mode (location, real)<br/><math>\sigma_1 > 0 </math> — left-hand-side standard deviation (scale, real)<br/><math>\sigma_2 > 0 </math> — right-hand-side standard deviation (scale, real) | support = <math>x \in \Re </math>| | pdf = <math> A \exp \left(- \frac {(x-\mu)^2}{2 \sigma_1^2}\right) \quad \text{if } x< \mu</math> <br/><math> A \exp \left(- \frac {(x-\mu)^2}{2 \sigma_2^2}\right) \quad \text{otherwise,}</math> <br/><math> \text{where} \quad A= \sqrt{2/\pi} (\sigma_1+\sigma_2)^{-1}</math> | mode = <math> \mu </math> | mean = <math> \mu+\sqrt{2 / \pi}(\sigma_2-\sigma_1)</math> | variance = <math>(1-2 / \pi)(\sigma_2-\sigma_1)^2 + \sigma_1 \sigma_2</math> | skewness = <math> \gamma_3 = \sqrt{\frac{2}{\pi}}(\sigma_2-\sigma_1)\left[\left(\frac{4}{\pi}-1\right)(\sigma_2-\sigma_1)^2 + \sigma_1 \sigma_2\right]</math> }}
== Definition == The split normal distribution arises from merging two opposite halves of two probability density functions (PDFs) of normal distributions in their common mode.
The PDF of the split normal distribution is given by<ref name=Johnson1994/>
:<math>f(x;\mu,\sigma_1,\sigma_2) = \begin{cases} A \exp \left(- \dfrac {(x-\mu)^2}{2 \sigma_1^2}\right) & \text{if } x< \mu \\[1ex] A \exp \left(- \dfrac {(x-\mu)^2}{2 \sigma_2^2}\right) & \text{otherwise} \end{cases} </math> where :<math>\quad A= \sqrt{2/\pi} (\sigma_1+\sigma_2)^{-1}.</math>
=== Discussion === The split normal distribution results from merging two halves of normal distributions. In a general case the 'parent' normal distributions can have different variances which implies that the joined PDF would not be continuous. To ensure that the resulting PDF integrates to 1, the normalizing constant '''A''' is used.
In a special case when <math>\sigma_1^2=\sigma_2^2=\sigma_{*}^2</math> the split normal distribution reduces to normal distribution with variance <math>\sigma_{*}^2</math>.
When σ<sub>2</sub>≠σ<sub>1</sub> the constant '''A''' is different from the constant of normal distribution. However, when <math>\sigma_1^2=\sigma_2^2=\sigma_{*}^2</math> the constants are equal.
The sign of its third central moment is determined by the difference (σ<sub>2</sub>-σ<sub>1</sub>). If this difference is positive, the distribution is skewed to the right and if negative, then it is skewed to the left.
Other properties of the split normal density were discussed by Johnson et al.<ref name="Johnson1994" /> and Julio.<ref name="Julio2007" />
== Alternative formulations == The formulation discussed above originates from John.<ref name="John1982" /> The literature offers two mathematically equivalent alternative parameterizations . Britton, Fisher and Whitley<ref name="BFW1998" /> offer a parameterization if terms of mode, dispersion and normed skewness, denoted with <math>\mathcal{SN}(\mu,\, \sigma^2,\gamma)</math>. The parameter μ is the mode and has equivalent to the mode in John's formulation. The parameter σ <sup>2</sup>>0 informs about the dispersion (scale) and should not be confused with variance. The third parameter, γ ∈ (-1,1), is the normalized skew.
The second alternative parameterization is used in the Bank of England's communication and is written in terms of mode, dispersion and unnormed skewness and is denoted with <math>\mathcal{SN}(\mu,\, \sigma^2,\xi)</math>. In this formulation the parameter μ is the mode and is identical as in John's <ref name="John1982" /> and Britton, Fisher and Whitley's <ref name="BFW1998" /> formulation. The parameter σ <sup>2</sup> informs about the dispersion (scale) and is the same as in the Britton, Fisher and Whitley's formulation. The parameter ξ equals the difference between the distribution's mean and mode and can be viewed as unnormed measure of skewness.
The three parameterizations are mathematically equivalent, meaning that there is a strict relationship between the parameters and that it is possible to go from one parameterization to another. The following relationships hold:<ref name="BD2011" /> :<math>\begin{align} \sigma^2 &= \sigma_1^2(1+\gamma)= \sigma_2^2(1-\gamma) \\ \gamma &= \frac{\sigma_2^2-\sigma_1^2}{\sigma_2^2+\sigma_1^2} \\ \xi &=\sqrt{2 / \pi}(\sigma_2-\sigma_1) \\ \gamma &= \operatorname{sgn}(\xi) \sqrt{1-\left( \frac{\sqrt{1+2\beta}-1}{\beta} \right)^2}, \quad \text{where} \quad \beta = \frac{\pi\xi^2}{2\sigma^2}. \end{align}</math>
== Multivariate Extensions == The multivariate generalization of the split normal distribution was proposed by Villani and Larsson.<ref name="VL2006"/> They assume that each of the principal components has univariate split normal distribution with a different set of parameters μ, σ<sub>2</sub> and σ<sub>1</sub>.
== Estimation of parameters == John<ref name="John1982" /> proposes to estimate the parameters using maximum likelihood method. He shows that the likelihood function can be expressed in an intensive form, in which the scale parameters σ<sub>1</sub> and σ<sub>2</sub> are a function of the location parameter μ. The likelihood in its intensive form is: :<math> L(\mu) = -\left[\sum_{x_i: x_i<\mu} (x_i-\mu)^2 \right]^{1/3} - \left[\sum_{x_i: x_i>\mu} (x_i-\mu)^2 \right]^{1/3}</math> and has to be maximized numerically with respect to a single parameter μ only.
Given the maximum likelihood estimator <math>\hat{\mu}</math> the other parameters take values: :<math> \hat{\sigma}_1^2 = \frac{-L(\mu)}{N} \left[\sum_{x_i: x_i<\mu} (x_i-\mu)^2 \right]^{2/3},</math> :<math> \hat{\sigma}_2^2 = \frac{-L(\mu)}{N} \left[\sum_{x_i: x_i>\mu} (x_i-\mu)^2 \right]^{2/3},</math> where '''N''' is the number of observations.
Villani and Larsson<ref name="VL2006" /> propose to use either maximum likelihood method or bayesian estimation and provide some analytical results for either univariate and multivariate case.
== Applications == The split normal distribution has been used mainly in econometrics and time series. A remarkable area of application is the construction of the fan chart, a representation of the inflation forecast distribution reported by inflation targeting central banks around the globe.<ref name="Julio2007"/><ref name="BoEInfRep"/>
== References == {{Reflist|refs= <ref name="BoEInfRep">Bank of England, ''[http://www.bankofengland.co.uk/publications/inflationreport/irfanch.htm Inflation Report] {{webarchive|url=https://web.archive.org/web/20100813232522/http://www.bankofengland.co.uk/publications/inflationreport/irfanch.htm |date=2010-08-13 }}''</ref>
<ref name=Gibbons>{{cite journal |last1=Gibbons|first1=J.F. |last2=Mylroie|first2=S. |year=1973 |title=Estimation of impurity profiles in ion-implanted amorphous targets using joined half-Gaussian distributions |journal= Applied Physics Letters |volume= 22 |issue=11 |pages= 568–569|doi=10.1063/1.1654511|bibcode=1973ApPhL..22..568G }} </ref>
<ref name=BD2011>{{Cite conference | last = Banerjee | first = N. |author2=A. Das | title = Fan Chart: Methodology and its Application to Inflation Forecasting in India | series = Reserve Bank of India Working Paper Series | year = 2011 }} </ref>
<ref name=BFW1998>{{Cite journal | volume = February 1998 | pages = 30–37 | last = Britton | first = E. |author2=P. Fisher |author3=Whitley, J. | title = The inflation report projections: understanding the fan chart | journal = Quarterly Bulletin | year = 1998 }} </ref>
<ref name=VL2006>{{Cite journal | issn = 0361-0926 | volume = 35 | issue = 6 | pages = 1123–1140 | last = Villani | first = Mattias |author2=Rolf Larsson | title = The Multivariate Split Normal Distribution and Asymmetric Principal Components Analysis | journal = Communications in Statistics - Theory and Methods | year = 2006 | doi=10.1080/03610920600672252 | citeseerx = 10.1.1.533.4095 | s2cid = 124959166 }} </ref>
<ref name="John1982">{{Cite journal | journal = Communications in Statistics - Theory and Methods | volume = 11 | issue = 8 | pages = 879–885 | last = John |first=S. | title = The three-parameter two-piece normal family of distributions and its fitting | year = 1982 | doi=10.1080/03610928208828279 }}</ref>
<ref name="Johnson1994">{{Cite book | publisher = John Wiley & Sons | last = Johnson, N.L., Kotz, S. and Balakrishnan, N. | title = Continuous Univariate Distributions, Volume 1 |page=173 |isbn=978-0-471-58495-7 | year = 1994 }}</ref>
<ref name="Julio2007">{{Cite conference | publisher = Banco de la República | last = Juan Manuel Julio | title = The Fan Chart: The Technical Details Of The New Implementation | accessdate = 2010-09-11 | year = 2007 | url = http://ideas.repec.org/p/col/000094/004294.html | postscript =, [http://www.banrep.gov.co/docum/ftp/borra468.pdf direct link] }}</ref>
}} {{ProbDistributions|continuous-infinite}}
Category:Continuous distributions Category:Normal distribution