# Biplot

> Mediated Wiki article. Canonical URL: https://mediated.wiki/source/Biplot
> Markdown URL: https://mediated.wiki/source/Biplot.md
> Source: https://en.wikipedia.org/wiki/Biplot
> Source revision: 1354951565
> License: Creative Commons Attribution-ShareAlike 4.0 International (https://creativecommons.org/licenses/by-sa/4.0/)

{{Short description|Type of exploratory graph used in statistics}}
{{confused|Bye Plot}}
[[File:Biplot of Anderson's Iris data set.svg|thumb|Biplot of [Fisher's iris data set](/source/Iris_flower_data_set). The scattered points are the input scores of observations and the arrows show the contribution of each feature to the input loading vectors. ]]
thumb|Spectramap biplot of Anderson's iris data set
[[File:IrisDAbiplot.jpg|thumb|Discriminant analysis biplot of Fisher's [iris data](/source/Iris_flower_data_set)]]

'''Biplots''' are a type of exploratory graph used in [statistics](/source/statistics), a generalization of the simple two-variable [scatterplot](/source/scatter_plot).
A biplot overlays a ''score plot'' with a ''loading plot''.
A biplot allows information on both [samples](/source/sampling_(statistics)) and variables of a [data matrix](/source/data_matrix_(statistics)) to be displayed graphically. Samples are displayed as points while variables are displayed either as vectors, linear [axes](/source/Cartesian_coordinate_system) or nonlinear trajectories. In the case of categorical variables, ''category level points'' may be used to represent the levels of a categorical variable. A ''generalised'' biplot displays information on both continuous and categorical variables.

The biplot was introduced by [K. Ruben Gabriel](/source/K._Ruben_Gabriel) (1971).<ref>'Gabriel, K. R. (1971). The biplot graphic display of matrices with application to principal component analysis. ''Biometrika'', ''58''(3), 453–467.</ref>

==Construction==
A biplot is constructed by using the [singular value decomposition](/source/singular_value_decomposition) (SVD) to obtain a [low-rank approximation](/source/low-rank_approximation) to a transformed version of the data matrix '''X''', whose ''n'' rows are the samples (also called the cases, or objects), and whose ''p'' columns are the variables. The transformed data matrix '''Y''' is obtained from the original matrix '''X''' by centering and optionally standardizing the columns (the variables). Using the SVD, we can write '''Y'''&nbsp;=&nbsp;Σ<sub>''k''=1,...''p''</sub>''d''<sub>''k''</sub>'''u'''<sub>''k''</sub>'''v'''<sub>''k''</sub><sup>T</sup>;, where the '''u'''<sub>''k''</sub> are ''n''-dimensional column vectors, the '''v'''<sub>''k''</sub> are ''p''-dimensional column vectors, and the ''d''<sub>''k''</sub> are a non-increasing sequence of non-negative [scalars](/source/scalar_(mathematics)). The biplot is formed from two scatterplots that share a common set of axes and have a between-set [scalar product](/source/scalar_product) interpretation. The first scatterplot is formed from the points (''d''<sub>1</sub><sup>α</sup>''u''<sub>1''i''</sub>, &nbsp;''d''<sub>2</sub><sup>α</sup>''u''<sub>2''i''</sub>), for ''i''&nbsp;=&nbsp;1,...,''n''.  The second plot is formed from the points (''d''<sub>1</sub><sup>1−α</sup>''v''<sub>1''j''</sub>,&nbsp;''d''<sub>2</sub><sup>1−α</sup>''v''<sub>2''j''</sub>), for ''j''&nbsp;=&nbsp;1,...,''p''.  This is the biplot formed by the dominant two terms of the SVD, which can then be represented in a two-dimensional display.  Typical choices of α are 1 (to give a distance interpretation to the row display) and 0 (to give a distance interpretation to the column display), and in some rare cases α=1/2 to obtain a symmetrically scaled biplot (which gives no distance interpretation to the rows or the columns, but only the scalar product interpretation).  The set of points depicting the variables can be drawn as arrows from the origin to reinforce the idea that they represent biplot axes onto which the samples can be projected to approximate the original data.

==References==
{{reflist}}

==Sources==
* {{cite journal|doi=10.1093/biomet/58.3.453|last=Gabriel|first=K.R.|author-link=K. Ruben Gabriel|year=1971|title=The biplot graphic display of matrices with application to principal component analysis|journal=Biometrika|volume=58 |issue=3|pages=453–467}}
* Gower, J.C., Lubbe, S. and le Roux, N. (2010). ''Understanding Biplots''. [Wiley](/source/John_Wiley_%26_Sons). {{ISBN|978-0-470-01255-0}}
* Gower, J.C. and Hand, D.J (1996). ''Biplots''. [Chapman & Hall](/source/Chapman_%26_Hall), London, UK. {{ISBN|0-412-71630-5}}
* Yan, W. and Kang, M.S. (2003). ''GGE Biplot Analysis''. [CRC Press](/source/CRC_Press), Boca Raton, Florida. {{ISBN|0-8493-1338-4}}
* Demey, J.R., Vicente-Villardón, J.L., Galindo-Villardón, M.P. and Zambrano, A.Y. (2008).  ''Identifying molecular markers associated with classification of genotypes by External Logistic Biplots''. [Bioinformatics](/source/Bioinformatics). 24(24):2832–2838

{{Statistics|descriptive}}

Category:Statistical charts and diagrams
Category:Factor analysis

---
Adapted from the Wikipedia article [Biplot](https://en.wikipedia.org/wiki/Biplot) by Wikipedia contributors ([contributor history](https://en.wikipedia.org/wiki/Biplot?action=history)). Available under [Creative Commons Attribution-ShareAlike 4.0 International](https://creativecommons.org/licenses/by-sa/4.0/). Changes may have been made.
