Scatter matrix

From The Right Wiki
Jump to navigationJump to search
For the notion in quantum mechanics, see scattering matrix.

In multivariate statistics and probability theory, the scatter matrix is a statistic that is used to make estimates of the covariance matrix, for instance of the multivariate normal distribution.

Definition

Given n samples of m-dimensional data, represented as the m-by-n matrix, X=[x1,x2,,xn], the sample mean is

x=1nj=1nxj

where xj is the j-th column of X.[1] The scatter matrix is the m-by-m positive semi-definite matrix

S=j=1n(xjx)(xjx)T=j=1n(xjx)(xjx)=(j=1nxjxjT)nxxT

where ()T denotes matrix transpose,[2] and multiplication is with regards to the outer product. The scatter matrix may be expressed more succinctly as

S=XCnXT

where Cn is the n-by-n centering matrix.

Application

The maximum likelihood estimate, given n samples, for the covariance matrix of a multivariate normal distribution can be expressed as the normalized scatter matrix

CML=1nS.[3]

When the columns of X are independently sampled from a multivariate normal distribution, then S has a Wishart distribution.

See also

References

  1. Raghavan (2018-08-16). "Scatter matrix, Covariance and Correlation Explained". Medium. Retrieved 2022-12-28.
  2. Raghavan (2018-08-16). "Scatter matrix, Covariance and Correlation Explained". Medium. Retrieved 2022-12-28.
  3. Liu, Zhedong (April 2019). Robust Estimation of Scatter Matrix, Random Matrix Theory and an Application to Spectrum Sensing (PDF) (Master of Science). King Abdullah University of Science and Technology.