Factor regression model

From The Right Wiki
Jump to navigationJump to search

Within statistical factor analysis, the factor regression model,[1] or hybrid factor model,[2] is a special multivariate model with the following form:

yn=Axn+Bzn+c+en

where,

yn is the n-th G×1 (known) observation.
xn is the n-th sample Lx (unknown) hidden factors.
A is the (unknown) loading matrix of the hidden factors.
zn is the n-th sample Lz (known) design factors.
B is the (unknown) regression coefficients of the design factors.
c is a vector of (unknown) constant term or intercept.
en is a vector of (unknown) errors, often white Gaussian noise.

Relationship between factor regression model, factor model and regression model

The factor regression model can be viewed as a combination of factor analysis model (yn=Axn+c+en) and regression model (yn=Bzn+c+en). Alternatively, the model can be viewed as a special kind of factor model, the hybrid factor model [2]

yn=Axn+Bzn+c+en=[AB][xnzn]+c+en=Dfn+c+en

where, D=[AB] is the loading matrix of the hybrid factor model and fn=[xnzn] are the factors, including the known factors and unknown factors.

Software

Open source software to perform factor regression is available.

References

  1. Carvalho, Carlos M. (1 December 2008). "High-Dimensional Sparse Factor Modeling: Applications in Gene Expression Genomics". Journal of the American Statistical Association. 103 (484): 1438–1456. doi:10.1198/016214508000000869. PMC 3017385. PMID 21218139.
  2. 2.0 2.1 Meng, J. (2011). "Uncover cooperative gene regulations by microRNAs and transcription factors in glioblastoma using a nonnegative hybrid factor model". International Conference on Acoustics, Speech and Signal Processing. Archived from the original on 2011-11-23.