Description
Definition
Suppose
ACG was originally defined on the real sphere (and projective space) in 1 and generalized to the complex Sphere in 2 and the real Stiefel/Grassmann manifolds in 3. I believe 4 were the first to generalize to the complex Grassmann manifold. The generalization to quaternionic spaces here is I think new but follows naturally from the same derivations.
The ACG family has the density wrt the normalized invariant measure on the Grassmann and Stiefel manifolds of5
Note for
so the distribution is invariant to right unitary transformations and is thus a distribution on the Grassmann manifold.
Note
Representation of points on the Grassmann manifold
Points on the Grassmann manifold can also be expressed as a unique projector matrix
where
I haven't seen this covered in any of the papers, and for
ACG is mainly useful when one requires:
- efficient exact sampling OR
- efficient evaluation of the normalization constant (e.g. hierarchical Bayesian models where
$\Sigma$ is also inferred)
Parameterizations
Scale
The scale parameterization (as used above) uses a positive-definite scale matrix
Note that
Therefore, the useful information in
Furthermore, when
Precision
The precision parameterization
Properties
Closure
For any
Normalization constant
The normalization constant in the two parameterizations is
$c_{k,\mathbb{F}}(\Sigma) = |\Sigma|^{k\mathrm{dim}_\mathbb{F}/2}$ $c_{k,\mathbb{F}}(P) = |P|^{-k\mathrm{dim}_\mathbb{F}/2}$
Mode
Let
Then using Cauchy interlacing theorem (or its generalization to Hermitian quaternion matrices 6), one can show that the density is globally maximized when
If the first
On the Stiefel manifold, ACG is of course not unimodal.
Moments
Empirically, the Riemannian mean on
On
On
Of course, this is just empirical. I haven't seen any works derive the Riemannian mean and was not successful in deriving it myself.
Median
Unknown
Fitting
MLE
It suffices to maximize the likelihood on the Grassmann manifold.
Given an IID sample
which can be solved with an expectation maximization approach:
For
4 proved that the log-likelihood on the real and complex Grassmann manifolds is convex on the geodesics of SPD matrices with unit determinant and found that for almost all samples of size
The above algorithms should probably be benchmarked to compare performance and accuracy, but it's probably better to have an implementation that defaults to using the expectation-maximization algorithm but allows the user to specify a Manopt optimization method, in which case optimization on SPDFixedDeterminant
with the specified algorithm is performed.
References/Notes
Footnotes
-
Tyler, David E. “Statistical Analysis for the Angular Central Gaussian Distribution on the Sphere.” Biometrika 74, no. 3 (1987): 579–89. https://doi.org/10.2307/2336697. ↩ ↩2
-
Kent, John T. "Data analysis for shapes and images." Journal of statistical planning and inference 57.2 (1997): 181-193. https://doi.org/10.1016/S0378-3758(96)00043-2 ↩
-
Chikuse, Yasuko. "The matrix angular central Gaussian distribution." Journal of Multivariate Analysis 33.2 (1990): 265-274. https://doi.org/10.1016/0047-259X(90)90050-R ↩
-
Auderset, Claude, Christian Mazza, and Ernst Ruh. "Grassmannian estimation." arXiv preprint arXiv:0809.3697 (2008). https://doi.org/10.48550/arXiv.0809.3697 ↩ ↩2
-
Because $\Sigma$ is unique only up to a scale factor, some references require it have unit determinant and therefore unit normalization constant. ↩
-
Tam, Tin-Yau. "Interlacing inequalities and Cartan subspaces of classical real simple Lie algebras." SIAM Journal on Matrix Analysis and Applications 21.2 (2000): 581-592. https://doi.org/10.1137/S0895479898343528 ↩
-
Ciobotaru, Corina, and Christian Mazza. "Consistency and asymptotic normality of M-estimates of scatter on Grassmann manifolds." Journal of Multivariate Analysis 190 (2022): 104998. https://doi.org/10.1016/j.jmva.2022.104998 ↩