MANOVA/MANCOVA: Zero to Hero Tutorial
This comprehensive tutorial takes you from the foundational concepts of Multivariate Analysis of Variance (MANOVA) and Multivariate Analysis of Covariance (MANCOVA) all the way through advanced test statistics, assumption checking, post-hoc analysis, effect size interpretation, and practical usage within the DataStatPro application. Whether you are a complete beginner or an experienced analyst, this guide is structured to build your understanding step by step.
Table of Contents
- Prerequisites and Background Concepts
- What are MANOVA and MANCOVA?
- The Mathematical Framework
- Assumptions of MANOVA and MANCOVA
- Multivariate Test Statistics
- MANCOVA: Adding Covariates
- Effect Size Measures
- Follow-Up Analyses
- Power Analysis and Sample Size
- Model Fit and Evaluation
- Assumption Checking and Diagnostics
- Contrast Analysis in MANOVA
- Repeated Measures MANOVA (Profile Analysis)
- Using the MANOVA/MANCOVA Component
- Computational and Formula Details
- Worked Examples
- Common Mistakes and How to Avoid Them
- Troubleshooting
- Quick Reference Cheat Sheet
1. Prerequisites and Background Concepts
Before diving into MANOVA and MANCOVA, it is helpful to be familiar with the following foundational concepts. Do not worry if you are not — each concept is briefly explained here.
1.1 Univariate ANOVA Recap
Analysis of Variance (ANOVA) tests whether the means of a single continuous dependent variable differ across two or more groups defined by a categorical independent variable (factor). The core idea is to partition the total variability into:
- Between-group (treatment) variability: How much the group means differ from the grand mean.
- Within-group (error) variability: How much individual observations differ from their own group mean.
The F-ratio compares these two sources:

$$F = \frac{MS_{between}}{MS_{within}} = \frac{SS_{between}/(k-1)}{SS_{within}/(N-k)}$$

Where $k$ is the number of groups and $N$ is the total number of observations. A large $F$ provides evidence that at least one group mean differs from the others.
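The partition above can be sketched in a few lines of NumPy (an illustrative helper, not DataStatPro's implementation):

```python
import numpy as np

def anova_f(groups):
    """One-way ANOVA F-ratio: returns (F, df_between, df_within) for 1-D group samples."""
    k = len(groups)                                  # number of groups
    N = sum(len(g) for g in groups)                  # total observations
    grand = np.concatenate(groups).mean()            # grand mean
    ss_between = sum(len(g) * (g.mean() - grand) ** 2 for g in groups)
    ss_within = sum(((g - g.mean()) ** 2).sum() for g in groups)
    f_ratio = (ss_between / (k - 1)) / (ss_within / (N - k))
    return f_ratio, k - 1, N - k

F, df1, df2 = anova_f([np.array([5.0, 6.0, 7.0]), np.array([8.0, 9.0, 10.0])])
```

For these toy data the group means (6 and 9) are far apart relative to the within-group spread, so the F-ratio is large.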
1.2 Vectors and Matrices
In multivariate analysis, each observation is described by a vector of measurements on $p$ dependent variables:

$$\mathbf{y}_i = (y_{i1}, y_{i2}, \ldots, y_{ip})^\top$$

The mean vector for group $j$ is:

$$\bar{\mathbf{y}}_j = \frac{1}{n_j}\sum_{i=1}^{n_j}\mathbf{y}_{ij}$$

The grand mean vector across all $N$ observations is:

$$\bar{\mathbf{y}} = \frac{1}{N}\sum_{j=1}^{k}\sum_{i=1}^{n_j}\mathbf{y}_{ij}$$
1.3 The Covariance Matrix
The covariance matrix ($\boldsymbol{\Sigma}$, estimated by $\mathbf{S}$) contains:
- Variances of each dependent variable on the diagonal: $\sigma_{ll} = \mathrm{Var}(y_l)$.
- Covariances between pairs of dependent variables off-diagonal: $\sigma_{lm} = \mathrm{Cov}(y_l, y_m)$.
The covariance matrix is symmetric ($\boldsymbol{\Sigma} = \boldsymbol{\Sigma}^\top$) and positive semi-definite ($\mathbf{a}^\top\boldsymbol{\Sigma}\mathbf{a} \ge 0$ for any vector $\mathbf{a}$).
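These properties are easy to verify numerically; a minimal NumPy sketch with simulated data:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))       # 50 observations on p = 3 dependent variables
S = np.cov(X, rowvar=False)        # unbiased sample covariance (n - 1 divisor)

is_symmetric = np.allclose(S, S.T)
eigs = np.linalg.eigvalsh(S)       # eigenvalues of a PSD matrix are all >= 0
```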
1.4 The Multivariate Normal Distribution
The multivariate normal distribution generalises the univariate normal to $p$ dimensions. Its probability density function is:

$$f(\mathbf{y}) = \frac{1}{(2\pi)^{p/2}|\boldsymbol{\Sigma}|^{1/2}}\exp\left(-\tfrac{1}{2}(\mathbf{y}-\boldsymbol{\mu})^\top\boldsymbol{\Sigma}^{-1}(\mathbf{y}-\boldsymbol{\mu})\right)$$

Where $|\boldsymbol{\Sigma}|$ is the determinant of $\boldsymbol{\Sigma}$ and $\boldsymbol{\Sigma}^{-1}$ is its inverse. MANOVA assumes observations follow this distribution within each group.
1.5 Matrix Determinants and Eigenvalues
The determinant of a square matrix is a scalar that summarises the matrix in terms of the "volume" it represents. A determinant of zero indicates a singular (non-invertible) matrix.
Eigenvalues $\lambda$ of a matrix $\mathbf{A}$ satisfy:

$$\mathbf{A}\mathbf{v} = \lambda\mathbf{v}$$

For covariance matrices, eigenvalues represent the variance explained in each orthogonal direction (principal component). They are always non-negative. The sum of all eigenvalues equals the trace of the matrix: $\sum_i \lambda_i = \mathrm{tr}(\mathbf{A})$.
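Both identities (sum of eigenvalues = trace, product = determinant) can be checked directly on a small covariance matrix (illustrative values):

```python
import numpy as np

S = np.array([[4.0, 1.0],
              [1.0, 2.0]])          # a small covariance matrix
eigs = np.linalg.eigvalsh(S)        # eigenvalues: 3 - sqrt(2) and 3 + sqrt(2)
trace_check = np.isclose(eigs.sum(), np.trace(S))         # sum equals trace (= 6)
det_check = np.isclose(np.prod(eigs), np.linalg.det(S))   # product equals det (= 7)
```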
1.6 The F-Distribution and Wilks' Lambda
The F-distribution with degrees of freedom $(df_1, df_2)$ arises from the ratio of two independent chi-squared variables, each divided by its degrees of freedom. It is the reference distribution for many univariate hypothesis tests.
Wilks' Lambda is the primary multivariate test statistic in MANOVA. It can be exactly or approximately converted to an $F$-statistic (see Section 5). Understanding that multivariate test statistics generalise the $F$-ratio to multiple dependent variables simultaneously is the key conceptual bridge from ANOVA to MANOVA.
2. What are MANOVA and MANCOVA?
2.1 MANOVA: Multivariate Analysis of Variance
MANOVA (Multivariate Analysis of Variance) is the multivariate extension of ANOVA. Instead of testing whether groups differ on a single dependent variable, MANOVA simultaneously tests whether groups differ on a set of dependent variables considered jointly.
Formally, MANOVA tests:
- $H_0$: The mean vectors of all $k$ groups are equal: $\boldsymbol{\mu}_1 = \boldsymbol{\mu}_2 = \cdots = \boldsymbol{\mu}_k$.
- $H_1$: At least two group mean vectors differ on at least one dependent variable (or linear combination thereof).
2.2 MANCOVA: Multivariate Analysis of Covariance
MANCOVA (Multivariate Analysis of Covariance) extends MANOVA by adding one or more continuous covariates to the model. Covariates are variables that:
- Are related to the dependent variables but not of primary interest.
- Are included to statistically control for their effects, thereby:
- Reducing error variance (increasing statistical power).
- Removing confounding effects of baseline differences between groups.
MANCOVA asks: "After accounting for the covariates, do groups differ on the set of dependent variables?"
2.3 Why Use MANOVA Instead of Multiple ANOVAs?
A common question is: "Why not just run separate ANOVAs for each dependent variable?" There are several compelling reasons to prefer MANOVA:
| Reason | Explanation |
|---|---|
| Controls Type I error | Multiple ANOVAs inflate the familywise error rate. With 5 DVs at $\alpha = .05$, the familywise rate approaches $1 - (1 - .05)^5 \approx .23$. MANOVA tests all DVs simultaneously at a single $\alpha$. |
| Detects combined effects | Groups may not differ on any single DV but differ significantly on a linear combination of DVs. MANOVA can detect these combined patterns that separate ANOVAs miss. |
| Accounts for correlations | Dependent variables are usually correlated. MANOVA uses the entire covariance structure, not just individual variances, leading to more powerful and appropriate tests. |
| Single, unified test | A single omnibus test provides a clear, coherent answer to "Do the groups differ on the outcome construct?" before decomposing into individual variables. |
⚠️ MANOVA is not always better than separate ANOVAs. If the dependent variables are conceptually unrelated and you have clear, directional hypotheses about each, separate ANOVAs with Bonferroni correction may be more appropriate and interpretable.
2.4 Real-World Applications
MANOVA and MANCOVA are used across many disciplines:
- Clinical Psychology: Comparing treatment groups on multiple symptom severity scales simultaneously (depression, anxiety, sleep quality).
- Education Research: Comparing teaching methods on multiple outcome measures (reading, mathematics, science scores).
- Pharmacology: Comparing drug dosages on multiple physiological endpoints (blood pressure, heart rate, cortisol level).
- Marketing: Comparing advertising strategies on consumer perceptions across multiple brand attributes.
- Sports Science: Comparing training protocols on multiple performance metrics (speed, endurance, strength).
- Neuroscience: Comparing patient groups on multiple cognitive measures (memory, attention, executive function).
- Environmental Science: Comparing sites on multiple ecological variables (species diversity, biomass, soil pH).
2.5 Design Terminology
| Term | Description | Example |
|---|---|---|
| Dependent Variable (DV) | Continuous outcome variable(s) being measured | Exam scores across subjects |
| Independent Variable (IV) / Factor | Categorical grouping variable | Teaching method (A, B, C) |
| Covariate | Continuous variable controlled for in MANCOVA | Pre-test score, age |
| Between-subjects factor | Different participants in each group | Drug vs. Placebo groups |
| Within-subjects factor | Same participants in all conditions | Time points in repeated measures |
| One-way MANOVA | One IV, multiple DVs | Group × multiple outcomes |
| Factorial MANOVA | Two or more IVs, multiple DVs | Group × Time × multiple outcomes |
3. The Mathematical Framework
3.1 The MANOVA Model
For a one-way MANOVA with $k$ groups, $p$ dependent variables, and $n_j$ observations in group $j$ ($N = \sum_j n_j$), each observation is modelled as:

$$\mathbf{y}_{ij} = \boldsymbol{\mu} + \boldsymbol{\tau}_j + \boldsymbol{\varepsilon}_{ij}$$

Where:
- $\mathbf{y}_{ij}$ ($p \times 1$) = observation vector for the $i$-th unit in group $j$.
- $\boldsymbol{\mu}$ ($p \times 1$) = grand mean vector.
- $\boldsymbol{\tau}_j$ ($p \times 1$) = effect of group $j$, with constraint $\sum_j n_j\boldsymbol{\tau}_j = \mathbf{0}$.
- $\boldsymbol{\varepsilon}_{ij}$ ($p \times 1$) = random error vector, assumed $\boldsymbol{\varepsilon}_{ij} \sim N_p(\mathbf{0}, \boldsymbol{\Sigma})$.
The key assumption is that all groups share the same within-group covariance matrix (homogeneity of covariance matrices).
3.2 The Matrix of Sum of Squares and Cross Products (SSCP)
MANOVA partitions the total variation in the multivariate data into between-group and within-group components using Sum of Squares and Cross Products (SSCP) matrices:
Total SSCP matrix ($\mathbf{T}$):

$$\mathbf{T} = \sum_{j=1}^{k}\sum_{i=1}^{n_j}(\mathbf{y}_{ij} - \bar{\mathbf{y}})(\mathbf{y}_{ij} - \bar{\mathbf{y}})^\top$$

Between-group (Hypothesis) SSCP matrix ($\mathbf{H}$):

$$\mathbf{H} = \sum_{j=1}^{k} n_j(\bar{\mathbf{y}}_j - \bar{\mathbf{y}})(\bar{\mathbf{y}}_j - \bar{\mathbf{y}})^\top$$

Within-group (Error) SSCP matrix ($\mathbf{E}$):

$$\mathbf{E} = \sum_{j=1}^{k}\sum_{i=1}^{n_j}(\mathbf{y}_{ij} - \bar{\mathbf{y}}_j)(\mathbf{y}_{ij} - \bar{\mathbf{y}}_j)^\top$$

Fundamental decomposition:

$$\mathbf{T} = \mathbf{H} + \mathbf{E}$$

This is the multivariate generalisation of $SS_{total} = SS_{between} + SS_{within}$.
Degrees of freedom:
- $df_H = k - 1$ (between groups)
- $df_E = N - k$ (within groups)
- $df_T = N - 1$ (total)
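The SSCP decomposition can be verified numerically; a minimal NumPy sketch (the helper name and simulated data are illustrative):

```python
import numpy as np

def sscp_partition(groups):
    """Return (H, E, T) SSCP matrices for a list of (n_j, p) group arrays."""
    X = np.vstack(groups)
    grand = X.mean(axis=0)
    # Between-group SSCP: weighted outer products of group-mean deviations
    H = sum(len(g) * np.outer(g.mean(axis=0) - grand, g.mean(axis=0) - grand)
            for g in groups)
    # Within-group SSCP: cross-products of deviations from each group mean
    E = sum((g - g.mean(axis=0)).T @ (g - g.mean(axis=0)) for g in groups)
    # Total SSCP: cross-products of deviations from the grand mean
    T = (X - grand).T @ (X - grand)
    return H, E, T

rng = np.random.default_rng(0)
groups = [rng.normal(loc=m, size=(15, 2)) for m in (0.0, 0.5, 1.0)]
H, E, T = sscp_partition(groups)
```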
3.3 The SSCP Matrices in Detail
The diagonal elements of $\mathbf{H}$ and $\mathbf{E}$ are the familiar $SS_{between}$ and $SS_{within}$ values from separate ANOVAs for each dependent variable. The off-diagonal elements capture the covariance structure — how variation in one DV covaries with variation in another, both between and within groups. This is what MANOVA uses beyond what separate ANOVAs provide.
For dependent variables $l$ and $m$:

$$h_{lm} = \sum_{j=1}^{k} n_j(\bar{y}_{jl} - \bar{y}_l)(\bar{y}_{jm} - \bar{y}_m), \qquad e_{lm} = \sum_{j=1}^{k}\sum_{i=1}^{n_j}(y_{ijl} - \bar{y}_{jl})(y_{ijm} - \bar{y}_{jm})$$
3.4 The Hypothesis Test
MANOVA tests whether the between-group variation is large relative to the within-group variation, simultaneously for all dependent variables. This is assessed through the eigenvalues of the matrix $\mathbf{E}^{-1}\mathbf{H}$:

$$\mathbf{E}^{-1}\mathbf{H}\,\mathbf{v}_i = \lambda_i\mathbf{v}_i$$

Where:
- $\lambda_i$ = the $i$-th eigenvalue (ratio of between-group to within-group variation in the $i$-th discriminant direction).
- $\mathbf{v}_i$ = the $i$-th eigenvector (the direction in multivariate space most discriminating between groups).
- $s = \min(p, k - 1)$ = the number of non-zero eigenvalues.
The eigenvalues are the basis for all four multivariate test statistics (Section 5).
3.5 The Estimated Within-Group Covariance Matrix
The pooled within-group covariance matrix (the multivariate analogue of $MS_{within}$) is estimated as:

$$\mathbf{S}_{pooled} = \frac{\mathbf{E}}{N - k}$$
This is an unbiased estimator of the common within-group covariance matrix under the homogeneity assumption.
3.6 The Factorial MANOVA Model
For a two-way factorial MANOVA with factors A ($a$ levels) and B ($b$ levels):

$$\mathbf{y}_{ijl} = \boldsymbol{\mu} + \boldsymbol{\alpha}_i + \boldsymbol{\beta}_j + (\boldsymbol{\alpha\beta})_{ij} + \boldsymbol{\varepsilon}_{ijl}$$

Where:
- $\boldsymbol{\alpha}_i$ = main effect of factor A at level $i$.
- $\boldsymbol{\beta}_j$ = main effect of factor B at level $j$.
- $(\boldsymbol{\alpha\beta})_{ij}$ = interaction effect of A × B.
- $\boldsymbol{\varepsilon}_{ijl} \sim N_p(\mathbf{0}, \boldsymbol{\Sigma})$.
Three separate SSCP matrices and hypothesis tests are conducted: for the main effect of A, the main effect of B, and the A × B interaction.
4. Assumptions of MANOVA and MANCOVA
MANOVA and MANCOVA rest on several critical assumptions. Violations of these assumptions can lead to inflated Type I error rates, reduced power, or misleading results.
4.1 Multivariate Normality
Assumption: Within each group, the dependent variables jointly follow a multivariate normal distribution $N_p(\boldsymbol{\mu}_j, \boldsymbol{\Sigma})$.
Why it matters: The multivariate test statistics (Wilks' Lambda, Pillai's trace, etc.) are derived assuming multivariate normality. Severe departures — particularly heavy tails or outliers — inflate Type I error rates.
How to check:
- Mardia's tests: Test multivariate skewness ($b_{1,p}$) and kurtosis ($b_{2,p}$) — the most widely used multivariate normality tests.
- Royston's test: Extends the Shapiro-Wilk test to the multivariate setting.
- Henze-Zirkler test: A consistent test of multivariate normality based on the distance between the empirical and normal characteristic functions.
- Q-Q plots of Mahalanobis distances: If multivariate normality holds, the squared Mahalanobis distances should follow a $\chi^2_p$ distribution. Plot the ordered $D_i^2$ vs. quantiles of $\chi^2_p$ — departures from the diagonal indicate non-normality or outliers.
Robustness: MANOVA is fairly robust to moderate departures from multivariate normality when group sizes are large and equal, particularly for Pillai's Trace (the most robust statistic). With small or unequal group sizes, non-normality is more problematic.
4.2 Homogeneity of Covariance Matrices (Homoscedasticity)
Assumption: All groups share the same within-group covariance matrix:

$$\boldsymbol{\Sigma}_1 = \boldsymbol{\Sigma}_2 = \cdots = \boldsymbol{\Sigma}_k = \boldsymbol{\Sigma}$$
Why it matters: This is the multivariate analogue of homoscedasticity in ANOVA. Violations lead to inflated or deflated Type I error rates depending on the pattern of inequality and the relative group sizes.
How to check:
- Box's M test: The standard test for equality of covariance matrices. Tests $H_0: \boldsymbol{\Sigma}_1 = \boldsymbol{\Sigma}_2 = \cdots = \boldsymbol{\Sigma}_k$.
Box's M is converted to an approximate $\chi^2$ or $F$ statistic. A significant result ($p < .05$) indicates heterogeneous covariance matrices.
⚠️ Box's M is extremely sensitive to non-normality and detects trivial violations in large samples. A common guideline is to be concerned only when $p < .001$ rather than $p < .05$. Always pair Box's M with visual inspection of the covariance matrices.
What to do when violated:
- Use Pillai's Trace instead of Wilks' Lambda (Pillai's Trace is more robust to heterogeneous covariance matrices).
- If group sizes are equal, MANOVA results are robust to moderate violations.
- Use heteroscedasticity-robust MANOVA variants (e.g., James's test, Johansen's procedure).
4.3 Independence of Observations
Assumption: Each observation must be independent of all others. No observation should influence or be correlated with any other observation.
Why it matters: Correlation among observations (e.g., siblings in the same family, students in the same classroom, repeated measures from the same participant) violates the independence assumption and inflates Type I error.
How to check: Consider the study design. If observations are nested, clustered, or repeated, use appropriate models (multilevel MANOVA, repeated measures MANOVA).
4.4 Absence of Multivariate Outliers
Assumption: There are no extreme multivariate outliers — observations that are unusual on the combination of DVs even if they are not extreme on any single DV.
Why it matters: Multivariate outliers can distort the SSCP matrices and dramatically influence the test statistics.
How to check:
- Mahalanobis distance: $D_i^2 = (\mathbf{y}_i - \bar{\mathbf{y}})^\top\mathbf{S}^{-1}(\mathbf{y}_i - \bar{\mathbf{y}})$. Compare to $\chi^2_p$ critical values (e.g., $\chi^2_p(.999)$). Observations exceeding this threshold are flagged as potential outliers.
- Robust Mahalanobis distance: Uses robust (MCD or MVE) location and scatter estimates, which are not themselves influenced by outliers.
4.5 Linearity Among Dependent Variables
Assumption: The relationships among the dependent variables are linear within each group.
Why it matters: MANOVA uses covariances (which measure linear association) to capture the multivariate structure. Non-linear relationships among DVs reduce the efficiency of MANOVA.
How to check: Scatter plot matrix (pairs plot) of the DVs within each group; look for non-linear patterns.
4.6 No Perfect Multicollinearity Among Dependent Variables
Assumption: No dependent variable is a perfect linear combination of other dependent variables.
Why it matters: Perfect multicollinearity makes the within-group covariance matrix singular (non-invertible), preventing the computation of $\mathbf{E}^{-1}$.
How to check: Compute the condition number or determinant of $\mathbf{E}$. If $|\mathbf{E}| \approx 0$ or the condition number is very large, multicollinearity is a problem.
4.7 Additional Assumptions for MANCOVA
Homogeneity of regression (slopes): In MANCOVA, the regression of the DVs on the covariates must be the same across all groups — i.e., there is no interaction between the factor(s) and the covariate(s).
How to check: Fit a model including the factor × covariate interaction and test its significance. If significant, the homogeneity of regression assumption is violated and MANCOVA is inappropriate.
Covariate measured without error and not affected by treatment: Covariates should be measured reliably (low measurement error) and should not be caused by (a consequence of) the group membership — otherwise, the covariate adjustment is biased.
5. Multivariate Test Statistics
MANOVA provides four multivariate test statistics, all based on the eigenvalues $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_s$ of $\mathbf{E}^{-1}\mathbf{H}$, where $s = \min(p, df_H)$. Each statistic summarises the eigenvalues differently and has different properties.
5.1 Wilks' Lambda ($\Lambda$)
Wilks' Lambda is the most widely used multivariate test statistic. It is the ratio of the determinant of the within-group SSCP matrix to the determinant of the total SSCP matrix:

$$\Lambda = \frac{|\mathbf{E}|}{|\mathbf{H} + \mathbf{E}|} = \prod_{i=1}^{s}\frac{1}{1 + \lambda_i}$$

Range: $0 \le \Lambda \le 1$.
Interpretation:
- $\Lambda = 1$: No group differences (all eigenvalues $= 0$; $\mathbf{H} = \mathbf{0}$).
- $\Lambda \to 0$: Perfect group separation (at least one eigenvalue is infinite).
- Smaller $\Lambda$ → stronger group differences → more likely to reject $H_0$.
Conversion to F-statistic (Rao's approximation):

$$F = \frac{1 - \Lambda^{1/t}}{\Lambda^{1/t}} \cdot \frac{df_2}{df_1} \sim F(df_1, df_2)$$

Where:

$$df_1 = p\,df_H, \qquad df_2 = wt - \frac{p\,df_H - 2}{2}, \qquad w = df_E + df_H - \frac{p + df_H + 1}{2}, \qquad t = \sqrt{\frac{p^2\,df_H^2 - 4}{p^2 + df_H^2 - 5}}$$

(with $t = 1$ when $p^2 + df_H^2 - 5 \le 0$). The $F$ is exact when $p = 1$ or $p = 2$ or $k = 2$ or $k = 3$.
For $k = 2$ (two groups or testing one contrast):

$$F = \frac{1 - \Lambda}{\Lambda} \cdot \frac{N - p - 1}{p} \sim F(p,\; N - p - 1)$$
- Powerful when group differences are spread across multiple discriminant dimensions.
- More sensitive to violations of multivariate normality than Pillai's Trace.
- The most commonly reported MANOVA statistic in published research.
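The two equivalent forms of Wilks' Lambda — determinant ratio and eigenvalue product — can be checked against each other on simulated data (a sketch, not DataStatPro's implementation):

```python
import numpy as np

rng = np.random.default_rng(1)
groups = [rng.normal(loc=m, size=(20, 3)) for m in (0.0, 0.5, 1.0)]
X = np.vstack(groups)
grand = X.mean(axis=0)
H = sum(len(g) * np.outer(g.mean(axis=0) - grand, g.mean(axis=0) - grand)
        for g in groups)
E = sum((g - g.mean(axis=0)).T @ (g - g.mean(axis=0)) for g in groups)

wilks_det = np.linalg.det(E) / np.linalg.det(E + H)   # determinant form
lam = np.linalg.eigvals(np.linalg.solve(E, H)).real   # eigenvalues of E^-1 H
wilks_eig = np.prod(1.0 / (1.0 + lam))                # eigenvalue-product form
```

The agreement follows from $|\mathbf{E}+\mathbf{H}|/|\mathbf{E}| = |\mathbf{I} + \mathbf{E}^{-1}\mathbf{H}| = \prod_i(1 + \lambda_i)$.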
5.2 Pillai's Trace ($V$)

$$V = \mathrm{tr}\left[\mathbf{H}(\mathbf{H} + \mathbf{E})^{-1}\right] = \sum_{i=1}^{s}\frac{\lambda_i}{1 + \lambda_i}$$

Range: $0 \le V \le s$.
Interpretation:
- $V = 0$: No group differences.
- $V = s$ (its maximum): Perfect group separation.
- Larger $V$ → stronger group differences.
Conversion to F-statistic:

$$F = \frac{2n + s + 1}{2m + s + 1} \cdot \frac{V}{s - V} \sim F\big(s(2m + s + 1),\; s(2n + s + 1)\big)$$

Where $s = \min(p, df_H)$, $m = (|p - df_H| - 1)/2$, $n = (df_E - p - 1)/2$ (details of notation vary by software).
Properties:
- The most robust statistic to violations of homogeneity of covariance matrices and multivariate normality.
- Recommended when Box's M is significant or group sizes are unequal.
- Less powerful than Wilks' Lambda when group differences are concentrated on a single discriminant dimension.
- Preferred statistic when assumptions are questionable.
5.3 Hotelling-Lawley Trace ($T$ / $U$)

$$T = \mathrm{tr}(\mathbf{E}^{-1}\mathbf{H}) = \sum_{i=1}^{s}\lambda_i$$

Range: $T \ge 0$ (unbounded above).
Conversion to F-statistic:

$$F = \frac{2(sn + 1)}{s^2(2m + s + 1)} \cdot T$$

More precisely, $F$ approximately follows an $F\big(s(2m + s + 1),\; 2(sn + 1)\big)$ distribution, with $m$ and $n$ defined as for Pillai's Trace. The exact approximation formula varies slightly by software.
Properties:
- The most powerful statistic when group differences are concentrated on a single dominant discriminant dimension.
- Most sensitive to outliers (least robust).
- Equivalent to Hotelling's $T^2$ test when $k = 2$.
5.4 Roy's Largest Root ($\Theta$)

$$\Theta = \lambda_{max}$$

Where $\lambda_{max}$ is the largest eigenvalue of $\mathbf{E}^{-1}\mathbf{H}$.
Range: $\Theta \ge 0$.
Conversion to F (approximate):

$$F = \lambda_{max} \cdot \frac{df_E - d + df_H}{d}, \qquad d = \max(p, df_H)$$
Properties:
- The most powerful statistic when all group separation is on a single discriminant dimension — a condition that is rarely met in practice.
- Provides an upper bound to the $F$ distribution and is typically approximated.
- The least robust statistic; severely affected by heterogeneous covariance matrices and outliers.
- Most useful when there is a strong theoretical reason to expect a single dominant group difference.
5.5 Comparison of the Four Test Statistics
| Statistic | Formula | Range | Most Powerful When | Most Robust | Recommended When |
|---|---|---|---|---|---|
| Wilks' $\Lambda$ | $\prod_i (1 + \lambda_i)^{-1}$ | $[0, 1]$ | Effects spread across dimensions | Moderate | Assumptions met; standard choice |
| Pillai's $V$ | $\sum_i \lambda_i/(1 + \lambda_i)$ | $[0, s]$ | Effects spread across dimensions | Most robust | Assumptions questionable; unequal $n_j$ |
| Hotelling-Lawley $T$ | $\sum_i \lambda_i$ | $[0, \infty)$ | Single dominant dimension | Least robust | Assumptions met; single dimension expected |
| Roy's $\Theta$ | $\lambda_{max}$ | $[0, \infty)$ | Single dimension | Least robust | Single dominant dimension; theory-driven |
💡 In most applied research, report all four statistics. When they agree, the conclusion is robust. When they disagree, report Pillai's Trace as the most reliable result and investigate why they differ (often due to the structure of group differences).
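All four statistics are simple functions of the eigenvalues of $\mathbf{E}^{-1}\mathbf{H}$, which makes the comparison table easy to verify directly (the helper name and toy eigenvalues are illustrative):

```python
import numpy as np

def multivariate_stats(eigenvalues):
    """The four MANOVA statistics as functions of the eigenvalues of E^-1 H."""
    lam = np.asarray(eigenvalues, dtype=float)
    return {
        "wilks_lambda": float(np.prod(1.0 / (1.0 + lam))),       # det(E)/det(H+E)
        "pillai_trace": float(np.sum(lam / (1.0 + lam))),        # tr[H (H+E)^-1]
        "hotelling_lawley": float(np.sum(lam)),                  # tr(E^-1 H)
        "roys_largest_root": float(lam.max()),                   # largest eigenvalue
    }

stats_out = multivariate_stats([0.8, 0.2])
```

With a dominant first eigenvalue (0.8 vs 0.2), Roy's root captures most of the separation, illustrating why it is most powerful when the group differences sit on one dimension.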
5.6 Hotelling's $T^2$ (Two-Group Case)
When $k = 2$ (only two groups), MANOVA reduces to Hotelling's $T^2$ test — the multivariate generalisation of the independent-samples $t$-test:

$$T^2 = \frac{n_1 n_2}{n_1 + n_2}(\bar{\mathbf{y}}_1 - \bar{\mathbf{y}}_2)^\top\mathbf{S}_{pooled}^{-1}(\bar{\mathbf{y}}_1 - \bar{\mathbf{y}}_2)$$

Converted to an F-statistic:

$$F = \frac{n_1 + n_2 - p - 1}{p(n_1 + n_2 - 2)}\,T^2 \sim F(p,\; n_1 + n_2 - p - 1)$$

This is an exact F-test (not an approximation) and is equivalent to all four multivariate statistics when $k = 2$.
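A sketch of the two-group computation (the helper name and demo data are illustrative; for $p = 1$ the statistic reduces to the squared pooled-variance $t$-statistic):

```python
import numpy as np

def hotelling_t2(X1, X2):
    """Two-sample Hotelling's T^2 with its exact F conversion."""
    n1, n2 = len(X1), len(X2)
    p = X1.shape[1]
    diff = X1.mean(axis=0) - X2.mean(axis=0)
    A1 = X1 - X1.mean(axis=0)
    A2 = X2 - X2.mean(axis=0)
    pooled = (A1.T @ A1 + A2.T @ A2) / (n1 + n2 - 2)   # pooled covariance
    t2 = (n1 * n2) / (n1 + n2) * diff @ np.linalg.solve(pooled, diff)
    f_stat = (n1 + n2 - p - 1) / (p * (n1 + n2 - 2)) * t2
    return float(t2), float(f_stat)

rng = np.random.default_rng(2)
g1 = rng.normal(size=(12, 2))
g2 = rng.normal(loc=0.7, size=(15, 2))
t2_demo, f_demo = hotelling_t2(g1, g2)
```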
6. MANCOVA: Adding Covariates
6.1 The MANCOVA Model
MANCOVA extends MANOVA by including one or more continuous covariates ($\mathbf{x}$):

$$\mathbf{y}_{ij} = \boldsymbol{\mu} + \boldsymbol{\tau}_j + \mathbf{B}(\mathbf{x}_{ij} - \bar{\mathbf{x}}) + \boldsymbol{\varepsilon}_{ij}$$

Where:
- $\mathbf{B}$ ($p \times q$) = matrix of regression coefficients for the $q$ covariates (one row per DV, one column per covariate).
- $\mathbf{x}_{ij}$ = vector of covariate values for observation $i$ in group $j$.
- $\boldsymbol{\varepsilon}_{ij} \sim N_p(\mathbf{0}, \boldsymbol{\Sigma})$.
6.2 How MANCOVA Works: Partitioning SSCP with Covariates
MANCOVA computes adjusted SSCP matrices that remove the linear effect of the covariates from both and :
Step 1: Compute the SSCP matrices involving the covariates: $\mathbf{E}_{xx}$ (SSCP of the covariates within groups) and $\mathbf{E}_{yx}$ (cross-products between DVs and covariates within groups).
Step 2: Adjust the within-group SSCP for covariates:

$$\mathbf{E}^* = \mathbf{E}_{yy} - \mathbf{E}_{yx}\mathbf{E}_{xx}^{-1}\mathbf{E}_{yx}^\top$$

Step 3: Similarly adjust the hypothesis SSCP to obtain $\mathbf{H}^*$.
Step 4: Compute test statistics using $\mathbf{H}^*$ and $\mathbf{E}^*$ as in MANOVA.
The adjusted matrices remove the variance explained by the covariates, leaving a more precise comparison of group means on the residualised DVs.
6.3 Purposes of Covariates
Purpose 1: Increasing Power (Noise Reduction) When the covariates are strongly related to the DVs, removing their variance from $\mathbf{E}$ reduces error variance, increasing statistical power for the group comparison.
Purpose 2: Controlling for Confounding (Bias Reduction) When groups differ on the covariate (e.g., groups have different pre-test scores), the covariate adjustment removes this pre-existing difference, providing a fairer comparison of group effects.
⚠️ Covariates should be selected based on theoretical grounds before data collection. Adding covariates post-hoc to achieve significance is p-hacking. Including covariates unrelated to the DVs can actually reduce power.
6.4 Adjusted Means (Estimated Marginal Means)
MANCOVA produces adjusted means (also called estimated marginal means or least-squares means) — the group means on the DVs after statistically controlling for the covariates, evaluated at the grand mean of the covariate(s):

$$\bar{\mathbf{y}}_j^{adj} = \bar{\mathbf{y}}_j - \mathbf{B}_w(\bar{\mathbf{x}}_j - \bar{\mathbf{x}})$$

Where $\mathbf{B}_w$ are the pooled within-group regression coefficients.
These adjusted means represent what the group means would have been if all groups had the same mean covariate value.
6.5 Testing the Covariate Effect
In addition to testing for group differences, MANCOVA also tests whether the covariates themselves have a significant multivariate relationship with the DVs. This is tested using the same four multivariate statistics applied to the SSCP for the covariate(s).
A significant covariate test ($p < .05$) confirms that the covariate meaningfully reduces error variance and that including it was justified.
6.6 Homogeneity of Regression Check
Before conducting MANCOVA, test the assumption that the covariate-DV regression slopes are the same across groups ($\mathbf{B}_1 = \mathbf{B}_2 = \cdots = \mathbf{B}_k$):
Test: Add the group × covariate interaction to the model and test its significance using the multivariate statistics.
If the interaction is significant ($p < .05$), the homogeneity of regression assumption is violated and standard MANCOVA is inappropriate. Options include:
- Testing group differences separately at different covariate values (Johnson-Neyman regions).
- Using separate regression models per group.
- Modifying the research question to examine the interaction itself.
7. Effect Size Measures
Statistical significance tells us whether group differences are likely to be real; effect size tells us how large those differences are in practical terms. Multiple effect size measures are available for MANOVA/MANCOVA.
7.1 Multivariate Effect Sizes
7.1.1 Partial Eta-Squared ($\eta_p^2$)
The most widely reported multivariate effect size, computed separately for each test statistic:
From Wilks' Lambda:

$$\eta_{mult}^2 = 1 - \Lambda^{1/s}$$

Where $s = \min(p, df_H)$, as in Section 5.1. This represents the proportion of multivariate variance associated with the group effect.
From Pillai's Trace:

$$\eta_p^2 = \frac{V}{s}$$

General formula:

$$\eta_p^2 = \frac{SS_{effect}}{SS_{effect} + SS_{error}}$$
Interpretation:
| Effect Size | $\eta_p^2$ |
|---|---|
| Small | $\approx .01$ |
| Medium | $\approx .06$ |
| Large | $\ge .14$ |
⚠️ Partial eta-squared is a biased (upward) estimator of the population effect size, especially with small samples. Prefer omega-squared ($\omega^2$) or epsilon-squared ($\varepsilon^2$) for less biased estimates.
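The Λ-based multivariate partial eta-squared is a one-liner; a sketch assuming the formula 1 − Λ^(1/s) given above (helper name is illustrative):

```python
def multivariate_partial_eta_sq(wilks_lambda, p, df_hypothesis):
    """Multivariate partial eta-squared: 1 - Lambda**(1/s), s = min(p, df_h)."""
    s = min(p, df_hypothesis)
    return 1.0 - wilks_lambda ** (1.0 / s)

# Example: Lambda = .70 with p = 3 DVs and k = 3 groups (df_h = 2, so s = 2)
eta = multivariate_partial_eta_sq(0.70, p=3, df_hypothesis=2)
```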
7.1.2 Omega-Squared ()
Omega-squared provides an unbiased (or less biased) estimate of the population effect size:

$$\omega^2 = \frac{SS_{between} - (k - 1)MS_{within}}{SS_{total} + MS_{within}}$$

Or equivalently in mean-square form:

$$\omega^2 = \frac{(k - 1)(MS_{between} - MS_{within})}{SS_{total} + MS_{within}}$$
Omega-squared can be negative (round to zero when this occurs) — this indicates the population effect is likely zero or very small.
7.1.3 Epsilon-Squared ($\varepsilon^2$)
Another less-biased alternative:

$$\varepsilon^2 = \frac{SS_{between} - (k - 1)MS_{within}}{SS_{total}}$$

For multivariate settings, $\varepsilon^2$ is computed on the univariate follow-up ANOVAs.
7.1.4 Roy's $\theta$ as Effect Size
Roy's largest root directly represents the proportion of total multivariate variation explained by the strongest discriminant dimension:

$$\theta = \frac{\lambda_{max}}{1 + \lambda_{max}}$$

Interpreted on a 0–1 scale, $\theta$ is an effect size for the first (dominant) discriminant function.
7.2 Univariate Effect Sizes for Follow-Up Tests
After a significant MANOVA, follow-up univariate ANOVAs are conducted on each DV. Report univariate effect sizes for each:
Univariate Eta-Squared:

$$\eta^2 = \frac{SS_{between}}{SS_{total}}$$

Univariate Partial Eta-Squared:

$$\eta_p^2 = \frac{SS_{between}}{SS_{between} + SS_{within}}$$

Cohen's d (for two-group comparisons within a DV):

$$d = \frac{\bar{y}_1 - \bar{y}_2}{s_{pooled}}$$

Benchmarks for Cohen's d:

| $|d|$ | Effect Size |
|---|---|
| $\approx 0.2$ | Small |
| $\approx 0.5$ | Medium |
| $\ge 0.8$ | Large |
7.3 Discriminant Function Effect Sizes
After MANOVA, discriminant function analysis (DFA) can be used to characterise the nature of group differences. Each discriminant function $i$ has an associated canonical correlation $r_{c_i}$:

$$r_{c_i} = \sqrt{\frac{\lambda_i}{1 + \lambda_i}}$$

The squared canonical correlation ($r_{c_i}^2$) represents the proportion of variance in the discriminant scores explained by group membership for the $i$-th function:

| Discriminant Effect | $r_c^2$ |
|---|---|
| Small | $\approx .01$ |
| Medium | $\approx .09$ |
| Large | $\ge .25$ |
7.4 Multivariate Effect Sizes for MANCOVA
For MANCOVA, effect sizes are computed using the adjusted SSCP matrices $\mathbf{H}^*$ and $\mathbf{E}^*$ in place of $\mathbf{H}$ and $\mathbf{E}$. The adjusted $\eta_p^2$ reflects the group effect after controlling for covariates:

$$\eta_{p,adj}^2 = 1 - (\Lambda^*)^{1/s}$$
8. Follow-Up Analyses
A significant omnibus MANOVA result indicates that groups differ on the set of DVs, but does not specify which DVs or which groups are responsible. Follow-up analyses are needed to decompose and interpret the significant multivariate effect.
8.1 Strategy for Follow-Up Analysis
A principled approach to follow-up analyses after significant MANOVA:
Level 1 — Discriminant Function Analysis (DFA) Identifies which linear combination(s) of DVs best separate the groups. This is the most theoretically informative follow-up and should always be conducted first.
Level 2 — Univariate Follow-Up ANOVAs Tests each DV separately to identify which individual variables contribute to the group differences. Use a Bonferroni-corrected $\alpha' = \alpha/p$ for each test.
Level 3 — Post-Hoc Group Comparisons For significant univariate ANOVAs, conduct pairwise comparisons to identify which specific groups differ.
8.2 Discriminant Function Analysis (DFA)
After MANOVA, DFA finds the linear combinations of DVs (discriminant functions) that maximally separate the groups:

$$z_i = \mathbf{v}_i^\top\mathbf{y}$$

Where $\mathbf{v}_i$ is the $i$-th eigenvector of $\mathbf{E}^{-1}\mathbf{H}$ (the discriminant weights).
Standardised discriminant coefficients: Multiply raw weights by the within-group SD of each DV:

$$v_{il}^* = v_{il} \cdot s_l$$

Standardised coefficients indicate the relative importance of each DV in discriminating between groups (analogous to standardised regression coefficients).
Structure coefficients (discriminant loadings): Correlations between each DV and the discriminant function scores:

$$r_{l,i} = \mathrm{corr}(y_l, z_i)$$

Structure coefficients $\ge .30$ in absolute value are generally considered meaningful.
Number of significant discriminant functions: Use a sequential likelihood ratio (Bartlett) test:
For the $r$-th function (after removing the effects of all previous functions):

$$\chi^2 = -\left(N - 1 - \frac{p + k}{2}\right)\ln\Lambda_r, \qquad \Lambda_r = \prod_{i=r+1}^{s}\frac{1}{1 + \lambda_i}, \qquad df = (p - r)(k - 1 - r)$$
8.3 Univariate Follow-Up ANOVAs
For each significant DV from the univariate follow-up ANOVAs, report:
- $F$-statistic and $p$-value.
- Partial $\eta^2$ (effect size).
- Group means and standard deviations.
Alpha adjustment for multiple comparisons:
| Method | Formula | When to Use |
|---|---|---|
| Bonferroni | $\alpha' = \alpha/p$ | Conservative; planned comparisons |
| Holm | Sequential Bonferroni | Less conservative than Bonferroni |
| Benjamini-Hochberg | FDR control | Many DVs; controls false discovery rate |
| Roy-Bargmann Stepdown | Sequential ANCOVA | Theoretically ordered DVs |
| Protected ANOVAs | Conduct only if MANOVA significant | Moderate; no per-test correction |
💡 "Protected ANOVAs" — conducting univariate tests only if the omnibus MANOVA is significant — provides some protection against Type I inflation without requiring per-test corrections, but is controversial. The Bonferroni or Holm correction is safer.
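Bonferroni is a single division ($\alpha/p$); Holm can be sketched as follows (an illustrative implementation, not a specific library call):

```python
import numpy as np

def holm_adjust(pvals):
    """Holm step-down adjusted p-values (compare adjusted p to alpha directly)."""
    p = np.asarray(pvals, dtype=float)
    m = len(p)
    order = np.argsort(p)              # test p-values from smallest to largest
    adjusted = np.empty(m)
    running_max = 0.0
    for rank, idx in enumerate(order):
        val = min((m - rank) * p[idx], 1.0)     # multiplier shrinks at each step
        running_max = max(running_max, val)     # enforce monotonicity
        adjusted[idx] = running_max
    return adjusted

# Four univariate follow-up p-values (illustrative)
adjusted = holm_adjust([0.01, 0.04, 0.03, 0.20])
```

Because the multiplier shrinks from $p$ down to 1, Holm rejects everything Bonferroni rejects and sometimes more, while still controlling the familywise error rate.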
8.4 Post-Hoc Pairwise Comparisons
After a significant univariate ANOVA on a specific DV, pairwise comparisons identify which group pairs differ. Common post-hoc tests:
| Test | Controls | Recommended When |
|---|---|---|
| Tukey HSD | Familywise error | Equal group sizes; all pairwise comparisons |
| Bonferroni | Familywise error | Any number of comparisons; conservative |
| Games-Howell | Familywise error | Unequal variances and/or unequal $n_j$ |
| Scheffé | Familywise error | All possible contrasts; very conservative |
| LSD (Fisher) | None | Only if ANOVA significant; least conservative |
| Dunnett | Familywise error | Multiple groups vs. single control group |
8.5 Roy-Bargmann Stepdown Analysis
The Roy-Bargmann stepdown analysis is a theoretically motivated follow-up procedure for ordered DVs (when there is a theoretical ordering of importance):
- Test the most important DV (DV1) as a univariate ANOVA.
- Test the second DV (DV2) as an ANCOVA, with DV1 as a covariate.
- Test the third DV (DV3) as an ANCOVA, with DV1 and DV2 as covariates.
- Continue sequentially.
This tests whether each DV adds unique group discrimination beyond the information in all higher-priority DVs. Apply Bonferroni correction across the stepdown tests.
8.6 Planned Contrasts in MANOVA
Planned (a priori) contrasts test specific theoretical hypotheses about group comparisons formulated before data collection. They are more powerful than post-hoc tests because they do not require the omnibus MANOVA to be significant.
A contrast $\boldsymbol{\psi} = \sum_{j=1}^{k} c_j\boldsymbol{\mu}_j$ (with $\sum_j c_j = 0$) is tested using:

$$T^2 = \left(\sum_j c_j\bar{\mathbf{y}}_j\right)^\top\left[\left(\sum_j \frac{c_j^2}{n_j}\right)\mathbf{S}_{pooled}\right]^{-1}\left(\sum_j c_j\bar{\mathbf{y}}_j\right)$$

With unequal group sizes, some software evaluates $\sum_j c_j^2/n_j$ using the harmonic mean of the group sizes. The test statistic is Hotelling's $T^2$ for the contrast.
9. Power Analysis and Sample Size
9.1 Factors Affecting Statistical Power in MANOVA
| Factor | Effect on Power |
|---|---|
| Sample size | Larger $N$ → higher power |
| Effect size | Larger group differences → higher power |
| Number of DVs | More DVs → lower power (more df used); but power increases if added DVs are informative |
| Correlation among DVs | Higher within-group correlations → higher power if correlated with group differences |
| Alpha level | Higher $\alpha$ → higher power (at cost of Type I error) |
| Number of groups | More groups → lower power per comparison |
| Equality of group sizes | Equal sizes → higher power |
| Covariate strength (MANCOVA) | Stronger covariates → higher power |
9.2 Rules of Thumb for Sample Size
MANOVA requires adequate sample sizes to:
- Estimate the within-group covariance matrix reliably.
- Maintain the validity of asymptotic test statistics.
Minimum requirements:
- Total sample size $N > p + k$ (to ensure $\mathbf{E}$ is invertible).
- More observations than DVs in each group ($n_j > p$), preferably comfortably more.
- Larger per-group samples (on the order of 20+) for reasonable power with moderate effects.
Rule of thumb (Stevens, 2002): per group for adequate power with medium effects and 5–8 DVs.
More precise guidance (Tabachnick & Fidell):
- Small effects: per group.
- Medium effects: per group.
- Large effects: per group.
9.3 Power Analysis Using Non-Central F
For a one-way MANOVA with $k$ groups and multivariate effect size $f^2$ (Cohen):
The non-centrality parameter is a function of $f^2$ and the sample size, and the Wilks' Lambda test statistic approximately follows a non-central $F$ distribution. Exact power computations require specialised software (G*Power, SAS PROC POWER) that implements the non-central Wilks' distribution.
Cohen's multivariate $f^2$ from $\Lambda$:

$$f^2 = \frac{1 - \Lambda^{1/s}}{\Lambda^{1/s}} = \frac{\eta_{mult}^2}{1 - \eta_{mult}^2}$$

Benchmarks:

| Effect Size | $f^2$ |
|---|---|
| Small | $\approx .02$ |
| Medium | $\approx .15$ |
| Large | $\ge .35$ |
9.4 Impact of Number of DVs on Power
The relationship between number of DVs and power is nuanced:
- Adding informative DVs (those that differ between groups) increases power.
- Adding uninformative DVs (those that do not differ between groups and are correlated with other DVs) decreases power by consuming degrees of freedom.
- Optimal strategy: Include only DVs that are theoretically relevant and expected to differ between groups.
10. Model Fit and Evaluation
10.1 Overall Multivariate Significance
The primary model fit assessment in MANOVA is whether the four multivariate statistics reach significance:
| Test | $H_0$ | Significant Result |
|---|---|---|
| Wilks' $\Lambda$ | All group mean vectors are equal | $H_1$: At least one mean vector differs |
| Pillai's $V$ | Same as above | $H_1$: Same interpretation |
| Hotelling-Lawley $T$ | Same as above | $H_1$: Same interpretation |
| Roy's $\Theta$ | Same as above | $H_1$ (upper bound): Same interpretation |
10.2 Univariate ANOVA Results
For each dependent variable, report:
| Statistic | Formula | Interpretation |
|---|---|---|
| $SS_{between}$ | $\sum_j n_j(\bar{y}_j - \bar{y})^2$ | Between-group sum of squares |
| $SS_{within}$ | $\sum_j\sum_i (y_{ij} - \bar{y}_j)^2$ | Within-group sum of squares |
| $MS_{between}$ | $SS_{between}/(k - 1)$ | Between-group mean square |
| $MS_{within}$ | $SS_{within}/(N - k)$ | Within-group mean square (error) |
| $F$ | $MS_{between}/MS_{within}$ | Test statistic |
| $\eta_p^2$ | $SS_{between}/(SS_{between} + SS_{within})$ | Effect size |
10.3 Discriminant Analysis Summary
Report the canonical correlations, eigenvalues, and variance explained for each significant discriminant function:
| Function | Eigenvalue | Canonical $r_c$ | % Variance | Cumulative % |
|---|---|---|---|---|
| 1 | $\lambda_1$ | $r_{c_1}$ | $\lambda_1/\sum_i\lambda_i$ | $\lambda_1/\sum_i\lambda_i$ |
| 2 | $\lambda_2$ | $r_{c_2}$ | $\lambda_2/\sum_i\lambda_i$ | $(\lambda_1 + \lambda_2)/\sum_i\lambda_i$ |
10.4 MANCOVA-Specific Fit Information
For MANCOVA, additionally report:
- Multivariate test of the covariate effect (using all four statistics).
- Regression coefficients for each DV on each covariate.
- Adjusted group means (estimated marginal means) for each DV.
- Test of homogeneity of regression (covariate × group interaction).
11. Assumption Checking and Diagnostics
11.1 Multivariate Normality Tests
11.1.1 Mardia's Tests
Mardia's multivariate skewness:

$b_{1,p} = \frac{1}{n^2}\sum_{i=1}^{n}\sum_{j=1}^{n}\left[(\mathbf{x}_i - \bar{\mathbf{x}})^\top \mathbf{S}^{-1}(\mathbf{x}_j - \bar{\mathbf{x}})\right]^3$

Under multivariate normality: $\dfrac{n\,b_{1,p}}{6}$ approximately follows $\chi^2$ with $p(p+1)(p+2)/6$ degrees of freedom.

Mardia's multivariate kurtosis:

$b_{2,p} = \frac{1}{n}\sum_{i=1}^{n}\left[(\mathbf{x}_i - \bar{\mathbf{x}})^\top \mathbf{S}^{-1}(\mathbf{x}_i - \bar{\mathbf{x}})\right]^2$

Under multivariate normality: $E[b_{2,p}] = p(p+2)$.

Test statistic: $z = \dfrac{b_{2,p} - p(p+2)}{\sqrt{8p(p+2)/n}}$, approximately $N(0, 1)$.
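The two Mardia statistics above can be computed directly from the data matrix. Below is a minimal NumPy/SciPy sketch; `mardia_tests` is an illustrative helper name (not a DataStatPro API), and it uses the maximum-likelihood ($1/n$) covariance, as in Mardia's original formulation:

```python
import numpy as np
from scipy import stats

def mardia_tests(X):
    """Mardia's multivariate skewness and kurtosis tests for an
    (n, p) data matrix X. Returns (b1p, p_skew, b2p, p_kurt)."""
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    Xc = X - X.mean(axis=0)
    S = Xc.T @ Xc / n                    # ML (1/n) covariance estimate
    D = Xc @ np.linalg.inv(S) @ Xc.T     # n x n matrix of standardised cross products

    b1p = (D ** 3).sum() / n**2          # multivariate skewness
    b2p = (np.diag(D) ** 2).sum() / n    # multivariate kurtosis

    skew_stat = n * b1p / 6              # ~ chi2 with p(p+1)(p+2)/6 df
    skew_df = p * (p + 1) * (p + 2) / 6
    p_skew = stats.chi2.sf(skew_stat, skew_df)

    z = (b2p - p * (p + 2)) / np.sqrt(8 * p * (p + 2) / n)
    p_kurt = 2 * stats.norm.sf(abs(z))   # two-tailed normal test
    return b1p, p_skew, b2p, p_kurt
```

For multivariate normal data, `b2p` should be close to $p(p+2)$ and both $p$-values should typically be non-significant.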
11.1.2 Mahalanobis Distance Q-Q Plot
Compute the squared Mahalanobis distance for each observation within each group:

$D_i^2 = (\mathbf{x}_i - \bar{\mathbf{x}}_g)^\top \mathbf{S}_g^{-1}(\mathbf{x}_i - \bar{\mathbf{x}}_g)$

Plot the ordered $D_i^2$ values against the quantiles of $\chi^2_p$. Under multivariate normality, points should fall approximately on a straight line. Curved patterns or isolated points at the upper right indicate non-normality or outliers.
Critical value for outlier identification: $D_i^2 > \chi^2_{p,\,0.001}$ flags a potential multivariate outlier.
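A minimal sketch of the distance computation and the $\chi^2$ outlier cutoff (the function names are illustrative, not DataStatPro's implementation):

```python
import numpy as np
from scipy import stats

def mahalanobis_sq(X):
    """Squared Mahalanobis distance of each row of X from the sample
    mean, using the unbiased sample covariance."""
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)
    S_inv = np.linalg.inv(np.cov(X, rowvar=False))
    # diagonal of Xc S^{-1} Xc'
    return np.einsum('ij,jk,ik->i', Xc, S_inv, Xc)

def flag_outliers(X, alpha=0.001):
    """Flag rows whose D^2 exceeds the chi-square critical value with
    p degrees of freedom (the conventional alpha = .001 cutoff)."""
    d2 = mahalanobis_sq(X)
    cutoff = stats.chi2.ppf(1 - alpha, df=X.shape[1])
    return d2 > cutoff, cutoff
```

A useful internal check: with the unbiased covariance, the distances always satisfy $\sum_i D_i^2 = (n-1)p$ exactly.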
11.2 Box's M Test for Homogeneity of Covariance Matrices
Approximate chi-squared statistic:

$M = (N - k)\ln\det(\mathbf{S}_{\text{pooled}}) - \sum_{j=1}^{k}(n_j - 1)\ln\det(\mathbf{S}_j), \qquad \chi^2 \approx M(1 - c)$

Where:

$c = \left[\sum_{j=1}^{k}\frac{1}{n_j - 1} - \frac{1}{N - k}\right]\frac{2p^2 + 3p - 1}{6(p + 1)(k - 1)}, \qquad df = \frac{p(p+1)(k-1)}{2}$

Interpretation:
- $p > .05$: No evidence against homogeneity → proceed with standard MANOVA.
- $.001 < p \le .05$: Marginal violation → use Pillai's Trace; check group sizes.
- $p \le .001$: Significant violation → use Pillai's Trace; consider heteroscedastic MANOVA variants.
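The statistic and its chi-square approximation can be sketched as follows (a minimal illustration assuming each group has $n_j > p$ so all covariance matrices are invertible; `box_m` is a hypothetical helper name):

```python
import numpy as np
from scipy import stats

def box_m(groups):
    """Box's M test for homogeneity of covariance matrices.
    groups : list of (n_j, p) arrays, one per group.
    Returns (M, chi2_stat, df, p_value) via the chi-square approximation."""
    k = len(groups)
    p = groups[0].shape[1]
    ns = [g.shape[0] for g in groups]
    N = sum(ns)
    covs = [np.cov(g, rowvar=False) for g in groups]
    S_pooled = sum((n - 1) * S for n, S in zip(ns, covs)) / (N - k)

    M = (N - k) * np.log(np.linalg.det(S_pooled)) \
        - sum((n - 1) * np.log(np.linalg.det(S)) for n, S in zip(ns, covs))
    c = (sum(1.0 / (n - 1) for n in ns) - 1.0 / (N - k)) \
        * (2 * p**2 + 3 * p - 1) / (6 * (p + 1) * (k - 1))
    chi2_stat = M * (1 - c)              # approximate chi-square statistic
    df = p * (p + 1) * (k - 1) / 2
    return M, chi2_stat, df, stats.chi2.sf(chi2_stat, df)
```

Because the pooled log-determinant can never be smaller than the weighted average of the group log-determinants, $M$ is always non-negative.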
11.3 Outlier Detection
Univariate outliers: For each DV, compute standardised scores $z = (y - \bar{y})/s$. Flag observations with $\lvert z \rvert > 3.29$ (the two-tailed critical value at $\alpha = .001$).
Multivariate outliers: Mahalanobis distances exceeding $\chi^2_{p,\,0.001}$ flag multivariate outliers. These are observations unusual on the combination of DVs even if not extreme on any single DV.
Leverage and influence in MANOVA:
The hat matrix for the multivariate design matrix $\mathbf{X}$:

$\mathbf{H}_{\text{hat}} = \mathbf{X}(\mathbf{X}^\top\mathbf{X})^{-1}\mathbf{X}^\top$

Leverage ($h_{ii}$, the diagonal of $\mathbf{H}_{\text{hat}}$) measures how much observation $i$ influences its own fitted values. High leverage ($h_{ii} > 2q/N$, where $q$ is the number of model parameters) combined with large residuals indicates influential observations.
11.4 Checking Linearity Among DVs
Create a scatterplot matrix (pairs plot) of all DVs within each group. Non-linear relationships suggest transformations (log, square root) or that MANOVA's use of covariances is inefficient.
Look for:
- Approximately elliptical joint distributions (consistent with multivariate normality).
- No obvious non-linear (e.g., quadratic) trends.
- No severe restriction of range in any DV.
11.5 Checking Multicollinearity Among DVs
Compute the condition number and determinant of the DV correlation matrix $\mathbf{R}$:
- If $\det(\mathbf{R}) < 10^{-4}$: Near-singular — multicollinearity problem.
- Condition number $> 30$: Potential multicollinearity.
Compute pairwise correlations among DVs: correlations above .90 in absolute value suggest redundancy. Consider removing one of the highly correlated DVs or combining them into a composite.
💡 While some correlation among DVs is expected (and exploited by MANOVA), extreme multicollinearity creates numerical instability. DVs with $\lvert r \rvert > .90$ provide largely redundant information and one should be removed.
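These diagnostics can be computed in a few lines. The sketch below uses the common rule-of-thumb cutoffs mentioned above ($\lvert r\rvert > .90$, determinant near zero, condition number $> 30$); the function name and the exact cutoff defaults are illustrative:

```python
import numpy as np

def dv_collinearity_check(X, r_cut=0.90, det_cut=1e-4, cond_cut=30.0):
    """Screen an (n, p) DV matrix for multicollinearity using the
    correlation matrix R: its determinant, its condition number, and
    high pairwise |r| values."""
    R = np.corrcoef(X, rowvar=False)
    det_R = np.linalg.det(R)
    cond_R = np.linalg.cond(R)
    iu = np.triu_indices_from(R, k=1)            # upper-triangle pairs
    high_pairs = [(i, j, R[i, j])
                  for i, j in zip(*iu) if abs(R[i, j]) >= r_cut]
    flags = {
        "near_singular": det_R < det_cut,        # |R| close to zero
        "ill_conditioned": cond_R > cond_cut,    # large eigenvalue spread
        "redundant_pairs": high_pairs,           # candidate DVs to drop/merge
    }
    return R, det_R, cond_R, flags
```

A DV that is a near-copy of another will show up in all three flags at once, which is the signal to remove or combine it before running MANOVA.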
11.6 Checking Homogeneity of Regression (MANCOVA)
Test: Include the group × covariate interaction in the model:

$\mathbf{Y} = \mathbf{X}_{\text{group}}\mathbf{B} + \mathbf{Z}\boldsymbol{\Gamma} + (\text{Group} \times \mathbf{Z})\boldsymbol{\Delta} + \boldsymbol{\varepsilon}$

Test the multivariate significance of the interaction term $\boldsymbol{\Delta}$ using all four test statistics. A non-significant result ($p > .05$) supports the homogeneity of regression assumption.
12. Contrast Analysis in MANOVA
12.1 Planned Contrasts vs. Post-Hoc Comparisons
Planned (a priori) contrasts are comparisons specified before data collection based on theory. They:
- Have more statistical power than post-hoc tests.
- Do not require a significant omnibus MANOVA (though it is conventional to require it).
- Are limited in number (typically $k - 1$ orthogonal contrasts for $k$ groups).
Post-hoc comparisons are exploratory comparisons conducted after the data are seen. They require stronger corrections for multiple testing.
12.2 Contrast Coding Schemes
For a factor with $k$ levels, $k - 1$ linearly independent contrasts can be specified. Common coding schemes:
Helmert Contrasts: Compare each level to the mean of all subsequent levels:
- Contrast 1: Level 1 vs. mean of Levels 2, 3, ..., $k$.
- Contrast 2: Level 2 vs. mean of Levels 3, ..., $k$.
- Etc.
Deviation Contrasts: Compare each level to the grand mean (all other groups):
- Contrast $j$: Level $j$ vs. the grand mean of all levels.
Repeated/Difference Contrasts: Compare adjacent levels:
- Contrast $j$: Level $j$ vs. Level $j + 1$.
Orthogonal Polynomial Contrasts: Decompose a quantitative factor's effect into linear, quadratic, cubic trends.
Custom Contrasts: Any set of contrast coefficients $c_1, \dots, c_k$ with $\sum_{j=1}^{k} c_j = 0$.
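As a concrete illustration of one of these schemes, the Helmert coding matrix can be generated programmatically (a sketch with an illustrative function name):

```python
import numpy as np

def helmert_contrasts(k):
    """(k-1) x k matrix of Helmert contrasts: row i compares level i
    with the mean of all subsequent levels. Each row sums to zero and
    the rows are mutually orthogonal."""
    C = np.zeros((k - 1, k))
    for i in range(k - 1):
        C[i, i] = 1.0
        C[i, i + 1:] = -1.0 / (k - 1 - i)   # spread -1 over later levels
    return C
```

For $k = 4$ this yields rows $(1, -\tfrac{1}{3}, -\tfrac{1}{3}, -\tfrac{1}{3})$, $(0, 1, -\tfrac{1}{2}, -\tfrac{1}{2})$, and $(0, 0, 1, -1)$.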
12.3 Testing a Single Multivariate Contrast
A single multivariate contrast $\boldsymbol{\psi} = \sum_j c_j \boldsymbol{\mu}_j$ is tested using Hotelling's $T^2$:

$T^2 = \left(\sum_{j=1}^{k}\frac{c_j^2}{n_j}\right)^{-1} \hat{\boldsymbol{\psi}}^\top \mathbf{S}_W^{-1} \hat{\boldsymbol{\psi}}, \qquad \hat{\boldsymbol{\psi}} = \sum_{j=1}^{k} c_j \bar{\mathbf{y}}_j$

with $F = \dfrac{\nu_E - p + 1}{p\,\nu_E}\,T^2 \sim F(p,\ \nu_E - p + 1)$, where $\nu_E = N - k$ and $\mathbf{S}_W = \mathbf{E}/\nu_E$.
12.4 Testing a Set of Multivariate Contrasts
For a set of planned contrasts defined by contrast matrix $\mathbf{C}$ ($c \times k$, rows summing to zero), the hypothesis SSCP is:

$\mathbf{H}_C = (\mathbf{C}\bar{\mathbf{Y}})^\top \left[\mathbf{C}\,\mathrm{diag}(1/n_1, \dots, 1/n_k)\,\mathbf{C}^\top\right]^{-1} (\mathbf{C}\bar{\mathbf{Y}})$

where $\bar{\mathbf{Y}}$ is the $k \times p$ matrix of group mean vectors. The four multivariate test statistics are computed using $\mathbf{H}_C$ in place of $\mathbf{H}$.
13. Repeated Measures MANOVA (Profile Analysis)
13.1 Repeated Measures as MANOVA
When the same participants are measured on $t$ occasions (or $t$ related conditions), the data can be analysed as a one-sample or $k$-sample MANOVA — this is called profile analysis or doubly multivariate analysis.
Profile analysis treats the repeated measurements as correlated dependent variables in a MANOVA framework. It avoids the sphericity assumption required by repeated-measures ANOVA, making it more general and robust.
13.2 The Three Hypotheses in Profile Analysis
Profile analysis simultaneously tests three distinct hypotheses:
13.2.1 Test of Parallelism (Interaction)
: The profiles of all groups are parallel — the pattern of change across occasions is the same for all groups.
This is the most important test. A significant result means the groups differ in their pattern of responses across occasions (interaction between group and occasion).
Tested using the $t - 1$ difference scores ($d_j = y_{j+1} - y_j$) as DVs in a MANOVA.
13.2.2 Test of Levels (Group Main Effect)
: All groups have the same average level across occasions — the overall mean response is the same across groups.
Equivalent to a one-way ANOVA on the row means $\bar{y}_{i\cdot}$ (each participant's average across occasions). Only interpretable when the parallelism test is not rejected.
13.2.3 Test of Flatness (Occasion Main Effect)
: The profile is flat — there is no change across occasions, averaged over all groups.
Tested using a one-sample MANOVA on the difference scores. Only interpretable when the parallelism test is not rejected.
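Both the parallelism and flatness tests operate on difference scores between adjacent occasions. A minimal sketch of forming those scores and of the one-sample Hotelling $T^2$ used for the flatness test (function names are illustrative, not DataStatPro's API):

```python
import numpy as np

def difference_scores(Y):
    """Adjacent-occasion difference scores: column j is
    Y[:, j+1] - Y[:, j]. A k-group MANOVA on these tests parallelism;
    a one-sample test on them tests flatness."""
    return np.diff(np.asarray(Y, dtype=float), axis=1)

def hotelling_one_sample(D):
    """One-sample Hotelling T^2 that the mean difference vector is
    zero (the flatness test), with its exact F conversion."""
    n, q = D.shape
    dbar = D.mean(axis=0)
    S = np.cov(D, rowvar=False)
    T2 = n * dbar @ np.linalg.solve(S, dbar)
    F = (n - q) / (q * (n - 1)) * T2       # F with (q, n - q) df
    return T2, F
```

For scores at T0, T3, T6, `difference_scores` returns the two columns $d_1 = y_{T3} - y_{T0}$ and $d_2 = y_{T6} - y_{T3}$.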
13.3 Multivariate vs. Univariate Approach to Repeated Measures
| Feature | Multivariate (Profile Analysis) | Univariate (RM-ANOVA) |
|---|---|---|
| Sphericity assumption | Not required | Required (or corrected) |
| Power (sphericity met) | Lower | Higher |
| Power (sphericity violated) | Higher | Lower (unless corrected) |
| Requires $N - k \ge t - 1$ (enough error df for the difference scores) | Yes | No |
| Handles missing data | Less easily | Same |
| Sample size requirements | Higher | Lower |
| Recommended when | $N$ is large, $t$ is small, sphericity questionable | $N$ is small, $t$ is large, sphericity assumed |
💡 For datasets where the sphericity assumption is questionable and sample size is adequate, profile analysis (MANOVA) is generally preferred over univariate repeated-measures ANOVA with Greenhouse-Geisser or Huynh-Feldt corrections.
14. Using the MANOVA/MANCOVA Component
The MANOVA/MANCOVA component in the DataStatPro application provides a full end-to-end workflow for conducting multivariate analysis of variance and covariance.
Step-by-Step Guide
Step 1 — Select Dataset Choose the dataset from the "Dataset" dropdown. The dataset should contain at least two continuous dependent variables and one categorical grouping variable.
Step 2 — Select Analysis Type Choose the analysis type:
- One-Way MANOVA (single factor, multiple DVs)
- Factorial MANOVA (two or more factors, multiple DVs)
- One-Way MANCOVA (single factor + covariates, multiple DVs)
- Factorial MANCOVA (two or more factors + covariates, multiple DVs)
- Profile Analysis (repeated measures MANOVA)
- Hotelling's $T^2$ (two-group special case)
Step 3 — Select Dependent Variables (DVs) Select two or more continuous dependent variables from the "Dependent Variables" panel.
💡 Select DVs that are theoretically related and expected to collectively represent the outcome construct. Avoid including DVs that are conceptually unrelated or nearly perfectly correlated ($\lvert r \rvert > .90$).
Step 4 — Select Independent Variable(s) / Factor(s) Select the categorical grouping variable(s) from the "Factor(s)" dropdown. For factorial MANOVA, select two or more factors. The application will automatically create all main effects and interactions.
Step 5 — Select Covariate(s) (MANCOVA Only) For MANCOVA, select one or more continuous covariates from the "Covariate(s)" dropdown. The application will:
- Automatically test the homogeneity of regression assumption.
- Compute adjusted means (estimated marginal means).
- Report covariate effects.
Step 6 — Select Contrast Type (Optional) For planned contrasts, select the coding scheme:
- None (omnibus test only)
- Helmert
- Deviation
- Repeated/Difference
- Polynomial (Trend)
- Custom (specify contrast coefficients manually)
Step 7 — Configure Post-Hoc Tests If applicable, select the post-hoc correction method for follow-up pairwise comparisons:
- Tukey HSD (recommended for equal group sizes)
- Bonferroni
- Holm
- Games-Howell (recommended for unequal group sizes or heterogeneous variances)
- Scheffé
Step 8 — Select Confidence Level Choose the confidence level for confidence intervals and effect sizes (default: 95%).
Step 9 — Select Display Options Choose which outputs to display:
- ✅ Box's M Test
- ✅ Multivariate Test Statistics (Wilks', Pillai's, Hotelling-Lawley, Roy's)
- ✅ Univariate ANOVA Results per DV
- ✅ Descriptive Statistics (means, SDs, $n$ per group per DV)
- ✅ Effect Sizes (partial $\eta^2$, $\omega^2$, Cohen's $d$)
- ✅ Discriminant Function Analysis
- ✅ Post-Hoc Pairwise Comparisons
- ✅ Multivariate Normality Tests (Mardia's, Royston's)
- ✅ Mahalanobis Distance Q-Q Plot
- ✅ Adjusted Means Plot (MANCOVA)
- ✅ Homogeneity of Regression Test (MANCOVA)
- ✅ Error Covariance Matrix
- ✅ Discriminant Function Loadings Plot
- ✅ Group Centroid Plot (Score Plot)
- ✅ Profile Plot (for Repeated Measures)
Step 10 — Run the Analysis Click "Run MANOVA/MANCOVA". The application will:
- Compute group means, grand means, and the SSCP matrices $\mathbf{H}$, $\mathbf{E}$, $\mathbf{T}$.
- Run Box's M test and multivariate normality tests.
- Compute all four multivariate test statistics and their approximations.
- Compute effect sizes.
- Conduct univariate follow-up ANOVAs per DV.
- Conduct discriminant function analysis.
- Run post-hoc pairwise comparisons (if requested).
- Compute adjusted means (MANCOVA).
- Generate all selected visualisations and tables.
15. Computational and Formula Details
15.1 Computing the SSCP Matrices Step-by-Step
Step 1: Compute group means and grand mean

$\bar{\mathbf{y}}_j = \frac{1}{n_j}\sum_{i=1}^{n_j}\mathbf{y}_{ij}, \qquad \bar{\mathbf{y}} = \frac{1}{N}\sum_{j=1}^{k}\sum_{i=1}^{n_j}\mathbf{y}_{ij}$

Step 2: Compute between-group SSCP matrix

For $p$ DVs and $k$ groups:

$\mathbf{H} = \sum_{j=1}^{k} n_j(\bar{\mathbf{y}}_j - \bar{\mathbf{y}})(\bar{\mathbf{y}}_j - \bar{\mathbf{y}})^\top$

Step 3: Compute within-group SSCP matrix

For $p$ DVs:

$\mathbf{E} = \sum_{j=1}^{k}\sum_{i=1}^{n_j}(\mathbf{y}_{ij} - \bar{\mathbf{y}}_j)(\mathbf{y}_{ij} - \bar{\mathbf{y}}_j)^\top$

Step 4: Verify the decomposition $\mathbf{T} = \mathbf{H} + \mathbf{E}$, where $\mathbf{T} = \sum_j\sum_i(\mathbf{y}_{ij} - \bar{\mathbf{y}})(\mathbf{y}_{ij} - \bar{\mathbf{y}})^\top$.
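The steps above translate directly into code. A minimal NumPy sketch for the one-way case (`sscp_matrices` is an illustrative name):

```python
import numpy as np

def sscp_matrices(X, groups):
    """Between (H), within (E), and total (T) SSCP matrices for a
    one-way MANOVA. X is (n, p); groups is a length-n label array."""
    X = np.asarray(X, dtype=float)
    groups = np.asarray(groups)
    grand = X.mean(axis=0)
    p = X.shape[1]
    H = np.zeros((p, p))
    E = np.zeros((p, p))
    for g in np.unique(groups):
        Xg = X[groups == g]
        mg = Xg.mean(axis=0)
        d = (mg - grand)[:, None]
        H += Xg.shape[0] * (d @ d.T)   # n_j (mean_j - grand)(mean_j - grand)'
        R = Xg - mg
        E += R.T @ R                   # within-group cross products
    Xc = X - grand
    T = Xc.T @ Xc                      # total SSCP
    return H, E, T
```

The decomposition check of Step 4 (`H + E == T`, up to floating-point error) is a cheap sanity test worth keeping in any implementation.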
15.2 Computing Eigenvalues of
The eigenvalues $\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_s$ (with $s = \min(p, k-1)$) are the roots of:

$\det(\mathbf{E}^{-1}\mathbf{H} - \lambda\mathbf{I}) = 0$

For $p = 2$ DVs and $k \ge 3$ groups (so $s = 2$), the eigenvalues are the roots of the quadratic:

$\lambda^2 - \mathrm{tr}(\mathbf{E}^{-1}\mathbf{H})\,\lambda + \det(\mathbf{E}^{-1}\mathbf{H}) = 0$
15.3 Computing the Four Test Statistics from Eigenvalues
Given eigenvalues $\lambda_1, \dots, \lambda_s$ of $\mathbf{E}^{-1}\mathbf{H}$:
Wilks' Lambda: $\Lambda = \prod_{m=1}^{s}\dfrac{1}{1 + \lambda_m} = \dfrac{\det(\mathbf{E})}{\det(\mathbf{H} + \mathbf{E})}$
Pillai's Trace: $V = \sum_{m=1}^{s}\dfrac{\lambda_m}{1 + \lambda_m}$
Hotelling-Lawley Trace: $T = \sum_{m=1}^{s}\lambda_m$
Roy's Largest Root: $\Theta = \lambda_1$ (sometimes reported as $\theta = \lambda_1/(1 + \lambda_1)$)
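The mapping from eigenvalues to the four statistics is a few lines of linear algebra. A minimal sketch (`manova_statistics` is an illustrative name; tiny negative roots from floating-point noise are clipped to zero):

```python
import numpy as np

def manova_statistics(H, E):
    """Compute Wilks', Pillai's, Hotelling-Lawley, and Roy's statistics
    from the eigenvalues of E^{-1} H."""
    eigvals = np.linalg.eigvals(np.linalg.solve(E, H)).real
    eigvals = np.clip(eigvals, 0.0, None)      # guard against round-off
    wilks = np.prod(1.0 / (1.0 + eigvals))
    pillai = np.sum(eigvals / (1.0 + eigvals))
    hotelling = np.sum(eigvals)
    roy = eigvals.max()                        # largest root, Theta = lambda_1
    return wilks, pillai, hotelling, roy
```

With eigenvalues $\lambda = (1, 3)$ this gives $\Lambda = 0.125$, $V = 1.25$, $T = 4$, $\Theta = 3$, which is a handy hand-checkable case.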
15.4 F-Approximations in Full Detail
Wilks' Lambda (Rao's F-approximation):

$F = \frac{1 - \Lambda^{1/t}}{\Lambda^{1/t}}\cdot\frac{df_2}{df_1}, \qquad F \;\dot\sim\; F(df_1, df_2)$

Let $\nu_H = k - 1$, $\nu_E = N - k$, $t = \sqrt{\dfrac{p^2\nu_H^2 - 4}{p^2 + \nu_H^2 - 5}}$ (set $t = 1$ if $p^2 + \nu_H^2 - 5 \le 0$), $w = \nu_E + \nu_H - \dfrac{p + \nu_H + 1}{2}$, $df_1 = p\,\nu_H$, and $df_2 = wt - \dfrac{p\,\nu_H - 2}{2}$.

Pillai's Trace:

With $s = \min(p, \nu_H)$, $m = \dfrac{\lvert p - \nu_H \rvert - 1}{2}$, and $n' = \dfrac{\nu_E - p - 1}{2}$:

$F = \frac{2n' + s + 1}{2m + s + 1}\cdot\frac{V}{s - V}, \qquad df_1 = s(2m + s + 1), \quad df_2 = s(2n' + s + 1)$

Hotelling-Lawley Trace:

$F = \frac{2(sn' + 1)}{s^2(2m + s + 1)}\,T, \qquad df_1 = s(2m + s + 1), \quad df_2 = 2(sn' + 1)$

Where $s$, $m$, and $n'$ are defined as in Pillai's Trace.

Roy's Largest Root:

$F = \lambda_1\,\frac{\nu_E - d + \nu_H}{d}, \qquad df_1 = d, \quad df_2 = \nu_E - d + \nu_H, \quad d = \max(p, \nu_H)$

Roy's $F$ provides an upper bound, not an exact $F$; the associated $p$-value is therefore a lower bound.
15.5 Discriminant Function Computation
The $m$-th discriminant function weight vector $\mathbf{a}_m$ is the eigenvector of $\mathbf{E}^{-1}\mathbf{H}$:

$\mathbf{E}^{-1}\mathbf{H}\,\mathbf{a}_m = \lambda_m \mathbf{a}_m$

Normalised so that $\mathbf{a}_m^\top \mathbf{S}_W \mathbf{a}_m = 1$ with $\mathbf{S}_W = \mathbf{E}/(N - k)$ (within-group variance of discriminant scores = 1).

Discriminant scores for each observation:

$z_{im} = \mathbf{a}_m^\top(\mathbf{y}_i - \bar{\mathbf{y}})$

Group centroids on discriminant function $m$:

$\bar{z}_{jm} = \mathbf{a}_m^\top(\bar{\mathbf{y}}_j - \bar{\mathbf{y}})$

Structure coefficients: the pooled within-group correlations between each original DV and the discriminant scores, indicating how strongly each DV loads on each function.
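A minimal sketch of the eigen-decomposition and the within-group scaling described above (an illustration, not DataStatPro's implementation; only the first $s = \min(p, k-1)$ functions are meaningful):

```python
import numpy as np

def discriminant_functions(H, E, n_total, k_groups):
    """Raw discriminant weights: eigenvectors of E^{-1}H, sorted by
    eigenvalue and scaled so that a' S_W a = 1 for each function."""
    vals, vecs = np.linalg.eig(np.linalg.solve(E, H))
    order = np.argsort(vals.real)[::-1]          # largest root first
    vals = vals.real[order]
    vecs = vecs.real[:, order]
    Sw = E / (n_total - k_groups)                # pooled within covariance
    for j in range(vecs.shape[1]):
        a = vecs[:, j]
        vecs[:, j] = a / np.sqrt(a @ Sw @ a)     # unit within-group variance
    return vals, vecs
```

Scores are then `(Y - Y.mean(axis=0)) @ vecs`, and group centroids are the group means of those scores.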
15.6 MANCOVA Adjusted SSCP Computation
For MANCOVA with covariate matrix $\mathbf{Z}$ ($N \times c$ for $c$ covariates):

Within-group SSCP matrices for DVs and covariates: partition the joint within-group SSCP into blocks $\mathbf{E}_{yy}$ ($p \times p$, DVs), $\mathbf{E}_{yz}$ ($p \times c$, DV–covariate), and $\mathbf{E}_{zz}$ ($c \times c$, covariates); partition the total SSCP analogously into $\mathbf{T}_{yy}$, $\mathbf{T}_{yz}$, $\mathbf{T}_{zz}$.

Adjusted within-group SSCP (after removing covariate effects):

$\mathbf{E}^{*} = \mathbf{E}_{yy} - \mathbf{E}_{yz}\mathbf{E}_{zz}^{-1}\mathbf{E}_{yz}^\top$

Adjusted total SSCP:

$\mathbf{T}^{*} = \mathbf{T}_{yy} - \mathbf{T}_{yz}\mathbf{T}_{zz}^{-1}\mathbf{T}_{yz}^\top$

Adjusted hypothesis SSCP:

$\mathbf{H}^{*} = \mathbf{T}^{*} - \mathbf{E}^{*}$

Test statistics are computed using $\mathbf{H}^{*}$ and $\mathbf{E}^{*}$ instead of $\mathbf{H}$ and $\mathbf{E}$, with error degrees of freedom reduced to $N - k - c$.
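Given the partitioned SSCP blocks, the covariate adjustment is a pair of Schur-complement operations. A minimal sketch (`adjusted_sscp` is an illustrative name; the caller is assumed to have already computed the blocks):

```python
import numpy as np

def adjusted_sscp(Eyy, Eyz, Ezz, Tyy, Tyz, Tzz):
    """Covariate-adjusted SSCP matrices for MANCOVA.
    E** blocks are within-group SSCPs for DVs (yy), DV-covariate
    cross products (yz), and covariates (zz); T** are the
    corresponding total-SSCP blocks."""
    E_adj = Eyy - Eyz @ np.linalg.solve(Ezz, Eyz.T)   # E* = Eyy - Eyz Ezz^{-1} Eyz'
    T_adj = Tyy - Tyz @ np.linalg.solve(Tzz, Tyz.T)   # T* = Tyy - Tyz Tzz^{-1} Tyz'
    H_adj = T_adj - E_adj                             # H* = T* - E*
    return H_adj, E_adj
```

When the covariate is uncorrelated with the DVs (all cross-product blocks zero), the adjustment leaves the matrices unchanged, which makes a convenient sanity check.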
16. Worked Examples
Example 1: One-Way MANOVA — Comparing Teaching Methods on Academic Performance
Research Question: Do three teaching methods (Traditional, Flipped Classroom, Project-Based Learning) differ significantly on students' performance across three academic subjects (Mathematics, Science, English)?
Data: $N = 90$ students; $k = 3$ groups ($n = 30$ per group); $p = 3$ DVs (scores out of 100: Math, Science, English).
Step 1: Descriptive Statistics
| Group | $n$ | Math $M$ ($SD$) | Science $M$ ($SD$) | English $M$ ($SD$) |
|---|---|---|---|---|
| Traditional | 30 | 68.4 (11.2) | 71.3 (9.8) | 74.6 (10.1) |
| Flipped | 30 | 75.8 (10.6) | 76.1 (11.3) | 78.2 (9.4) |
| Project-Based | 30 | 79.3 (12.1) | 80.4 (10.7) | 82.1 (11.8) |
| Grand Mean | 90 | 74.5 | 75.9 | 78.3 |
Step 2: Assumption Checks
Box's M test: not significant → No violation of homogeneity of covariance matrices.
Mardia's tests: skewness and kurtosis both non-significant → Multivariate normality supported.
Mahalanobis distances: maximum $D^2$ below the critical value $\chi^2_{3,\,.001} = 16.27$ → No multivariate outliers.
Step 3: MANOVA Results
SSCP matrices (summarised):
Eigenvalues of $\mathbf{E}^{-1}\mathbf{H}$: $\lambda_1 = 0.2847$, $\lambda_2 = 0.0312$
Multivariate Test Statistics:
| Test Statistic | Value | $F$ | $df_1$ | $df_2$ | $p$-value | partial $\eta^2$ |
|---|---|---|---|---|---|---|
| Wilks' $\Lambda$ | 0.7612 | 4.182 | 6 | 170 | < 0.001 | 0.129 |
| Pillai's Trace | 0.2481 | 4.021 | 6 | 172 | < 0.001 | 0.123 |
| Hotelling-Lawley | 0.3159 | 4.463 | 6 | 168 | < 0.001 | 0.137 |
| Roy's Largest Root | 0.2218 | 6.328 | 3 | 87 | < 0.001 | 0.179 |
Interpretation: All four test statistics are significant ($p < .001$), providing strong evidence that the three teaching methods differ significantly on the combined set of academic outcomes. Wilks' $\Lambda = 0.7612$, $F(6, 170) = 4.182$, $p < .001$, multivariate partial $\eta^2 = 0.129$ — a medium-to-large effect.
Step 4: Univariate Follow-Up ANOVAs (Bonferroni-corrected $\alpha = .05/3 \approx .017$)
| DV | $F$ | $p$-value | $\eta^2$ | Significant? |
|---|---|---|---|---|
| Mathematics | 7.84 | < 0.001 | 0.153 | ✅ Yes |
| Science | 5.91 | 0.004 | 0.120 | ✅ Yes |
| English | 4.47 | 0.014 | 0.093 | ✅ Yes |
All three DVs show significant univariate group differences after Bonferroni correction.
Step 5: Post-Hoc Pairwise Comparisons (Tukey HSD) for Mathematics
| Comparison | Mean Diff. | SE | $p$ | 95% CI |
|---|---|---|---|---|
| Traditional vs. Flipped | -7.40 | 2.42 | 0.009 | [-13.18, -1.62] |
| Traditional vs. Project | -10.90 | 2.42 | < 0.001 | [-16.68, -5.12] |
| Flipped vs. Project | -3.50 | 2.42 | 0.315 | [-9.28, 2.28] |
Project-Based Learning and Flipped Classroom both significantly outperform Traditional teaching on Mathematics. Flipped and Project-Based do not significantly differ.
Step 6: Discriminant Function Analysis
| Function | Eigenvalue | Canonical $r$ | $r^2$ | % Variance | $\chi^2$ | $df$ | $p$ |
|---|---|---|---|---|---|---|---|
| 1 | 0.2847 | 0.4711 | 0.222 | 90.1% | 21.84 | 6 | 0.001 |
| 2 | 0.0312 | 0.1741 | 0.030 | 9.9% | 2.73 | 2 | 0.255 |
Only Function 1 is significant. It explains 90.1% of between-group variance.
Structure coefficients for Function 1:
| DV | Structure Coefficient |
|---|---|
| Mathematics | 0.892 |
| Science | 0.851 |
| English | 0.793 |
All three DVs load strongly and positively on Function 1, indicating it represents overall academic performance. The group centroids on Function 1: Traditional = −0.42, Flipped = +0.18, Project-Based = +0.61, confirming a single dimension separating Traditional from the other methods.
Conclusion: Project-based and flipped classroom methods both significantly outperform traditional teaching on the combined academic outcome profile, with medium effect sizes. The group differences are primarily one-dimensional (overall academic performance), with all three subjects contributing similarly.
Example 2: One-Way MANCOVA — Effect of Exercise Programme on Health Outcomes Controlling for Age
Research Question: Do three exercise programmes (Control, Moderate, High Intensity) differ on cardiovascular outcomes (resting heart rate, VO₂ max) after controlling for participants' age?
Data: $N = 75$ participants ($n = 25$ per group); $p = 2$ DVs (Heart Rate in bpm, VO₂ max in mL/kg/min); covariate: Age (years).
Step 1: Descriptive Statistics (Unadjusted and Adjusted)
| Group | $n$ | HR (unadj.) | VO₂ max (unadj.) | HR (adj.) | VO₂ max (adj.) | Age $M$ |
|---|---|---|---|---|---|---|
| Control | 25 | 74.2 | 33.8 | 73.1 | 34.6 | 47.2 |
| Moderate | 25 | 70.1 | 38.2 | 71.4 | 37.4 | 45.8 |
| High Intensity | 25 | 65.8 | 44.1 | 65.3 | 43.8 | 44.1 |
| Grand Mean | 75 | 70.0 | 38.7 | 70.0 | 38.7 | 45.7 |
Step 2: Homogeneity of Regression Test
Multivariate test of Group × Age interaction:
Wilks' $\Lambda$: not significant ($p > .05$) → Homogeneity of regression assumption met; proceed with MANCOVA.
Step 3: Covariate Effect
Multivariate test of Age covariate:
Wilks' $\Lambda$: significant → Age significantly predicts the DV profile. Including Age as a covariate is justified; it improves precision.
Step 4: Adjusted MANCOVA Results
| Test Statistic | Value | $F$ | $df_1$ | $df_2$ | $p$-value | partial $\eta^2$ |
|---|---|---|---|---|---|---|
| Wilks' $\Lambda$ | 0.6182 | 9.641 | 4 | 138 | < 0.001 | 0.218 |
| Pillai's Trace | 0.3941 | 9.038 | 4 | 140 | < 0.001 | 0.205 |
| Hotelling-Lawley | 0.6094 | 10.664 | 4 | 136 | < 0.001 | 0.239 |
| Roy's Largest Root | 0.5847 | 20.464 | 2 | 71 | < 0.001 | 0.366 |
After controlling for Age, the exercise programme effect is highly significant (Wilks' $\Lambda = 0.6182$, $F(4, 138) = 9.641$, $p < .001$).
Step 5: Univariate Follow-Up ANCOVAs (Bonferroni $\alpha = .05/2 = .025$)
| DV | $F$ | $p$ | partial $\eta^2$ | Significant? |
|---|---|---|---|---|
| Heart Rate (adjusted) | 14.83 | < 0.001 | 0.295 | ✅ |
| VO₂ Max (adjusted) | 18.41 | < 0.001 | 0.341 | ✅ |
Both DVs show significant group differences after controlling for Age, with large effect sizes.
Step 6: Post-Hoc Pairwise on Adjusted VO₂ Max (Tukey HSD)
| Comparison | Adjusted Mean Diff. | SE | $p$ | 95% CI |
|---|---|---|---|---|
| Control vs. Moderate | -2.80 | 0.88 | 0.008 | [-5.01, -0.59] |
| Control vs. High | -9.20 | 0.88 | < 0.001 | [-11.41, -6.99] |
| Moderate vs. High | -6.40 | 0.88 | < 0.001 | [-8.61, -4.19] |
All three groups differ significantly on adjusted VO₂ Max, with High Intensity exercise producing the greatest improvements.
Conclusion: After controlling for age, exercise programme significantly affects the combined cardiovascular profile (Wilks' $\Lambda = 0.6182$, partial $\eta^2 = 0.218$). High intensity exercise produces the greatest improvements in both resting heart rate and VO₂ max compared to both moderate exercise and control conditions.
Example 3: Two-Way Factorial MANOVA — Gender and Treatment on Psychological Outcomes
Research Question: Do gender (Male/Female) and treatment condition (CBT/Control) affect the combined profile of three psychological outcomes (Depression, Anxiety, Stress scores)?
Data: $N = 120$ participants; 4 cells (2 genders × 2 conditions, $n = 30$ per cell); $p = 3$ DVs (DASS-21 subscales: Depression, Anxiety, Stress).
Design: 2 (Gender: Male, Female) × 2 (Treatment: CBT, Control) factorial MANOVA.
Step 1: Multivariate Tests for Main Effects and Interaction
| Effect | Wilks' $\Lambda$ | $F$ | $df_1$ | $df_2$ | $p$ | partial $\eta^2$ |
|---|---|---|---|---|---|---|
| Gender | 0.8841 | 4.82 | 3 | 114 | 0.003 | 0.113 |
| Treatment | 0.7043 | 15.93 | 3 | 114 | < 0.001 | 0.295 |
| Gender × Treatment | 0.9512 | 1.96 | 3 | 114 | 0.124 | 0.049 |
Gender × Treatment interaction: Not significant ($p = .124$). The effect of treatment on the psychological profile does not depend on gender. Interpret main effects.
Treatment main effect: Highly significant ($\Lambda = 0.7043$, $p < .001$, partial $\eta^2 = 0.295$ — large effect). CBT significantly reduces the combined psychological distress profile compared to control.
Gender main effect: Significant ($\Lambda = 0.8841$, $p = .003$, partial $\eta^2 = 0.113$ — medium effect). Females and males differ on the combined psychological profile.
Step 2: Univariate Follow-Up for Treatment Effect (Bonferroni $\alpha = .05/3 \approx .017$)
| DV | $F$ | $p$ | partial $\eta^2$ | CBT Mean | Control Mean | Cohen's $d$ |
|---|---|---|---|---|---|---|
| Depression | 28.41 | < 0.001 | 0.197 | 8.4 | 14.2 | 0.94 |
| Anxiety | 22.18 | < 0.001 | 0.160 | 9.1 | 13.8 | 0.81 |
| Stress | 19.84 | < 0.001 | 0.146 | 11.2 | 15.7 | 0.72 |
CBT significantly reduces all three psychological distress dimensions compared to control, with large effect sizes for depression and anxiety and a medium-to-large effect for stress.
Conclusion: CBT is significantly more effective than control at reducing the combined profile of depression, anxiety, and stress, with a large multivariate effect size (partial $\eta^2 = 0.295$). Males and females show somewhat different psychological distress profiles, but the treatment effect is consistent across genders (non-significant interaction).
Example 4: Profile Analysis — Examining Change Across Time Points
Research Question: Do two training groups (Standard vs. Enhanced) differ in their cognitive performance profiles across three time points (Baseline, 3 months, 6 months)?
Data: participants in two training groups; $t = 3$ time points (DVs: Cognitive Score at T0, T3, T6).
Three Profile Analysis Tests:
Test 1 — Parallelism (Group × Time Interaction):
Using the difference scores $d_1 = \text{T3} - \text{T0}$ and $d_2 = \text{T6} - \text{T3}$ as DVs, the multivariate test of the group effect is significant.
Profiles are NOT parallel — the two groups show different patterns of change over time. The interaction is significant.
Test 2 — Levels (Group Main Effect):
(Interpreted cautiously given significant parallelism test.)
The levels test is significant — the Enhanced training group has higher overall scores.
Test 3 — Flatness (Time Main Effect):
(Interpreted cautiously given significant parallelism test.)
The flatness test (Wilks' $\Lambda$) is significant — significant improvement over time across both groups.
Profile Means:
| Group | T0 (Baseline) | T3 (3 months) | T6 (6 months) |
|---|---|---|---|
| Standard | 52.4 | 56.8 | 58.1 |
| Enhanced | 51.9 | 61.3 | 68.7 |
The enhanced training group shows substantially greater improvement from T3 to T6 compared to the standard group, explaining the significant interaction (non-parallelism).
Conclusion: The two training programmes produce different patterns of cognitive improvement over time. Enhanced training shows accelerating improvement (particularly from 3 to 6 months), while standard training shows more modest and decelerating gains.
17. Common Mistakes and How to Avoid Them
Mistake 1: Conducting Separate ANOVAs Instead of MANOVA
Problem: Running separate univariate ANOVAs for each dependent variable without accounting for the familywise error rate or the correlational structure among DVs, resulting in inflated Type I error and loss of information about combined effects.
Solution: Use MANOVA when DVs are theoretically related and represent a common construct. MANOVA simultaneously controls the Type I error rate and detects effects in combined DV dimensions. Follow up a significant MANOVA with Bonferroni-corrected univariate ANOVAs rather than conducting ANOVAs independently.
Mistake 2: Ignoring Box's M Test Violations
Problem: Proceeding with Wilks' Lambda (or other statistics sensitive to heterogeneous covariance matrices) when Box's M test is significant, leading to inflated Type I error.
Solution: When Box's M is significant, switch to Pillai's Trace, which is the most robust statistic to heterogeneous covariance matrices. If group sizes are equal, robustness is reasonable. For severely heterogeneous covariance matrices, consider heteroscedastic MANOVA alternatives (James's test, Johansen's procedure).
Mistake 3: Including Too Many or Too Few DVs
Problem: Including an excessive number of DVs that are unrelated to each other or to the research question reduces statistical power by consuming degrees of freedom. Conversely, excluding relevant DVs misses important effects.
Solution: Select DVs based on theory and the research question. DVs should be conceptually related (representing a common construct) and expected to differ between groups. Avoid including DVs simply because they were measured — each DV should have a theoretical rationale.
Mistake 4: Including DVs with Near-Perfect Multicollinearity
Problem: Including two or more DVs that correlate almost perfectly ($\lvert r \rvert > .90$) creates numerical instability in computing $\mathbf{E}^{-1}$, potentially producing unreliable or undefined results.
Solution: Compute pairwise correlations among DVs before MANOVA. Remove one of any pair with $\lvert r \rvert > .90$, or combine them into a composite score. Check the condition number and determinant of $\mathbf{R}$ for near-singularity.
Mistake 5: Failing to Test Homogeneity of Regression in MANCOVA
Problem: Conducting MANCOVA without verifying that the covariate-DV regression slopes are equal across groups. If they differ, MANCOVA provides misleading results and adjusted means are uninterpretable.
Solution: Always test the covariate × group interaction before conducting MANCOVA. If the interaction is significant ($p < .05$), do not use standard MANCOVA. Instead, report the interaction itself, test group differences at specific covariate values (Johnson-Neyman regions), or use separate regression models per group.
Mistake 6: Interpreting Follow-Up Tests Without a Significant Omnibus MANOVA
Problem: Conducting and reporting follow-up univariate ANOVAs and post-hoc tests after a non-significant omnibus MANOVA, treating significant univariate results as meaningful.
Solution: Only proceed with follow-up analyses if the omnibus multivariate test is significant. A significant univariate result following a non-significant MANOVA is likely a Type I error. The omnibus MANOVA acts as a gate-keeping test.
Mistake 7: Overlooking Multivariate Outliers
Problem: Not checking for multivariate outliers, which can disproportionately influence the SSCP matrices and distort all test statistics and effect sizes.
Solution: Always compute Mahalanobis distances and identify observations exceeding $\chi^2_{p,\,0.001}$. Investigate flagged observations for data entry errors or genuinely unusual cases. Report the analysis with and without outliers if their influence is substantial.
Mistake 8: Confusing Wilks' Lambda Direction of Interpretation
Problem: Interpreting a larger Wilks' Lambda as indicating a stronger group effect, when in fact smaller Wilks' Lambda indicates stronger group separation.
Solution: Remember that Wilks' Lambda = 1 means no group differences; Lambda = 0 means perfect separation. Smaller Lambda → greater group differences. Many researchers compute $1 - \Lambda$ as the effect size proportion (approximately the proportion of generalised variance explained) to make the direction more intuitive.
Mistake 9: Selecting Post-Hoc Tests Without Considering Group Size Equality
Problem: Using Tukey HSD (which assumes equal group sizes) when group sizes are unequal, producing incorrect critical values and p-values.
Solution: Use Games-Howell for unequal group sizes (especially when combined with heterogeneous variances). Use Tukey HSD only when group sizes are equal or approximately equal. Always verify the assumptions of the chosen post-hoc test.
Mistake 10: Neglecting to Report Effect Sizes
Problem: Reporting only significance levels (-values) without effect sizes, which does not convey the practical or scientific importance of the group differences.
Solution: Always report multivariate effect sizes (partial $\eta^2$ from Wilks' Lambda or Pillai's Trace; $\omega^2$ for unbiased estimates) alongside $p$-values. For univariate follow-ups, report $\eta^2$ and Cohen's $d$ for pairwise comparisons. Interpret effect sizes using benchmarks and in the context of the domain.
18. Troubleshooting
| Issue | Likely Cause | Solution |
|---|---|---|
| $\mathbf{E}$ matrix is singular (non-invertible) | Perfect multicollinearity among DVs; $n_j \le p$ (too few observations per group); duplicate DVs | Remove perfectly correlated DVs; increase sample size; check for duplicate columns in data |
| All four test statistics indicate no effect ($\Lambda \approx 1$) | All DVs identical across groups; data entry error; DVs are constants | Check data; verify correct group assignment; inspect DV distributions |
| Box's M highly significant ($p < .001$) but group sizes are equal | Genuine heterogeneous covariance matrices; outliers in some groups | Use Pillai's Trace; investigate outliers; check individual group covariance matrices |
| Wilks' $\Lambda$ outside $[0, 1]$ | Computational error; negative eigenvalues due to numerical issues | Check data for errors; verify $n_j > p$ in all groups; use robust computation |
| Non-significant MANOVA but significant individual ANOVAs | Type I error inflation from multiple ANOVAs; low power in multivariate test due to uninformative DVs | Apply Bonferroni correction to ANOVAs; remove uninformative DVs; increase sample size |
| Significant MANOVA but no significant follow-up ANOVAs | Group differences exist on linear combinations of DVs, not individual DVs | Focus on DFA results rather than univariate ANOVAs; report discriminant function structure |
| All four statistics give different $p$-values with conflicting conclusions | Group differences are complex (multiple discriminant dimensions); small sample | Report all four statistics; use Pillai's Trace as primary; investigate DFA to understand the structure |
| Very large Roy's root but small Wilks' Lambda | All group separation concentrated on one dimension | Consistent result; Roy's root is powerful in this case; verify with DFA showing single significant function |
| MANCOVA homogeneity of regression violated | True interaction between group and covariate | Do not use MANCOVA; report the interaction; use Johnson-Neyman technique; consider moderation analysis |
| Mahalanobis distances all very large | Severe multivariate non-normality; very small sample ($N$ close to $p$) | Increase sample; remove extreme outliers; transform variables; consider nonparametric alternatives |
| Negative omega-squared ($\omega^2 < 0$) | True effect is zero or negligible in population; sampling variability | Round to zero; report as "negligible effect"; do not interpret as a meaningful negative relationship |
| Discriminant function analysis gives one fewer function than expected | One eigenvalue is essentially zero; effective rank of $\mathbf{H}$ is less than $\min(p, k-1)$ | Normal occurrence; only report significant discriminant functions; note the effective dimensionality |
| Adjusted means in MANCOVA are outside plausible range | Extrapolation beyond the range of the covariate; small $n_j$ | Check that covariate ranges overlap across groups; avoid extreme adjustments; report unadjusted means alongside |
19. Quick Reference Cheat Sheet
Core SSCP Matrix Formulas
| Matrix | Formula | Degrees of Freedom |
|---|---|---|
| Between-group $\mathbf{H}$ | $\sum_{j=1}^{k} n_j(\bar{\mathbf{y}}_j - \bar{\mathbf{y}})(\bar{\mathbf{y}}_j - \bar{\mathbf{y}})^\top$ | $k - 1$ |
| Within-group $\mathbf{E}$ | $\sum_{j=1}^{k}\sum_{i=1}^{n_j}(\mathbf{y}_{ij} - \bar{\mathbf{y}}_j)(\mathbf{y}_{ij} - \bar{\mathbf{y}}_j)^\top$ | $N - k$ |
| Total $\mathbf{T}$ | $\mathbf{H} + \mathbf{E}$ | $N - 1$ |
| Pooled covariance $\mathbf{S}_W$ | $\mathbf{E}/(N - k)$ | — |
Four Multivariate Test Statistics
| Statistic | Formula | Range | Most Robust | Most Powerful When |
|---|---|---|---|---|
| Wilks' $\Lambda$ | $\prod_m \frac{1}{1+\lambda_m}$ | $0 \le \Lambda \le 1$ | Moderate | Effects spread across dimensions |
| Pillai's $V$ | $\sum_m \frac{\lambda_m}{1+\lambda_m}$ | $0 \le V \le s$ | Most robust | Effects spread across dimensions |
| Hotelling-Lawley $T$ | $\sum_m \lambda_m$ | $T \ge 0$ | Least | Single dimension |
| Roy's $\Theta$ | $\lambda_1$ | $\Theta \ge 0$ | Least | Single dominant dimension |
Effect Size Benchmarks
| Effect Size | Small | Medium | Large |
|---|---|---|---|
| $\eta^2$ | 0.01 | 0.06 | 0.14 |
| Partial $\eta^2$ | 0.01 | 0.06 | 0.14 |
| Cohen's $d$ | 0.20 | 0.50 | 0.80 |
| Canonical $r^2$ | 0.01 | 0.09 | 0.25 |
| Cohen's $f$ (as $\eta^2 = f^2/(1+f^2)$) | 0.0099 | 0.0588 | 0.1373 |
Assumption Checks Summary
| Assumption | Test | Significance Threshold | Action if Violated |
|---|---|---|---|
| Homogeneity of covariance | Box's M | $p < .001$ | Use Pillai's Trace |
| Multivariate normality | Mardia's, Royston's | $p < .05$ | Use Pillai's Trace; transform DVs |
| Multivariate outliers | Mahalanobis $D^2$ | $D^2 > \chi^2_{p,\,.001}$ | Investigate; remove if erroneous |
| Homogeneity of regression (MANCOVA) | Factor × Covariate interaction | $p < .05$ | Do not use MANCOVA; report interaction |
| Multicollinearity among DVs | Pairwise correlations | $\lvert r \rvert > .90$ | Remove one DV from pair |
| Independence | Study design | — | Use multilevel/GEE models |
When to Use Which Statistic
| Situation | Recommended Statistic |
|---|---|
| Standard analysis, assumptions met | Wilks' $\Lambda$ |
| Heterogeneous covariance matrices | Pillai's Trace ($V$) |
| Unequal group sizes | Pillai's Trace ($V$) |
| Single discriminant dimension expected | Roy's Largest Root |
| Robust, conservative test | Pillai's Trace ($V$) |
| All four disagree | Report all; use Pillai's Trace as primary |
| Two groups only | Hotelling's $T^2$ (exact) |
MANCOVA vs. MANOVA
| Feature | MANOVA | MANCOVA |
|---|---|---|
| Covariates | None | One or more continuous |
| Reduces error variance | No | Yes (if covariate related to DVs) |
| Controls for baseline differences | No | Yes |
| Requires homogeneity of regression | No | Yes (must test) |
| Reports adjusted means | No | Yes |
| Additional assumption | — | Covariate × Group non-interaction |
Follow-Up Analysis Decision Guide
| Scenario | Follow-Up Approach |
|---|---|
| MANOVA significant, ordered DVs | Roy-Bargmann stepdown analysis |
| MANOVA significant, unordered DVs | Bonferroni-corrected univariate ANOVAs |
| MANOVA significant, want to characterise dimensions | Discriminant function analysis |
| Significant univariate ANOVA, $k > 2$ groups | Post-hoc pairwise comparisons (Tukey, Games-Howell) |
| Significant univariate ANOVA, $k = 2$ groups | Report group means and Cohen's $d$ directly |
| Pre-specified group comparisons | Planned contrasts (Hotelling's $T^2$ per contrast) |
| MANOVA not significant | Report as non-significant; do NOT conduct follow-up |
Profile Analysis Hypotheses
| Test | What It Tests | Corresponding Effect | When to Test |
|---|---|---|---|
| Parallelism | Same pattern across groups | Group × Occasion interaction | Always test first |
| Levels | Same overall mean | Group main effect | Only if parallelism holds |
| Flatness | No change over occasions | Occasion main effect | Only if parallelism holds |
Minimum Sample Size Guidelines
| Effect Size | Fewer DVs | Moderate DVs | Many DVs |
|---|---|---|---|
| Large | 20/group | 25/group | 30/group |
| Medium | 50/group | 60/group | 80/group |
| Small | 200/group | 250/group | 300/group |
Key Formulas Summary
| Formula | Description |
|---|---|
| $\Lambda = \det(\mathbf{E}) / \det(\mathbf{H} + \mathbf{E})$ | Wilks' Lambda |
| $V = \sum_m \lambda_m/(1+\lambda_m)$ | Pillai's Trace |
| $T = \sum_m \lambda_m$ | Hotelling-Lawley Trace |
| $\Theta = \lambda_1$ | Roy's Largest Root |
| $r_m = \sqrt{\lambda_m/(1+\lambda_m)}$ | Canonical correlation for function $m$ |
| $T^2 = \frac{n_1 n_2}{n_1+n_2}(\bar{\mathbf{y}}_1-\bar{\mathbf{y}}_2)^\top \mathbf{S}_{\text{pooled}}^{-1}(\bar{\mathbf{y}}_1-\bar{\mathbf{y}}_2)$ | Hotelling's $T^2$ (two groups) |
| $\eta^2_{\text{partial}} = 1 - \Lambda^{1/s}$ | Partial eta-squared from Wilks' $\Lambda$ |
| $\omega^2 = \dfrac{SS_B - (k-1)MS_W}{SS_T + MS_W}$ | Omega-squared (unbiased, univariate follow-ups) |
| $D_i^2 = (\mathbf{x}_i-\bar{\mathbf{x}})^\top \mathbf{S}^{-1}(\mathbf{x}_i-\bar{\mathbf{x}})$ | Mahalanobis distance |
| $\mathbf{E}^{*} = \mathbf{E}_{yy} - \mathbf{E}_{yz}\mathbf{E}_{zz}^{-1}\mathbf{E}_{yz}^\top$ | Adjusted SSCP (MANCOVA) |
This tutorial provides a comprehensive foundation for understanding, applying, and interpreting MANOVA and MANCOVA using the DataStatPro application. For further reading, consult Tabachnick & Fidell's "Using Multivariate Statistics" (7th ed., Pearson, 2019), Stevens's "Applied Multivariate Statistics for the Social Sciences" (5th ed., Routledge, 2009), or Rencher's "Methods of Multivariate Analysis" (3rd ed., Wiley, 2012). For feature requests or support, contact the DataStatPro team.