Effects of violations of model assumptions

1.1 Model assumptions for a single factor ANOVA model
1.2 Effects of various violations
1.3 Diagnostic tools
Contributors

1.1 Model assumptions for a single factor ANOVA model

Single factor (fixed effect) ANOVA model:

$$Y_{i_j} = \mu_i + \epsilon_{i_j}, \qquad j = 1, \ldots, n_i; \quad i = 1, \ldots, r.$$

Important model assumptions

Normality: $\epsilon_{i_j}$'s are normal random variables.
Equal Variance: $\epsilon_{i_j}$'s have the same variance ($\sigma^2$).
Independence: $\epsilon_{i_j}$'s are independent random variables.

Some questions:

What happens if these assumptions are violated?
How can we find out whether these assumptions are violated? Diagnostic tools:

- residual plots: check normality, equal variance, independence, outliers, etc.

- tests for equal variance

What can be done when these assumptions are violated? Remedial measures:

- Data transformations

- Non-parametric tests

1.2 Effects of various violations

Non-normality:

- Non-normality has little practical effect unless the departure from normality is extreme.

- The $F$-test and related procedures are fairly robust to departures from normality, both in terms of significance level and power.

Unequal error variance:

- The $F$-test and related analyses are fairly robust against unequal variances when the design is approximately balanced.

- Inference on a single parameter, such as a pairwise comparison of group means, can be substantially affected.
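A rough sketch of why pairwise comparisons are sensitive: the standard error of a difference of two group means based on the pooled MSE can differ greatly from one based on the two groups' own sample variances. The data and the function names below are hypothetical, chosen only for illustration.

```python
from math import sqrt
from statistics import variance

# Hypothetical data: group B is far more variable than groups A and C.
groups = {
    "A": [10.1, 9.9, 10.0, 10.2, 9.8],
    "B": [12.0, 8.0, 15.0, 5.0, 10.0],
    "C": [11.0, 10.5, 11.5, 10.8, 11.2],
}


def pooled_mse(groups):
    """MSE = SSE / (n_T - r), pooling the within-group sums of squares."""
    sse = sum((len(ys) - 1) * variance(ys) for ys in groups.values())
    df = sum(len(ys) for ys in groups.values()) - len(groups)
    return sse / df


def se_pooled(groups, a, b):
    """SE of (mean of a - mean of b), assuming equal error variances."""
    mse = pooled_mse(groups)
    return sqrt(mse * (1 / len(groups[a]) + 1 / len(groups[b])))


def se_separate(groups, a, b):
    """SE of (mean of a - mean of b), using each group's own variance."""
    return sqrt(
        variance(groups[a]) / len(groups[a])
        + variance(groups[b]) / len(groups[b])
    )


# Comparing A with C: the pooled SE is inflated by the noisy group B,
# even though B is not part of the comparison.
print(se_pooled(groups, "A", "C"), se_separate(groups, "A", "C"))
```

Here the comparison of A and C inherits the large variance of B through the pooled MSE, so the interval for $\mu_A - \mu_C$ would be much wider than the data from A and C alone suggest.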

Non-independence:

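- Non-independence can have serious effects: correlated observations carry less information than the same number of independent ones (an effective loss of degrees of freedom).

- It is also hard to correct after the data have been collected.

- Thus it is very important to use randomization whenever it is feasible.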
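One way to make the "effective loss of degrees of freedom" concrete is the following worked case. The equicorrelation structure is an assumption made here purely for illustration: suppose the $n$ errors in a group each have variance $\sigma^2$ and every pair has the same correlation $\rho \geq 0$. Then

$$\mathrm{Var}(\bar{Y}) = \frac{\sigma^2}{n}\left[1 + (n-1)\rho\right], \qquad n_{\text{eff}} = \frac{n}{1 + (n-1)\rho}.$$

For example, with $n = 10$ and $\rho = 0.2$, $n_{\text{eff}} = 10/2.8 \approx 3.6$: the ten correlated observations estimate the group mean only about as precisely as 3.6 independent ones, while the usual formulas behave as if all ten were independent.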

1.3 Diagnostic tools

Based on residuals:

Residuals:
$$e_{i_j} = Y_{i_j} - \bar{Y}_i, \qquad j = 1, \ldots, n_i; \quad i = 1, \ldots, r.$$
Studentized residuals:
$$r_{i_j} = \frac{e_{i_j}}{s(e_{i_j})},$$
where $s(e_{i_j}) = \sqrt{MSE \times (n_i - 1)/n_i}$ (since $\mathrm{Var}(e_{i_j}) = \sigma^2(1 - 1/n_i)$).
Studentized residuals adjust for the group sample sizes, so they are comparable across treatment groups when the design is unbalanced.
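The two definitions above can be sketched in a few lines of Python. The data and the function names are hypothetical; only the formulas for $e_{i_j}$, MSE, and $s(e_{i_j})$ come from the text.

```python
from math import sqrt
from statistics import mean

# Hypothetical unbalanced data with r = 3 treatment groups.
groups = {
    "A": [10.0, 12.0, 14.0, 12.0],
    "B": [20.0, 22.0, 24.0],
    "C": [30.0, 31.0, 32.0, 33.0, 34.0],
}


def residuals(groups):
    """e_ij = Y_ij - Ybar_i for every observation, keyed by group."""
    return {g: [y - mean(ys) for y in ys] for g, ys in groups.items()}


def mse(groups):
    """MSE = SSE / (n_T - r)."""
    sse = sum(e * e for es in residuals(groups).values() for e in es)
    n_total = sum(len(ys) for ys in groups.values())
    return sse / (n_total - len(groups))


def studentized_residuals(groups):
    """r_ij = e_ij / sqrt(MSE * (n_i - 1) / n_i)."""
    m = mse(groups)
    out = {}
    for g, es in residuals(groups).items():
        n_i = len(es)
        s = sqrt(m * (n_i - 1) / n_i)
        out[g] = [e / s for e in es]
    return out


for g, rs in studentized_residuals(groups).items():
    print(g, [round(r, 3) for r in rs])
```

Note that the same raw residual (for example $-2$ in each group here) gives a different studentized residual in each group, because $s(e_{i_j})$ depends on $n_i$.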

Normal probability plots

A normal probability plot is a graphical tool to check whether a set of quantities is approximately normally distributed.

Each value is plotted against its "expected value under normality":

- Sort the values from smallest to largest: $x_{(1)}, \ldots, x_{(n)}$.

- For the $i$-th smallest value $x_{(i)}$, the "expected value under normality" is roughly the quantile of order $\frac{i}{n}$ (that is, the $100 \cdot \frac{i}{n}$-th percentile) of the standard normal distribution (the exact definition is a bit more complex).

A plot that is nearly linear suggests agreement with normality.
A plot that departs substantially from linearity suggests non-normality.
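The construction can be sketched as follows. The notes do not say which refinement of the $i/n$ rule they have in mind; this sketch assumes Blom's approximation, $(i - 0.375)/(n + 0.25)$, which is one common choice. (The plain $i/n$ rule cannot be used as is, because for $i = n$ it asks for the 100th percentile of the normal distribution, which is infinite.) The function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def normal_scores(n):
    """Approximate expected values of the order statistics of n standard
    normal draws, using Blom's plotting positions (an assumption here)."""
    std_normal = NormalDist()  # mean 0, standard deviation 1
    return [std_normal.inv_cdf((i - 0.375) / (n + 0.25))
            for i in range(1, n + 1)]


def normal_plot_points(values):
    """Pairs (expected value under normality, sorted value) to plot."""
    xs = sorted(values)
    return list(zip(normal_scores(len(xs)), xs))


def linearity(points):
    """Pearson correlation of the plotted pairs; near 1 means near-linear."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points)
    sxx = sum((p[0] - mx) ** 2 for p in points)
    syy = sum((p[1] - my) ** 2 for p in points)
    return sxy / sqrt(sxx * syy)


# Hypothetical values, e.g. a set of residuals.
values = [-1.8, 0.4, -0.3, 1.1, 0.0, -0.9, 2.2, 0.6]
points = normal_plot_points(values)
print(round(linearity(points), 3))
```

The correlation is only a numerical companion to the plot; the plot itself remains the primary diagnostic.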

Check normality

Normal probability plots of the residuals

When the sample sizes are small: use the combined residuals across all treatment groups.
When the sample sizes are large: draw a separate plot for each treatment group.
When unequal variances are indicated and combined residuals are used: use studentized residuals, but with MSE replaced by $s_{i}^{2}$ (the sample variance of the $i$-th treatment group) in the standard error calculation. Note that $$s_{i}^{2} = \frac{1}{n_i - 1} \sum_{j=1}^{n_i}(Y_{i_j} - \bar{Y}_i)^2$$
Normality is indicated by a normal probability plot that is reasonably linear (with points falling roughly along the 45$^\circ$ line when studentized residuals are used).
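A minimal sketch of the group-variance version of the studentized residuals, that is, $e_{i_j} / \sqrt{s_i^2 (n_i - 1)/n_i}$. The data and the function name are hypothetical.

```python
from math import sqrt
from statistics import mean, variance

# Hypothetical data with clearly unequal spreads.
groups = {
    "low": [10.0, 10.2, 9.8, 10.1, 9.9],
    "high": [8.0, 12.0, 15.0, 5.0],
}


def group_studentized_residuals(groups):
    """Studentize each residual with its own group's sample variance s_i^2
    in place of the pooled MSE."""
    out = {}
    for g, ys in groups.items():
        n_i = len(ys)
        ybar = mean(ys)
        s2_i = variance(ys)  # divisor n_i - 1, as in the formula for s_i^2
        se = sqrt(s2_i * (n_i - 1) / n_i)
        out[g] = [(y - ybar) / se for y in ys]
    return out


for g, rs in group_studentized_residuals(groups).items():
    print(g, [round(r, 3) for r in rs])
```

With this scaling the squared studentized residuals in group $i$ sum to $n_i$, so every group is put on the same scale before the residuals are combined in one plot.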

Checking the equal variance assumption

Residual vs. fitted value plots.

When the design is approximately balanced: plot residuals $e_{i_j}$'s against the fitted values $\bar{Y}_i$'s.
When the design is very unbalanced: plot the studentized residuals $r_{i_j}$'s against the fitted values $\bar{Y}_i$'s.
Constancy of the error variance is shown by the plot having about the same extent of dispersion of residuals (around zero) across different treatment groups.
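As an informal numerical companion to the residual vs. fitted value plot, the spread of the residuals can be summarized group by group. This is only a sketch with hypothetical data and function names; it is not one of the formal tests for equal variance.

```python
from statistics import mean, stdev

# Hypothetical data: group "B" is twice as spread out as group "A".
groups = {
    "A": [1.0, 2.0, 3.0],
    "B": [2.0, 4.0, 6.0],
}


def residual_spread(groups):
    """Fitted value and residual standard deviation for each group."""
    out = {}
    for g, ys in groups.items():
        fitted = mean(ys)
        res = [y - fitted for y in ys]
        out[g] = (fitted, stdev(res))
    return out


def spread_ratio(groups):
    """Largest residual standard deviation divided by the smallest."""
    sds = [sd for _, sd in residual_spread(groups).values()]
    return max(sds) / min(sds)


for g, (fitted, sd) in residual_spread(groups).items():
    print(g, fitted, sd)
print(spread_ratio(groups))
```

A ratio close to 1 agrees with a plot showing similar dispersion in every group; how large a ratio is worrying depends on the sample sizes, so the number should be read alongside the plot.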

Other things that can be examined by residual plots:

Independence: if measurements are obtained in a time or space sequence, a residual sequence plot can be used to check whether the error terms are serially correlated.
Outliers: these are identified by residuals of large magnitude.
Existence of other important (but unaccounted for) explanatory variables: check whether the residual plots show a systematic pattern.
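For the independence check, a residual sequence plot can be supplemented by the lag-1 autocorrelation of the residuals taken in their time (or space) order. The sketch below uses hypothetical residual sequences and a hypothetical function name; the statistic is a standard summary, not something prescribed by these notes.

```python
from statistics import mean


def lag1_autocorrelation(residuals_in_order):
    """Sample lag-1 autocorrelation of residuals listed in time order."""
    m = mean(residuals_in_order)
    num = sum(
        (residuals_in_order[t] - m) * (residuals_in_order[t - 1] - m)
        for t in range(1, len(residuals_in_order))
    )
    den = sum((e - m) ** 2 for e in residuals_in_order)
    return num / den


# Hypothetical sequences: a steady drift and a strict alternation.
drifting = [-3.0, -2.0, -1.0, 1.0, 2.0, 3.0]
alternating = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0]

print(lag1_autocorrelation(drifting))     # positive: neighbors are alike
print(lag1_autocorrelation(alternating))  # negative: neighbors alternate
```

Values far from zero in either direction suggest serially correlated errors; values near zero are consistent with independence but do not prove it.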

Example: package design

Residuals for the package design example are given below.

Contributors

Cathy Wang
