Skip to main content
Statistics LibreTexts

14.4: Randomized block design

  • Page ID
    45229
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    Introduction

    Randomized Block Designs and Two-Factor ANOVA

    In previous lectures, we have introduced you to the standard factorial ANOVA, which may be characterized as being crossed, balanced, and replicated. We expect that additional factors (covariates) may contribute to differences among our experimental units, but rather than testing them — which would increase the need for additional experimental units because of the increased number of groups to test — we randomize our subjects. Randomization is intended to disrupt trends of confounding variables (aka covariates). If the resulting experiment has missing values (see Chapter 5), then we can say that the design is partially replicated; if only one observation is made per group, then the design is not replicated — and perhaps, not very useful!!

    A special type of Two-factor ANOVA which includes a “blocking” factor and a treatment factor.

    Randomization is one way to control for “uninteresting” confounding factors. Clearly, there will be scenarios in which randomization is impossible. For example, it is impossible to randomly assign subjects to The blocking factor is similar to the Paired t-test. In the paired t-test we had two individuals or groups that we paired (e.g. twins). One specific design is called the Randomized Block Design and we can have more than 2 members in the group. We arrange the experimental units into similar groups, i.e., by the blocking factor. Examples of blocking factors may include day (experiments may be run over different days), location (experiments may be run at different locations in the laboratory), nucleic acid kits (different vendors), operator (different assistants may work on the experiments), et cetera.

    In general we may not be directly interested in the blocking factor. This blocking factor is used to control some factor(s) that we suspect might affect the response variable. Importantly, this has the effect of reducing the sums of squares by an amount equal to the sums of squares for the block. If variability removed by the blocking is significant, Mean Square Error (MSE) will be smaller, meaning that the \(F\) value for treatment will be bigger — meaning we have a more powerful ANOVA than if we had ignored the blocking.

    Statistical Testing in Randomized Block Designs

    “Blocks” is a Random Factor because we are “sampling” a few blocks out of a larger possible number of blocks. Treatment is a Fixed Factor, usually.

    The statistical model is \[Y_{ij} = \mu + \alpha_{i} + \beta_{j} + \epsilon{i,j} \nonumber\]

    The Sources of Variation are simpler than the more typical Two-Factor ANOVA because we do not calculate all the sources of variation – the interaction is not tested! (Table \(\PageIndex{1}\)).

    Table \(\PageIndex{1}\). Sources of variation for a two-way ANOVA, randomized block design.
    Sources of Variation & Sum of Squares DF Mean Squares
    \(total \ SS = \displaystyle \sum_{i=1}^{a} \sum_{j=1}^{b} \sum_{k=1}^{c} \left(X_{ijk} - \bar{X}\right)^{2}\) \(N-1\) \(\dfrac{total \ SS}{total \ DF}\)
    \(SS_{Factor \ A} = b \cdot n \sum_{i=1}^{a} \left(\bar{X}_{i} - \mu\right)^{2}\) \(a-1\) \(\dfrac{SS \ Factor \ A}{DF \ Factor A}\)
    \(SS_{Factor \ B} = a \cdot n \sum_{j=1}^{b} \left(\bar{X}_{j} - \mu\right)^{2}\) \(b-1\) \(\dfrac{SS \ Factor \ B}{DF \ Factor B}\)
    \(SS_{Total} = SS_{Factor \ A} - SS_{Factor \ B}\) \(n-a-b\) \(\dfrac{SS \ Remainder}{DF \ Remainder}\)

    Critical Value \(F_{0.05(2), (a-1), (Total \ DF - a - b)}\)

    In the exercise example above: Factor A = exercise or management plan.

    Notice that we do not look at the interaction MS or the Blocking Factor (typically).

    Learn by doing

    Rather than me telling you, try on your own. We’ll begin with a worked example, then proceed to introduce you to three problems. See Chapter 14.8 for general discussion of Rcmdr and linear models for models other than the standard 2-way ANOVA.

    Worked example

    Wheel running by mice is a standard measure of activity behavior. Even wild mice will use wheels (Meijer and Roberts 2014). For example, we conduct a study of family parity to see if offspring from the first, second, or third sets of births perform differently in wheel-running behavior (measured in total revs per 24 hr period). Each set of offspring from a female could be treated as a block. Data are for 3 female offspring from each pairing. This type of data set would look like this:

    Table \(\PageIndex{2}\). Wheel running behavior (revolutions of wheel per 24-hr period) by three offspring from each of three birth cohorts among four maternal sets (moms).
    Block Dam 1 Dam 2 Dam 3 Dam 4
    b1 1100 1566 945 450
    b1 1245 1478 877 501
    b1 1115 1502 892 394
    b2 999 451 644 605
    b2 899 405 650 612
    b2 745 344 605 700
    b3 1245 702 1712 790
    b3 1300 612 1745 850
    b3 1750 508 1680 910

    Thus, there were nine offspring for each female mouse (Dam1 – Dam4), three offspring per each of three litters of pups. The litters are the blocks. We need to get the data stacked to run in R. I’ve provided the dataset for you, so scroll to end of this page or click here.

    Question 1. Describe the problem and identify the treatment and blocking factors.

    Answer. Each female has three litters. We’re primarily interested in genetics (and maternal environment) of wheel running behavior, which is associated with the moms (Treatment factor). The questions is whether there is an effect of birth parity on wheel running behavior. Offspring of a first-time mother may experience different environment than offspring of an experienced mother. In this case, parity effects is an interesting question; nevertheless, blocking is the appropriate way to handle this type of problem.

    Question 2. What is the statistical model?

    Answer. Response variable, \(Y\), is wheel running. Let \(\alpha\) be the effect of Dam and \(\beta\) the birth cohorts (i.e., the blocking effect).

    \[Y_{ij} = \mu + \alpha_{i} + \beta_{j} + \epsilon_{i,j} \nonumber\]

    Question 3. Test the model.

    Answer. We fit the main effects (Dam and Block). \(Wheel \sim Dam + Block\)

    Rcmdr: Statistics → Fit models → Linear model

    Linear Model popup menu in R Commander. The model formula entered is Wheel as the sum of Block and Dam.
    Figure \(\PageIndex{1}\): Enter formula for the linear model in R Commander.

    then run the ANOVA summary to get the ANOVA table. Rcmdr: Models → Hypothesis tests → ANOVA table.

    Output

    Anova Table (Type II tests)
    
    Response: Wheel
               Sum Sq  Df  F value    Pr(>F) 
    Dam       1467020   3   4.4732   0.01036 * 
    Block     1672166   2   7.6482   0.00207 **
    Residuals 3279544  30

    Question 4. Conclusions?

    Answer. The null hypotheses are:

    Treatment factor: Offspring of the different dams have same wheel running activity of offspring.

    Blocking factor: No effect of litter parity on wheel running activity of offspring.

    Both the treatment factor (p = 0.01036) and the blocking factor (p = 0.00207) were statistically significant.

    Problem 1.

    Or we might want to measure the Systolic Blood Pressure of individuals that are on different exercise regimens. However, we are not able to measure all the individuals on the same day at the same time. We suspect that time of day and the day of the year might affect an individual’s blood pressure. Given this constraint, the best research design in this circumstance is to measure one individual on each exercise regime at the same time. These different individuals will then be in the same “block” because they share in common the time that their blood pressure was measured. This type of data set would look like this (Table \(\PageIndex{2}\)):

    Table \(\PageIndex{2}\). Simulated blood pressure of five subjects on three different exercise regimens.
    Subject No Exercise Moderate Exercise Intense Exercise
    1 120 115 114
    2 135 130 131
    3 115 110 109
    4 112 107 106
    5 108 103 102
    You’ll need to arrange the data like the data set for the worked example.

    Question 1. Describe the problem and identify the treatment and blocking factors.

    Question 2. What is the statistical model?

    Question 3. Test the model.

    Question 4. Conclusions?

    Problem 2.

    Another example in conservation biology or agriculture. There may be three different management strategies for promoting the recovery of a plant species. A good research design would be to choose many plots of land (blocks) and perform each treatment (management strategy) on a portion of each plot of land (block). A researcher would start with an equal number of plantings in the plots and see how many grew. The plots of land (blocks) share in common many other aspects of that particular plot of land that may affect the recovery of a species.

    Table \(\PageIndex{3}\). Growth of plants in 5 different plots subjected to one of three management plans (simulated data set).
    Plot No. No Management Used Management Plan 1 Management Plan 2
    1 0 11 14
    2 2 13 15
    3 3 11 19
    4 4 10 16
    5 5 15 12
    You’ll need to arrange the data like the data set for the worked example.

    These are examples of Two-Factor ANOVA but we are usually only interested in the treatment Factor. We recognize that the blocking factor may contribute to differences among groups and so wish to control for the fact that we carried out the experiments at different times (e.g., seasons) or at different locations (e.g., agriculture plots)

    Question 1. Describe the problem and identify the treatment and blocking factors.

    Question 2. What is the statistical model?

    Question 3. Test the model.

    Question 4. Conclusions?

    Repeated-Measures Experimental Design

    If multiple measures are taken on the same individual, then we have a repeated-measures experiment. This is a Randomized Block Design. In other words, each animal gets all levels of a treatment (assigned randomly). Thus, samples (individuals) are not independent and the analysis needs to take this into account. Just like for paired \(t\)-tests, one can imagine a number of experiments in biomedicine that would conform to this design.

    Problem 3.

    The data are total blood cholesterol levels for 7 individuals given 3 different drugs (from example, as given in Zar 1999, Ex 12.5, pp. 257-258).

    Table \(\PageIndex{4}\). Repeated measures of blood cholesterol levels of seven subjects on three different drug regimens.
    Subjects Drug 1 Drug 2 Drug 3
    1 164 152 178
    2 202 181 222
    3 143 136 132
    4 210 194 216
    5 228 219 245
    6 173 159 182
    7 161 157 165
    You’ll need to arrange the data like the data set for the worked example.

    Question 1: Is there an interaction term in this design?

    Question 2: Are individuals a fixed or a random effect?

    Question 2. What is the statistical model?

    Question 3. Test the model. Note that we could have done the experiment with 21 randomly selected subjects and a one-factor ANOVA. However, the repeated measures design is best IF there is some association (“correlation”) between the data in each row. The computations are identical to the randomized block analysis.

    Question 4. Conclusions?

    Problem 4.

    Here is a second example of a repeated measures design experiment. Garter snakes respond to odor cues to find prey. Snakes use their tongues to “taste” the air for chemicals, and flick their tongues rapidly when in contact with suitable prey items, less frequently for items not suitable for prey. In the laboratory, researchers can test how individual snakes respond to different chemical cues by presenting each snake with a swab containing a particular chemical. The researcher then counts how many times the snake flicks its tongue in a certain time period (data presented p. 301, Glover and Mitchell 2016).

    Table \(\PageIndex{5}\). Tongue flick counts of naïve newborn snakes to extracts
    Snake Control (dH2O) Fish mucus Worm mucus
    1 3 6 6
    2 0 22 22
    3 0 12 12
    4 5 24 24
    5 1 16 16
    6 2 16 16
    You’ll need to arrange the data like the data set for the worked example.

    Question 1. Describe the problem and identify the treatment and blocking factors.

    Question 2. What is the statistical model?

    Question 3. Test the model.

    Question 4. Conclusions?


    Additional questions

    1. The advantage of a randomized block design over a completely randomized design is that we may compare treatments by using ________________ experimental units.
      A. randomly selected
      B. the same or nearly the same
      C. independent
      D. dependent
      E. All of the above
    2. Which of the following is NOT found in an ANOVA table for a randomized block design?
      A. Sum of squares due to interaction
      B. Sum of squares due to factor 1
      C. Sum of squares due to factor 2
      D. None of the above are correct
    3. A clinician wishes to compare the effectiveness of three competing brands of blood pressure medication. She takes a random sample of 60 people with high blood pressure and randomly assigns 20 of these 60 people to each of the three brands of blood pressure medication. She then measures the decrease in blood pressure that each person experiences. This is an example of (select all that apply)
      A. a completely randomized experimental design
      B. a randomized block design
      C. a two-factor factorial experiment
      D. a random effects or Type II ANOVA
      E. a mixed model or Type III ANOVA
      F. a fixed effects model or Type I ANOVA
    4. A clinician wishes to compare the effectiveness of three competing brands of blood pressure medication. She takes a random sample of 60 people with high blood pressure and randomly assigns 20 of these 60 people to each of the three brands of blood pressure medication. She then measures the blood pressure before treatment and again 6 weeks after treatment for each person. This is an example of (select all that apply)
      A. a completely randomized experimental design
      B. a randomized block design
      C. a two-factor factorial experiment
      D. a random effects or Type II ANOVA
      E. a mixed model or Type III ANOVA
      F. a fixed effects model or Type I ANOVA

    Data sets used in this page

    Worked Example data set

    Block Dam Wheel
    B1 D1 1100
    B1 D2 1566
    B1 D3 945
    B1 D4 450
    B1 D1 1245
    B1 D2 1478
    B1 D3 877
    B1 D4 501
    B1 D1 1115
    B1 D2 1502
    B1 D3 892
    B1 D4 394
    B2 D1 999
    B2 D2 451
    B2 D3 644
    B2 D4 605
    B2 D1 899
    B2 D2 405
    B2 D3 650
    B2 D4 612
    B2 D1 745
    B2 D2 344
    B2 D3 605
    B2 D4 700
    B3 D1 1245
    B3 D2 702
    B3 D3 1712
    B3 D4 790
    B3 D1 1300
    B3 D2 612
    B3 D3 1745
    B3 D4 850
    B3 D1 1750
    B3 D2 508
    B3 D3 1680
    B3 D4 910

    This page titled 14.4: Randomized block design is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Michael R Dohm via source content that was edited to the style and standards of the LibreTexts platform.

    • Was this article helpful?