Skip to main content
Statistics LibreTexts

Biometrics Lab #4

  • Page ID
    2942
  • [ "article:topic", "authorname:dkiernan" ]

    Name: ______________________________________________________

    Experiment 1

    The following data were collected on Old Faithful geyser in Yellowstone Park. The x-variable is time between eruptions and the y-variable is length of eruptions.

    X

    Y

    12.17

    1.88

    11.63

    1.77

    12.03

    1.83

    12.15

    1.83

    11.30

    1.70

    11.70

    1.82

    12.27

    1.93

    11.60

    1.77

    11.72

    1.83

    12.10

    1.89

    11.70

    1.80

    11.40

    1.72

    11.22

    1.75

    11.42

    1.73

    11.53

    1.74

    11.50

    1.77

    11.90

    1.87

    11.86

    1.84

     a) Determine if a relationship exists between the 2 variables using a scatterplot and the linear correlation coefficient. Select GraphScatterplot. Select the Simple plot and click OK. Enter the response variable (length of eruptions) in the Y variables box, and the predictor variable (time between eruptions) in the X variables box. Click OK. Describe the relationship that you see.

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    b) Calculate the linear correlation coefficient. StatisticsBasic StatsCorrelation. Enter the 2 variables in the Variables box and click OK.

    r = ____________________________________

    What two pieces of information about the relationship between these two variables does the linear correlation coefficient tell you?

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    c) Find a least squares regression line treating “time between eruptions” as the predictor variable (x) and “length of eruptions” as the response variable (y). Stat>RegressionGeneral Regression.Enter “length of eruptions” in the Response box. Enter “time between eruptions” in the Model box. Click on Options and make sure that 95% is selected for all confidence intervals. Click on Graphsand select the Residual plot “Residual versus fits.” Click Results and make sure the Regression equation, Coefficient table, Display confidence intervals, Summary of model, Analysis of Variance table, and prediction tables are checked. Click OK.

    Write the regression equation __________________

    What is the value of R2? _______________________

    What does this mean?

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    Examine the residual model. Do you see any problems?

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    What is the value of the regression standard error? _____________________________

    Write the confidence intervals for the y-intercept ______________________________

    and slope ______________________________________________________________

    Use the output to test if the slope is significantly different from zero. Write the null and alternative hypotheses for this test.

    H0:____________________________________ 

    H1: ____________________________________

    Using the test statistic and p-value from the Minitab output to test this claim.

    Test statistic_______________________________ p-value _______________________________

    Conclusion:

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    d) Using the regression equation, what would be the length of the eruption if the time between eruptions is 11.42 min.?

    Experiment 2

    The index of biotic integrity (IBI) is a measure of water quality in streams. The sample data given in the table below comes from the Piedmont forest region. The table gives the data for IBI and forested area in square kilometers. Let Forest Area be the predictor variable (x) and IBI be the response variable (y).

    15108.png

    Create a scatterplot and describe the relationship between these variables. Compute the linear correlation coefficient.

    r = ____________________________________

    Create a regression model for this data set following the steps from the first example. Write the regression model.

    ________________________________________________________________________

    Is there significant evidence to support the claim that IBI increases with Forest Area? Write the test statistic/p-value used for this slope test along with your answer.

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    The researcher wants to estimate the population mean IBI for streams that have an average forested area of 48 sq. km. Click STAT>REGRESSION> GENERAL REGRESSION. Making sure that IBI is in the Response box and Forest Area is in the Model box, click on Prediction and enter 48 in the New observation for continuous predictors box and check Confidence limits. Click OK. Write the 95% confidence interval for mean IBI for streams in an average forested area of 48 sq. km. ______________________________________________________

    You are working with a stream in an area with 19 sq. km. of forested area. Your management plan includes an afforestation project that will increase the forested area to 23 sq. km. You need to predict what the specific IBI would be for this stream when the forested area is increased. Create a prediction interval to estimate this IBI if the forested area increased to 23 sq. km.

    Click STAT>REGRESSION>GENERAL REGRESSION. Making sure that IBI is in the Response box and Forest Area is in the Model box, click on Prediction and enter 23 in the New observation for continuous predictors box and check Prediction limits. Click OK. Write the 95% prediction interval for the IBI for this stream when the forested area is increased to 23 sq. km. ___________________________________________________

    Explain the difference between the confidence and prediction intervals you just computed.

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________

    ________________________________________________________________________