Assignment 7

Due Date: Friday, March 9, 2006

The Effect of Fungal Spore Treatment on European Corn-borer—A Regression Model

The information given below is identical to what was presented in Assignment 4. You may just want to skip over it to the questions that follow.

These data are taken from Beall (1940) and are reproduced in Evans (1953). The object is to determine the effect of fungus (Beauveria Bassiana) spore treatment on the control of European corn borer, Pyrausta nubilalis. Four different treatment were used as described below. Recorded applications are in grams per acre.

Treatment
Application on July 8
Application on July 19
1
0
0
2
0
40
3
40
0
4
40
40

The first treatment occurred at the beginning of the period of oviposition while the second treatment was administered at the height of the period of oviposition. Eggs were allowed to hatch and the larva to grow to full size. What is recorded are the number of borers that were found on a unit area occupied by a hill of corn. For each treatment 120 hills of corn were examined. The table below records the number of hills exhibiting different levels of infestation (first column) under the the four different treatment regimes.

# of borers per hill of corn
Frequency
Trt 1
Trt 2
Trt 3
Trt 4
0
19
24
43
47
1
12
16
35
23
2
18
16
17
27
3
18
18
11
9
4
11
15
5
7
5
12
9
4
3
6
7
6
1
1
7
8
5
2
1
8
4
3
2
0
9
4
4
0
10
1
3
1
11
0
0
1
12
1
1
13
1
15
1
17
1
19
1
26
1

The Questions

As promised in Assignment 4 we'll now learn how to analyze these data more efficiently.

Question 1 Fit a single regression model (or at most two models) and test these data for a treatment effect. Use whatever methodology you think appropriate.

Hint 1 : You may use what you learned in Assignment 4 as a basis for choosing an appropriate model. I don't expect you to screen a large number of models here.

Hint 2 : By a treatment effect I'm referring to what might be called an omnibus test of treatment, i.e., does a model that includes treatment as part of the model do a better job of summarizing the data than one that ignores treatment altogether?

Question 2  I would now like you to fit a single regression model that not only tests for an overall treatment effect, but also tests the following three hypotheses simultaneously.

    1. Treatment 4 involves a double application of fungal spores. Test whether a double application yields a different result than a single application (ignoring the timing of the single application). This should be a contrast of treatment 4 with treatments 2 and 3 combined.
    2. Treatments 2 and 3 are both single applications of fungal spores, but their timing is different. Test whether the timing of the application makes a difference. This should be a contrast of treatment 2 with treatment 3.
    3. Test whether any application of fungal spores at all yields a result different from no application. This should be a contrast of treatment 1 with treatments 2, 3, and 4 combined.

What conclusions do you draw for each of these tests?

Question 3  Are the contrasts specified in Question 2 orthogonal contrasts? Why or why not?

Question 4  Create an error bar plot in which you display 95% Wald confidence intervals for the mean much like you did in Question 3 of Assignment 4. This time though use only the results from the regression model you fit in Question 1 of this assignment.

Hint 3 : Start by writing out the equations for treatments 1 through 4 based on the regression model fit in Question 1 and the coding scheme that was used for the treatment regressors that appear in the model. This will give you the equations you need for calculating the treatment means that are predicted by the model.

Hint 4 : Now that you have formulas for each of the treatment means based on parameters in the regression model, use these formulas to obtain estimates of the standard errors of the individual means. Note: To calculate the standard errors you will need to extract various elements of the estimated covariance matrix of the parameter estimates.

Hint 5 : To obtain the covariance matrix I recommend applying the summary function to your model object and saving the results to a variable. If you then apply the names function to this variable you'll see both cov.unscaled and cov.scaled as list elements. These matrices should be identical here and either one can be used. If you use the ls.diag function on the other hand the cov.unscaled and cov.scaled elements are different, and they should not be. The values in the cov.unscaled element of the ls.diag object are correct. I don't know what the values in cov.scaled represent in this case. The vcov function also returns these same mysterious values.

Hint 6 : Don't worry about adjusting the α-level when constructing the confidence interval. Just use α = .05. If you don't know what I'm talking about here, then just ignore this hint.

Hint 7 : If there's a link function involved, then remember that all of your parameter values are on the scale of the link. In calculating confidence intervals calculate the endpoints of the intervals on the scale of the link (it's on this scale that the distributional assumptions are supposed to hold) and only at the very end back-transform using the inverse link to the original scale of the variables.

Question 5  How does the picture you drew in Question 4 compare to the comparable picture you constructed in Assignment 4?

Cited References

Beall, Geoffrey. 1940. The fit and significance of contagious distributions when applied to observations on larval insects. Ecology 21(4): 460–474.
Evans, D. A. 1953. Experimental evidence concerning contagious distributions in ecology. Biometrika 40: 186–211.

Course Home Page


Jack Weiss
Phone: (919) 962-5930
E-Mail: jack_weiss@unc.edu
Address: Curriculum in Ecology, Box 3275, University of North Carolina, Chapel Hill, 27516
Copyright © 2006
Last Revised--March 23, 2006
URL: http://www.unc.edu/courses/2006spring/ecol/145/001/docs/assignments/assign7.htm