3.2.5.3 Statistical Approach to Define Developed Imperviousness
A statistical methodology is designed for the specific purpose of relating the imperviousness training statistics to statistics derived from coincidentally located Thematic Mapper imagery. Figure 24 illustrates one of the sample sites, and its corresponding Thematic Mapper composite, along with the statistics generated. Through some simple plotting of airphoto statistics versus TM statistics for the 15 sites, some strong linear trends are observed. (Figure 25) This prompted the pursuit of a multiple linear regression approach using the airphoto interpreted imperviousness as the response, and some combination of the TM statistics for the sample sites as the explanatory variables.
Selection of a regression was based upon the dual criteria of goodness of fit through the adjusted R-squared statistics, balanced with a desire for a set of explanatory variables that are not overly extensive, fitting expectations of what set of TM bands would be required. The first selected regression included TM Bands 3, 4, and 5, the ratio of band 4 to band 3, and the variance in TM band 5 (Table 7). While this produced favourable predictive results on the 15 sample sites, this selection did not take into account how the regression might operate on substantially different sets of TM statistics, found throughout the watershed.
|
Regress. Stats. |
||||||||
|
Multiple R |
0.96 |
ANOVA |
||||||
|
R Square |
0.93 |
df |
SS |
MS |
F |
Sig. F |
||
|
Adjusted R |
0.89 |
Regression |
5 |
4214.58 |
842.92 |
24.37 |
5.57 E-05 |
|
|
Std Error |
5.88 |
Residual |
9 |
311.34 |
34.59 |
|||
|
Observations |
15 |
Total |
14 |
4525.92 |
||||
|
Coefs |
Std Err |
t Stat |
P-value |
Lower 95% |
Upper 95% |
Lower 95% |
Upper 95% |
|
|
Intercept |
-37.77 |
59.21 |
-0.64 |
0.54 |
-171.72 |
96.17 |
-171.72 |
96.17 |
|
TM3 |
4.19 |
1.38 |
3.04 |
0.01 |
1.07 |
7.32 |
1.07 |
7.32 |
|
TM4 |
-1.30 |
0.53 |
-2.47 |
0.04 |
-2.49 |
-0.11 |
-2.49 |
-0.11 |
|
TM5 |
-0.83 |
0.24 |
-3.50 |
0.01 |
-1.37 |
-0.29 |
-1.37 |
-0.29 |
|
TM5V |
-0.08 |
0.03 |
-2.89 |
0.02 |
-0.13 |
-0.02 |
-0.13 |
-0.02 |
|
TM4OVER3 |
47.49 |
19.22 |
2.47 |
0.04 |
4.01 |
90.97 |
4.01 |
90.97 |
Testing the performance of the regression for a large number of possible combinations of TM statistics is implemented by using a kernel aggregation approach. GRASS's r.mfilter module is used to generate all possible combinations of TM statistics of a given sample size in the watershed.(Figure 26).

One consideration is that the sample statistics for 11x11 kernels of TM pixels might be different from the round sample of pixels. Correlation experiments on the 15 sample sites (Figure 27) show statistics for the round 800 hectare sample and the 11x11 kernel sample to be virtually identical (Table 8).
|
Photo |
tm3circ |
tm3sq11 |
Photo |
tm3circ |
tm3sq13 |
|
|
#L20-433 |
50775 |
49595 |
#L20-433 |
50775 |
49521 |
|
|
#L21-246 |
37462 |
37512 |
#L21-246 |
37462 |
37645 |
|
|
#L21-248 |
46200 |
45917 |
#L21-248 |
46200 |
46124 |
|
|
pearson |
0.99905 |
pearson |
0.99797 |
|||
|
Photo |
tm4circ |
tm4sq11 |
Photo |
tm4circ |
tm4sq13 |
|
|
#L20-433 |
75326 |
75066 |
#L20-433 |
75326 |
75509 |
|
|
#L21-246 |
98970 |
98711 |
#L21-246 |
98970 |
97799 |
|
|
#L21-248 |
69400 |
69727 |
#L21-248 |
69400 |
69112 |
|
|
pearson |
0.99986 |
pearson |
0.99972 |
|||
|
Photo |
tm5circ |
tm5sq11 |
Photo |
tm5circ |
tm5sq13 |
|
|
#L20-433 |
75589 |
73694 |
#L20-433 |
75589 |
74249 |
|
|
#L21-246 |
79955 |
79620 |
#L21-246 |
79955 |
79237 |
|
|
#L21-248 |
66454 |
66050 |
#L21-248 |
66454 |
65935 |
|
|
pearson |
0.99179 |
pearson |
0.99833 |
|||
|
Photo |
tm5vcirc |
tm5vsq11 |
Photo |
tm5vcirc |
tm5vsq13 |
|
|
#L20-433 |
136009569 |
93947383 |
#L20-433 |
136009569 |
118568822 |
|
|
#L21-246 |
125845246 |
131354270 |
#L21-246 |
125845246 |
119479360 |
|
|
#L21-248 |
48823435 |
52980854 |
#L21-248 |
48823435 |
49501691 |
|
|
pearson |
0.82296 |
pearson |
0.99303 |
|||
|
Photo |
4over3circ |
4over3sq11 |
Photo |
4over3circ |
4over3sq13 |
|
|
#L20-433 |
1512 |
1546 |
#L20-433 |
1512 |
1562 |
|
|
#L21-246 |
2707 |
2691 |
#L21-246 |
2707 |
2670 |
|
|
#L21-248 |
1535 |
1552 |
#L21-248 |
1535 |
1530 |
|
|
pearson |
0.99992 |
pearson |
0.99912 |
|||
|
Photo |
5over3circ |
5over3sq11 |
Photo |
5over3circ |
5over3sq13 |
|
|
#L20-433 |
1495 |
1496 |
#L20-433 |
1495 |
1509.426036 |
|
|
#L21-246 |
2167 |
2152 |
#L21-246 |
2167 |
2139.544379 |
|
|
#L21-248 |
1453 |
1453 |
#L21-248 |
1453 |
1442.568047 |
|
|
pearson |
1.00000 |
pearson |
0.99941 |
The resulting regressed imperviousness map revealed many prediction errors resulting from this regression (Figure 29). Thus while this first regression is the best choice based upon the 15 member sample set, it does not perform well in the many possible statistical scenarios found throughout the watershed. Examination of the areas of extreme prediction errors (imperviousness above 100 and below 0) revealed that most errors are coincident with areas of high TM 5 variance, an included parameter. Another regression, reducing this type of error is calculated, using the criterion that no variance parameters be included. The explanatory variables of this next best regression were TM bands 3, 4, 5 and band 5 over 3. (Table 9)
|
Regression Statistics |
||||||||
|
Multiple R |
0.93 |
ANOVA |
||||||
|
R Square |
0.87 |
df |
SS |
MS |
F |
Significance F |
||
|
Adjusted R |
0.82 |
Regression |
4 |
3954.18 |
988.54 |
17.29 |
1.73E-04 |
|
|
Std Error |
7.56 |
Residual |
10 |
571.74 |
57.17 |
|||
|
Observations |
15 |
Total |
14 |
4525.92 |
||||
|
Coefs. |
Std Error |
t Stat |
P-value |
Lower 95% |
Upper 95% |
Lower 95% |
Upper 95% |
|
|
Intercept |
-15.36 |
106.41 |
-0.14 |
0.89 |
-252.47 |
221.74 |
-252.47 |
221.74 |
|
TM3 |
0.00 |
0.00 |
1.42 |
0.19 |
0.00 |
0.01 |
0.00 |
0.01 |
|
TM4 |
0.00 |
0.00 |
-0.96 |
0.36 |
0.00 |
0.00 |
0.00 |
0.00 |
|
TM5 |
0.00 |
0.00 |
-1.78 |
0.11 |
0.00 |
0.00 |
0.00 |
0.00 |
|
TM5over3 |
0.05 |
0.05 |
1.04 |
0.32 |
-0.06 |
0.16 |
-0.06 |
0.16 |
This new regression, when subjected to the same 11x11 kernel testing methodology, produced maps of impervious similar to the previous attempt with far fewer extreme errors (Figure 30). The new regression still distinguished between more and less impervious land covers within the selected areas (as based upon visual comparison with the high resolution airphotos), producing similar patterns of percent imperviousness, and did not suffer from the extreme sensitivity to a single explanatory parameter (Figures 31, 32, and 33).