3.2.5.3 Statistical Approach to Define Developed Imperviousness

A statistical methodology is designed for the specific purpose of relating the imperviousness training statistics to statistics derived from coincidentally located Thematic Mapper imagery. Figure 24 illustrates one of the sample sites, and its corresponding Thematic Mapper composite, along with the statistics generated. Through some simple plotting of airphoto statistics versus TM statistics for the 15 sites, some strong linear trends are observed. (Figure 25) This prompted the pursuit of a multiple linear regression approach using the airphoto interpreted imperviousness as the response, and some combination of the TM statistics for the sample sites as the explanatory variables.

Selection of a regression was based upon the dual criteria of goodness of fit through the adjusted R-squared statistics, balanced with a desire for a set of explanatory variables that are not overly extensive, fitting expectations of what set of TM bands would be required. The first selected regression included TM Bands 3, 4, and 5, the ratio of band 4 to band 3, and the variance in TM band 5 (Table 7). While this produced favourable predictive results on the 15 sample sites, this selection did not take into account how the regression might operate on substantially different sets of TM statistics, found throughout the watershed.

Regress. Stats.

Multiple R

0.96

ANOVA

R Square

0.93

df

SS

MS

F

Sig. F

Adjusted R

0.89

Regression

5

4214.58

842.92

24.37

5.57 E-05

Std Error

5.88

Residual

9

311.34

34.59

Observations

15

Total

14

4525.92

Coefs

Std Err

t Stat

P-value

Lower 95%

Upper 95%

Lower 95%

Upper 95%

Intercept

-37.77

59.21

-0.64

0.54

-171.72

96.17

-171.72

96.17

TM3

4.19

1.38

3.04

0.01

1.07

7.32

1.07

7.32

TM4

-1.30

0.53

-2.47

0.04

-2.49

-0.11

-2.49

-0.11

TM5

-0.83

0.24

-3.50

0.01

-1.37

-0.29

-1.37

-0.29

TM5V

-0.08

0.03

-2.89

0.02

-0.13

-0.02

-0.13

-0.02

TM4OVER3

47.49

19.22

2.47

0.04

4.01

90.97

4.01

90.97

Testing the performance of the regression for a large number of possible combinations of TM statistics is implemented by using a kernel aggregation approach. GRASS's r.mfilter module is used to generate all possible combinations of TM statistics of a given sample size in the watershed.(Figure 26).

One consideration is that the sample statistics for 11x11 kernels of TM pixels might be different from the round sample of pixels. Correlation experiments on the 15 sample sites (Figure 27) show statistics for the round 800 hectare sample and the 11x11 kernel sample to be virtually identical (Table 8).

Photo

tm3circ

tm3sq11

Photo

tm3circ

tm3sq13

#L20-433

50775

49595

#L20-433

50775

49521

#L21-246

37462

37512

#L21-246

37462

37645

#L21-248

46200

45917

#L21-248

46200

46124

pearson

0.99905

pearson

0.99797

Photo

tm4circ

tm4sq11

Photo

tm4circ

tm4sq13

#L20-433

75326

75066

#L20-433

75326

75509

#L21-246

98970

98711

#L21-246

98970

97799

#L21-248

69400

69727

#L21-248

69400

69112

pearson

0.99986

pearson

0.99972

Photo

tm5circ

tm5sq11

Photo

tm5circ

tm5sq13

#L20-433

75589

73694

#L20-433

75589

74249

#L21-246

79955

79620

#L21-246

79955

79237

#L21-248

66454

66050

#L21-248

66454

65935

pearson

0.99179

pearson

0.99833

Photo

tm5vcirc

tm5vsq11

Photo

tm5vcirc

tm5vsq13

#L20-433

136009569

93947383

#L20-433

136009569

118568822

#L21-246

125845246

131354270

#L21-246

125845246

119479360

#L21-248

48823435

52980854

#L21-248

48823435

49501691

pearson

0.82296

pearson

0.99303

Photo

4over3circ

4over3sq11

Photo

4over3circ

4over3sq13

#L20-433

1512

1546

#L20-433

1512

1562

#L21-246

2707

2691

#L21-246

2707

2670

#L21-248

1535

1552

#L21-248

1535

1530

pearson

0.99992

pearson

0.99912

Photo

5over3circ

5over3sq11

Photo

5over3circ

5over3sq13

#L20-433

1495

1496

#L20-433

1495

1509.426036

#L21-246

2167

2152

#L21-246

2167

2139.544379

#L21-248

1453

1453

#L21-248

1453

1442.568047

pearson

1.00000

pearson

0.99941

The resulting regressed imperviousness map revealed many prediction errors resulting from this regression (Figure 29). Thus while this first regression is the best choice based upon the 15 member sample set, it does not perform well in the many possible statistical scenarios found throughout the watershed. Examination of the areas of extreme prediction errors (imperviousness above 100 and below 0) revealed that most errors are coincident with areas of high TM 5 variance, an included parameter. Another regression, reducing this type of error is calculated, using the criterion that no variance parameters be included. The explanatory variables of this next best regression were TM bands 3, 4, 5 and band 5 over 3. (Table 9)

Regression Statistics

Multiple R

0.93

ANOVA

R Square

0.87

df

SS

MS

F

Significance F

Adjusted R

0.82

Regression

4

3954.18

988.54

17.29

1.73E-04

Std Error

7.56

Residual

10

571.74

57.17

Observations

15

Total

14

4525.92

Coefs.

Std Error

t Stat

P-value

Lower 95%

Upper 95%

Lower 95%

Upper 95%

Intercept

-15.36

106.41

-0.14

0.89

-252.47

221.74

-252.47

221.74

TM3

0.00

0.00

1.42

0.19

0.00

0.01

0.00

0.01

TM4

0.00

0.00

-0.96

0.36

0.00

0.00

0.00

0.00

TM5

0.00

0.00

-1.78

0.11

0.00

0.00

0.00

0.00

TM5over3

0.05

0.05

1.04

0.32

-0.06

0.16

-0.06

0.16

This new regression, when subjected to the same 11x11 kernel testing methodology, produced maps of impervious similar to the previous attempt with far fewer extreme errors (Figure 30). The new regression still distinguished between more and less impervious land covers within the selected areas (as based upon visual comparison with the high resolution airphotos), producing similar patterns of percent imperviousness, and did not suffer from the extreme sensitivity to a single explanatory parameter (Figures 31, 32, and 33).

PREVIOUS | CONTENTS | NEXT


Questions? Email wisqqih2
HTML by David Tenenbaum