Back

Accuracy Assessment

Next

Image classification results in a raster file in which the individual raster elements are class labelled. As image classification is based on samples of the classes, the actual quality should be checked and quantified afterwards. This is usually done by a sampling approach in which a number of raster elements are selected and both the classification result and the true world class are computed. Comparison is done by creating an error matrix from which different accuracy measures can be calculated.

This chapter deals with the accuracy assessment of the LISS-3 MSS classified image (of spatial resolution 23.5 m) and MODIS classified image (of spatial resolution 250 m). The chapter is arranged into 3 sections. The first section (6.1) discusses the accuracy assessment of the LISS-3 supervised and unsupervised classification. MODIS classification accuracy assessment is shown in second section (6.2). Section 6.2 is further sub-divided into 2 sub sections. The first subsection (6.2.1) discusses the accuracy estimation of the MODIS hard classified maps. This assessment has been done at three levels – with the ground truth data collected from the field, at the administrative boundary level (Taluk level) and at the pixel level; by comparing on pixel by pixel basis with respect to LISS-3 classified map. The second subsection (6.2.2) shows the accuracy assessment of the soft classification at the administrative boundary level and at the pixel level, compared to LISS-3 classified image. The chapter concludes with a short discussion on the results of hard classification, soft classification and accuracy assessment (section 6.3).

Acronym used in this chapter:

•  MLC (LISS-3) – Maximum Likelihood classification on LISS-3 MSS.

•  K-Means (B1 to B7) – K- Means Clustering algorithm applied on MODIS Bands 1 to 7.

•  MLC (B1 to B7) – Maximum Likelihood classification on MODIS Bands 1 to 7

•  SAM (B1 to B7) – Spectral Angle Mapper applied on MODIS Bands 1 to 7.

•  NN (B1 to B7) – Neural Network applied on MODIS Bands 1 to 7.

•  DTA (B1 to B7) – Decision Tree Approach applied on MODIS Bands 1 to 7.

•  MLC (PCA) – Maximum Likelihood classification applied on PC 1 of MODIS 36 bands.

•  SAM (PCA) – Spectral Angle Mapper applied on PC 1 of MODIS 36 bands.

•  NN (PCA) - Neural Network applied on PC 1 of MODIS 36 bands.

•  DTA (PCA) – Decision Tree Approach applied on PC 1 of MODIS 36 bands.

•  MLC (MNF) – Maximum Likelihood classification on MNF 2 components of MODIS 36 bands.

•  SAM (MNF) – Spectral Angle Mapper applied on MNF 2 components of MODIS 36 bands.

•  NN (MNF) – Neural Network applied on MNF 2 components of MODIS 36 bands.

•  DTA (MNF) – Decision Tree Approach applied on MNF 2 components of MODIS 36 bands.

1 PC – Principal Components

2 MNF – Minimum Noise Fraction

Back
6.1 LISS-3 MSS Classification Accuracy

Accuracy assessment was done for Chikballapur administrative unit (taluk), where field data were collected on 5 th September and 25 th November 2005. Figure 6.1 shows overview of the Chikballapur taluk and its location in Kolar district. Table 6.1 gives the area of each taluk in hectares.

Figure 6. 1 : Chikballapur taluk in Kolar district where field data were collected.

Table 6. 1 : Area statistics of each taluk in Kolar district.

Taluks

Area (in hectares)

Bagepalli

92898

Gudibanda

22727

Gauribidanur

88859

Chikballapur

63821

Sidlaghatta

67057

Chintamani

88904

Srinivaspur

86279

Kolar

79218

Mulbagal

81974

Malur

64500

Bangarpet

86806

Total

823043

The producer's and user's accuracy (as explained in chapter 4) corresponding to the various categories, and the overall accuracy were computed, along with the error matrices for supervised and unsupervised classified MSS data of LISS-3, which is summarised in table 6.2 and Annexure D (table 1 and 2). The LISS-3 supervised classification accuracy assessment gave a kappa (k) value of 0.95 indicating that an observed classification is in agreement to the order of 95 percent.

Table 6. 2 : Producer's accuracy, user's accuracy and overall accuracy of land cover classification using LISS-3 MSS data for Chikballapur Taluk.
Supervised Classification Unsupervised Classification

Category

Producer's Accuracy (%)

User's Accuracy (%)

Overall accuracy (%)

Producer's Accuracy (%)

User's Accuracy (%)

Overall Accuracy (%)

Agriculture

94.21

84.54

95.63

94.47

83.39

90.22

Built up

96.47

83.11

89.68

80.30

Forest

94.73

96.20

86.77

89.71

Plantation

92.27

91.73

84.44

90.10

Waste land

97.49

89.88

93.03

93.37

Water bi\odies

96.13

98.33

92.91

94.89

Back

6.2 MODIS Classification Accuracy

6.2.1 Accuracy Assessment of hard classification for MODIS

Using Error matrix

User's (3) , Producer's ( 4 ) and Overall accuracy assessment of the MODIS classified maps (using hard classifier) was done for Chikballapur taluk with the ground truth data and the results are listed in Table 6.3, 6.4 and 6.5 respectively.

Table 6. 3 : Overall Accuracy of classified MODIS Data of Chikballapur taluk.

Algorithms

Overall Accuracy

Rank

NN (MNF)

86.11

1

MLC (B1 to B7)

75.99

2

DTA (MNF)

74.66

3

DTA (B1 to B7)

71.89

4

NN (PCA)

71.02

5

DTA (PCA)

70.10

6

SAM (B1 to B7)

69.41

7

NN (B1 to B7)

68.88

8

K-Means (B1 to B7)

59.88

9

SAM (MNF)

49.27

10

MLC (MNF)

42.22

11

SAM (PCA)

35.22

12

MLC (PCA)

30.44

13

 

Table 6. 4 : User's Accuracy of classified MODIS Data of Chikballapur taluk.

Algorithms

Agriculture

Built up

Forest

Plantation

Waste land

Waterbodies

K-Means (B1 to B7)

54.23

75.44

79.57

61.88

82.63

48.07

MLC (B1 to B7)

84.01

84.77

94.21

51.33

84.33

51.49

SAM (B1 to B7)

46.63

71.71

84.05

61.18

87.53

45.61

NN (B1 to B7)

94.00

80.80

94.65

59.40

93.87

45.55

DTA (B1 to B7)

64.44

98.27

89.21

63.35

62.55

88.91

MLC (PCA)

71.22

69.99

89.05

86.48

67.33

64.41

SAM (PCA)

53.44

65.31

88.32

54.97

73.99

61.45

NN (PCA)

97.33

95.18

67.67

95.38

74.07

48.00

DTA (PCA)

63.12

79.11

81.11

69.33

66.29

63.01

MLC (MNF)

43.33

61.02

91.23

61.22

85.11

41.22

SAM (MNF)

57.45

49.94

53.22

63.07

81.05

70.34

NN (MNF)

93.89

94.46

89.13

85.60

74.22

59.40

DTA (MNF)

74.44

62.22

74.58

61.37

71.35

55.18

(3) - It is a measure of commission error indicating the probability that a pixel classified into a given category actually

represents that category on the ground.

(4) - It indicates how well training set pixels of the given cover type are classified.

Table 6. 5 : Producer's Accuracy of classified MODIS Data of Chikballapur taluk.

Algorithms

Agriculture

Built up

Forest

Plantation

Waste land

Waterbodies

K-Means (B1 to B7)

73.33

87.48

62.88

41.68

71.19

30.83

MLC (B1 to B7)

76.49

81.66

73.64

36.31

76.82

89.35

SAM (B1 to B7)

63.49

81.53

71.42

27.99

51.61

70.01

NN (B1 to B7)

56.73

99.00

73.07

96.60

89.53

68.56

DTA (B1 to B7)

40.91

93.04

90.71

44.98

71.62

23.36

MLC (PCA)

53.21

69.78

48.26

69.91

92.20

91.11

SAM (PCA)

86.79

86.21

24.52

67.05

33.09

44.07

NN (PCA)

57.55

93.00

94.00

93.00

93.00

73.51

DTA (PCA)

51.93

81.33

86.21

35.47

53.02

36.49

MLC (MNF)

71.66

56.77

45.35

66.13

45.77

99.00

SAM (MNF)

47.66

31.89

31.42

51.01

45.51

67.22

NN (MNF)

69.99

91.89

87.24

99.00

95.00

56.93

DTA (MNF)

49.91

79.66

81.66

83.33

77.09

36.84

From the overall accuracy assessment, NN on MNF and MLC classification on MODIS bands 1 to 7 performed better than the other techniques.

Comparison based on land cover class percentage area

Accuracy Assessment of MODIS classified maps was also performed at two spatial scales – at the administrative boundary level (Taluk) and at the pixel level. Land cover statistics were computed for all taluks pertaining to each classification algorithm. Land cover percentage area versus various algorithms for Chikballapur is shown in table 6.6. Percentage area statistics for the six land cover classes were computed for all the taluks and were compared on the basis of the different algorithms (please see Annexure D, table 3 to table 13). Figure 6.2 shows the best (left) to worst (right) algorithms for land cover class mapping in Chikballapur taluk.

Table 6. 6 : Land Cover statistics for Chikballapur Taluk.

Algorithms

Agriculture

Built up

Forest

Plantation

Waste land

Waterbodies

MLC (LISS3)

28.07

11.51

17.06

11.56

31.04

0.76

K-Means (B1 to B7)

18.1

19.48

16.45

16.74

28.72

0.03

MLC (B1 to B7)

28.31

11.71

16.35

10.4

32.72

0.51

SAM (B1 to B7)

48.16

21.38

12.97

14.91

35.38

0.5

NN (B1 to B7)

15.83

20.56

11.11

28.37

23.23

0.89

DTA (B1 to B7)

18.79

16.99

23.43

9.68

30.72

0.39

MLC (PCA)

40.27

7.09

13.06

4.89

34.01

0.68

SAM (PCA)

17.4

8.28

15.37

14.96

43.82

0.16

NN (PCA)

21.34

0

21.46

8.04

49.03

0.13

DTA (PCA)

27.73

24.65

13.9

11.19

21.86

0.67

MLC (MNF)

23.72

3.48

14.62

16.1

41.87

0.21

SAM (MNF)

30.73

0

28.74

4.02

36.21

0.3

NN (MNF)

28.69

10.1

15.72

10.66

34.13

0.7

DTA (MNF)

16.08

14.84

25.73

7.06

35.64

0.65

 

Figure 6. 2 : Best (left) to worst (right) classification algorithms for mapping land cover classes in Chikballapur taluk. LISS-3 MSS classified image is used for comparison.

The overall taluk wise percentage area for various algorithms shows that NN based classification on MODIS MNF is most comparable to LISS-3 classified image and is suitable for land cover classification at regional level. MLC on MODIS bands 1 to 7 also proved to be second best algorithm for classification.

Pixel to pixel analysis with LISS-3 MSS classified image

MODIS classified data were also compared with LISS-3 MSS classified data on a pixel by pixel basis for accuracy assessment of pure (homogenous) pixels. One pixel of MODIS spatially corresponds to 121 pixels (that is approximately equal to 258.5 m) of LISS-3. The error matrix was generated with user's accuracy, producer's accuracy and overall accuracy for the taluk and is listed in table 6.7, 6.8 and 6.9. This analysis also indicates that NN (MNF) classification technique is relatively better among 13 techniques.

Table 6. 7 : Overall Accuracy obtained from pixel to pixel analysis with LISS-3 image comparison for Chikballapur Taluk.

Algorithms

Overall Accuracy

Rank

NN (MNF)

69.87

1

MLC(B1 to B7)

65.77

2

NN (PCA)

63.69

3

DTA (MNF)

62.38

4

DTA (PCA)

62.22

5

DTA(B1 to B7)

61.40

6

K-Means(B1 to 7)

57.64

7

SAM(B1 to B7)

56.66

8

MLC (PCA)

53.02

9

SAM (PCA)

51.38

10

NN(B1 to B7)

51.34

11

SAM (MNF)

47.38

12

MLC (MNF)

41.55

13

 

Table 6. 8 : User's Accuracy obtained from pixel to pixel analysis with LISS-3 image comparison for Chikballapur taluk .

Algorithms

Agriculture

Built up

Forest

Plantation

Waste land

Waterbodies

K-Means (B1 to B7)

41

11

63

56

85

40

MLC (B1 to B7)

69

39

72

65

85

51

SAM (B1 to B7)

37

29

75

64

61

49

NN (B1 to B7)

37

17

59

44

87

29

DTA (B1 to B7)

41

29

49

34

88

48

MLC (PCA)

36

24

50

48

82

49

SAM (PCA)

17

31

71

64

79

51

NN (PCA)

20

45

61

69

81

56

DTA (PCA)

34

41

49

67

82

37

MLC (MNF)

40

45

53

69

92

55

SAM (MNF)

21

45

39

61

88

33

NN (MNF)

41

55

61

75

81

65

DTA (MNF)

31

27

64

61

81

55

 

Table 6. 9 : Producer's Accuracy obtained from pixel to pixel analysis with LISS-3 image comparison for Chikballapur taluk.

Algorithms

Agriculture

Built up

Forest

Plantation

Waste land

Waterbodies

K-Means (B1 to B7)

51

44

55

56

62

55

MLC (B1 to B7)

45

56

66

73

66

55

SAM (B1 to B7)

41

55

31

54

59

41

NN (B1 to B7)

46

65

19

41

60

45

DTA (B1 to B7)

51

44

53

67

65

43

MLC (PCA)

65

49

32

61

66

39

SAM (PCA)

21

41

29

66

74

37

NN (PCA)

61

29

38

69

78

45

DTA (PCA)

41

56

38

59

61

42

MLC (MNF)

58

57

41

57

60

48

SAM (MNF)

29

55

45

61

61

42

NN (MNF)

59

41

55

76

83

65

DTA (MNF)

31

44

53

69

71

58

Back
6.2.2 Accuracy Assessment of Soft classification for MODIS

Comparison on the basis of percentage area with LISS-3 MSS

Table 6.10 shows the land cover statistics for Chikballapur taluk obtained from the abundance maps generated using Linear Mixture Model (LMM), which is compared with LISS-3 classified map. All categories show comparable values of MODIS to LISS-3 except waterbodies that were overestimated in the area. For the land cover details of each taluk see Annexure D (table 14).

Table 6. 10 : Land Cover details of fraction images for Chikballapur taluk.

Class

LMM

LISS-3

Agriculture (%)

25.34

28.07

Built up (%)

11.94

11.51

Forest (%)

16.44

17.06

Plantation (%)

12.12

11.56

Waste land (%)

31.83

31.04

Waterbodies (%)

02.33

0.76

 

•  Accuracy Assessment based on Pixel Analysis

The taluk was divided into 16 quadrants of 5 min by 5 min (9 km by 9 km) gird and homogeneous (pure) pixels were verified in each grid. At the pixel level, a total of 51 parcels with respect to the six land cover classes were verified using handheld Magellan GPS in Chikballapur taluk and table 6.11 provides the validation results.

Table 6. 11 : Validation of land cover classes in Chikballapur.

Class

Total no. of pixels identified

Pure pixels

Mixed pixels

Wrongly classified

Agriculture

23

18

5

0

Built up (Urban / Rural)

6

4

2

0

Evergreen / Semi-Evergreen Forest

2

1

1

0

Plantation/Orchards

11

7

4

0

Waste land / Barren Rock / Stony waste

6

4

2

0

Waterbodies

3

1

0

2

Total

51

35

14

2

 

Waterbodies which constitute less than 1% of the total area in the district were misclassified and were wrongly estimated. Field verification showed that most of the pure pixels as well as few mixed pixels (for example agriculture with plantations/orchards, open forest class mixed with plantation and barren land mixed with urban class) were classified correctly.

Back

6.3 Discussion

6.3.1 Hard Classification

Hard classification technique performs well with high spatial resolution as in the case with LISS-3 image when the actual ground cover is heterogeneous in nature. Good training site and ancillary data, along with minimum difference between the date of acquisition and date of ground truth collection further aids in achieving better classification result. MODIS classified image with coarse spatial resolution had many pixels that were misclassified as is clear from the Accuracy Assessment.

Further, within the land cover parameter, errors are generated when the classification algorithm selects the wrong class. With respect to a particular class, errors of omission occur when pixels of that class are assigned wrong labels; errors of commission occur when other pixels are wrongly assigned the label of the class considered. These errors occur when the signal of a pixel is ambiguous, perhaps as a result of spectral mixing, or when the signal is produced by a cover type that is not accounted for in the training process. These errors are a normal part of the classification process. They can be minimized, but not avoided entirely.

As additional argument it can be said that as pixel size increases, the chances of high accuracies in hard classifications being product of random assignment of values declines.

6.3.2 Soft Classification using Linear Spectral Unmixing

For Linear Mixture Modelling (LMM), the selection of the endmembers has an important effect. This is essential to properly identify different endmembers for MODIS since it proved to be highly relevant, as it is clear from the results that a high adjacency effect exists between contrasting features (forest and plantation, for example). When comparing LISS-3 MSS and MODIS, another important fact that can be attributed to LISS-3 is its spatial resolution (23.5), the reason for its superior results in the detection of the land cover class distribution and MODIS data having a coarse spatial resolution (250 m), where misclassification of the pixels are more likely.

The reason for the MODIS low accuracy can be the presence of another singularity: adjacency effects. The results suggest that this could be a limitation to the use of MODIS data in areas of high contrasting nature, as is the case of waterbodies, and raises concerns about its application especially on heavily fragmented or small isolated areas.

When unmixing an image to generate the abundance maps, from the presence of endmembers which do not mix in a linear fashion is a matter of concern. Errors in sensor operation and sensor noise can also add to the ones already mentioned. These errors take place when the selection of endmembers is wrong, for example when these do not correspond to pure pixels. The determination of endmember fractions for MODIS in the study area exhibits good behaviour in places where the endmembers are present, generally showing the capacity of estimating values within 20 % to 25 % of the actual values.

The Linear Spectral Unmixing (LSU) of MODIS on the other hand presents more errors which are avoidable. The overall values on the difference maps show only slight divergence, producing good estimates across the whole maps, except for the waterbodies endmembers. It suggest that the selection for this endmember could be improved, which is reasonable as this endmember shows high spectral variability (for example, clear water is different from turbid water or muddy water). When no more recognizable patterns are found and the RMSE is overall small, it can be deduced that a near perfect model has been reached.

The spectra of the different endmembers were modelled by linear equations. This introduces another form of quality assurance, as these equations can be inverted taking the endmember spectra to reconstruct the spectra of each pixel from the original image. Divergences between the original and

reconstructed image also give idea of the goodness of fit of the model, and give insights to which bands add more to the errors.

Back
6.3.3 Classification Accuracy

When assessing the accuracies of the image classifications, several things have to be kept in mind. The most important one is the way of data collection that served as input for the classification of the reference set.

Reference data include both ground truth data and other ancillary data. The time difference for the source data used in developing a reference dataset, the reference (field based) date and the MODIS and LISS-3 MSS acquisition dates impact both utility and accuracy of classified maps. Errors may have been introduced since the training data, the image and validation data are not temporally coincident with the sensor observations. Every possible measure was taken to minimize this effect, relating other ancillary data and vegetation presence information with the visual interpretation of the fused LISS-3 MSS (23.5 m) and IRS-1C/1D Panchromatic data (of spatial resolution 5.8 m) and having acquired information about that time from other sources. Nevertheless, even when counting with a high resolution image, which certainly helps to ameliorate the effects of the sampling, care has to be taken in the interpretation of the results, as the presence of bias is likely to happen. To summarize, it is not certain that the reference data in this case actually is representative of the entire classification.

Another source of bias that influences the conclusions that can be drawn from the classification is the quality in the co-registration of the images [134]. This is still a matter of consideration when working with MODIS data, as the quality of its georeferencing could mean a maximum of 10 to 15 % of error in the area of a pixel. Although care was taken in the geo-referencing process so as to minimize the RMSE error yet, small misregistration may be another significant source of error. Since each MODIS measurement is geolocated, this problem amounts to uncertainty in true geolocation. Therefore, although no accurate value can be given, it becomes evident that MODIS subpixel classification performance is definitely better, when mixed pixels are present.