Spatial Data Mining and Modeling for Visualisation of Rapid Urbanisation

Results and discussions

1.  Classification of IRS LISS-III (2006) data : IRS LISS-III data acquired on 28th March, 2006 having 3 spectral bands (Green, Red and NIR) with 23.5 m spatial resolution were georegistered with known GCPs and geometrically corrected and resampled to 25 m (1360 x 1400 pixels) which helped in pixel level comparison of abundance maps obtained by unmixing MODIS data (of 2006) with LISS-III classified map. The class spectral characteristics for four LC categories (buildings/urban, vegetation, water bodies and others) using LISS-III MSS bands 2, 3 and 4 were obtained from the training pixels spectra to assess their inter-class separability and the images were classified using Gaussian Maximum Likelihood classifier with training data uniformly distributed over the study area collected with pre calibrated GPS (figure 2).


Figure 2: LC classification using LISS-III MSS (2006). (Self Compiled)

This was validated with the representative field data (training sets collected covering the entire city and validation covering ~ 10% of the study area) and also using Google Earth image (http://www.earth.google.com). LC statistics, producer’s accuracy, user’s accuracy and overall accuracy computed are listed in table 2. The urban pixels were extracted from the classified image and were used as a benchmark to validate the output obtained from OSP.

Table 2: LC details using LISS -III MSS (Self Compiled)

Class Area (Ha) Area (%) P*
(%)
U*
(%)
O*
(%)
Builtup 29535 43.2 85.2 84.5

84.6
Vegetation 19696 28.8 83.5 88.1
Water bodies 1073 01.6 90.7 80.2
Others 18017 26.4 78.3 77.7
P* - Producer’s Accuracy, U* - User’s Accuracy, O* - Overall Accuracy

2.  Unmixing of MODIS data using OSP : MODIS data (of 2006) having 7 bands with band 1 and 2 at 250 m and band 3 to 7 at 500 m spatial resolution were georegistered with GCPs, geometrically corrected and bands 3 to 7 were resampled to 250 m (136 x 140 pixels) so that all 7 bands are at same spatial resolution. Unmixing of MODIS data (250 m) were done to obtain abundance map of the impervious surface. Visual inspection as well as accuracy assessment of abundance image showed that OSP algorithm maps impervious surface spatially and proportionally similar to supervised classified image of LISS-III (of 2006). The proportions of urban pixels in the image (figure 3) range from 0 to 1, with 0 indicating absence and increasing value showing higher abundance.


Figure 3: LC abundance maps (of 2006) for builtup, vegetation, water bodies and others. (Self Compiled)

Accuracy assessment at pixel level: A pixel of MODIS (250 m) corresponds to a kernel of 10 x 10 pixels of LISS-III spatially (resampled to 25 m). The builtup abundance map was validated pixel by pixel with the LISS-III builtup map. The proportion of builtup percentage obtained in these 136 x 140 cells were compared using three measures of performance – Root Mean Square (RMS) error (0.160), Bivariate Distribution Functions (BDFs) between LISS-III classified (real proportion) and MODIS based abundance map (predicted proportions), and Wilcoxon signed rank test. The bivariate plot (figure 4) was helpful in visualising the accuracy of prediction by mixture models. While most of the points fall between the two 20 % difference lines, some points are outside the lines indicating that there could be higher or lower estimation than the actual values. The Pearson's product-moment correlation was 0.84 (p-value < 2.2e-16). 10 % of the difference pixels between the LISS-III proportion and the MODIS abundance were randomly selected at the same spatial locations in the two images by random number generation. These difference data values failed when subjected to normality test (table 3).


Figure 4: BDF of LISS-III classified and MODIS based abundance map. (Self Compiled)

Table 3: Normality tests for the difference in LISS-III and MODIS proportions (Self Compiled)

Normality test p-value
Shapiro-Wilk 2.2e-16
Anderson-Darling 2.2e-16
Kolmogorov-Smirnov 2.2e-16
Cramer-von Mises Infinity
Lilliefors 2.2e-16

The data points corresponding to builtup proportion in a MODIS pixel and corresponding LISS-III pixels were then subjected to box-plot and histogram analysis. The distribution was symmetrical, hence Wilcoxon signed rank test with continuity correction was performed (r = 0.66, p-value < 2.2e-16), which showed less evidence against null hypothesis – “true location of median of the difference between two data sets is equal to 0”.

The abundance image was further analysed with a minimum detectable threshold of 10 % and in increments of 10 % (i.e., 20 to 30 %, 30-40 %, …., 90 to 100 %) as shown in figure 5. Objects representing different brightness classes of impervious surface are typically selected to be mapped based on the tonal variations in the builtup landscape, which constitute spectral classes. The results hold great promise for improved accessibility and accuracy of this important urban indictor at the local and city level. Table 4 depicts the percentage impervious surface from abundance map.


Figure 5: Percent impervious surface layer for the Bangalore city. (Self Compiled)

Table 4: Percent Impervious surface from abundance map (Self Compiled)

Percent Impervious surface Area
Ha %
0 – 10 % 7019.57 9.97
11 – 20 % 5853.89 8.31
21 – 30 % 7410.17 10.52
31 – 40 % 10126.27 14.38
41 – 50 % 10199.57 14.48
51 – 60 % 9149.63 12.99
61 – 70 % 9863.94 14.01
71 – 80 % 7050.32 10.01
81 – 90 % 3485.47 4.95
91 – 100 % 274.69 0.39

3.  Forest fragmentation indices : Supervised classification was performed on temporal datasets for Bangalore city with the Bayesian classifier using the open source programs (i.gensig, i.class and i.maxlik) of Geographic Resources Analysis Support System (http://wgbis.ces.iisc.ernet.in/grass). Accuracy assessment was done using field knowledge, and visual interpretation which showed an overall accuracy of 72 %, 75 %, 71 %, 77 % 85 % and 55 % for the year 1973, 1992, 1999, 2000, 2006 and 2007. The class statistics is given in table 5.

Table 5: Greater Bangalore land cover statistics (Self Compiled)

Class
Year
Built up Vegetation Water Bodies Others
1973 Ha 5448 46639 2324 13903
% 7.97 68.27 3.40 20.35
1992 Ha 18650 31579 1790 16303
% 27.30 46.22 2.60 23.86
1999 Ha 23532 31421 1574 11794
% 34.44 45.99 2.30 17.26
2000 Ha 24163 31272 1542 11346
% 35.37 45.77 2.26 16.61
2006 Ha 29535 19696 1073 18017
% 43.23 28.83 1.57 26.37
2007 Ha 30876 17298 1005 19143
% 45.19 25.32 1.47 28.01

From the classified raster data, urban class was extracted and converted to vector representation for computation of precise area in hectares indicating 466 % increase in built up area from 1973 to 2007 as evident from temporal analysis leading to a sharp decline of 61% area in water bodies mostly attributing to intense urbanisation process. Rapid urbanisation and sprawl has many detrimental effects including the loss of valuable agricultural and eco-sensitive (e.g. wetlands, forests) lands, enhanced energy consumption and greenhouse gas emissions from increasing private vehicle use and also from modern glazed architectures. Vegetation has decreased by 63 % from 1973 to 2007. Using the results from classification, forest fragmentation indices were computed as shown in figure 6 and the statistics are presented in table 6.

Applying forest fragmentation indices to a time series LC data provided a quantitative assessment of the pattern of forest fragmentation at each date and a means for tracking trends in forest fragmentation. Patch forest which was only 2.45 % in 1973 has increased up to 9.6 % in 2006 and interior forest has decreased from 54 % in 1972 to 27.7 % in 2006.

Table 6: Composition of forest fragmentation (Self Compiled)

Fragmentation
Categories
1973 1992 1999 2000 2006
Ha % Ha % Ha % Ha % Ha %
Patch 1120 2.45 1950 6.34 1788.6 5.91 1944 6.47 1810 9.57
Transitional 3789 8.28 4259 13.84 3816 12.62 3999 13.31 3956 20.92
Edge 14717 32.16 10709 34.81 9516 31.46 10593 35.25 6050 31.99
Perforated 1453 3.18 1447 4.70 1410 4.66 1219 4.06 1859 9.83
Interior 24675 53.93 12401 40.31 13715 45.35 12291 40.91 5236 27.69
Total 45756 100 30766 100 30244 100 30047 100 18909 100


Figure 6: Forest fragmentation map (1972, 1992, 1999, 2000 and 2006). (Self Compiled)

Using forest fragmentation indices, the state of the fragmentation for the city was developed. While the forest fragmentation map produced valuable information, it was difficult to visualise the state of forest fragmentation for tracking the trends and to identify the areas where forest restoration might prove appropriate to reduce the impact of deforestation. The values of TFP and FC calculated (table 7) for the temporal data were plotted on a graph that specifies six conditions of forest fragmentation in figure 7. The TFP designations revealed that forest fragmentation become more severe as forest cover decreases from 100 percent cover towards 80 percent. Between 60 and 80 percent forest cover, the opportunity for re-introduction of forest to connect forest patches is greatest, and below 60 percent, forest patches become small and more fragmented. The FC regions were evenly split and designated high forest continuity (above 0.5) or low forest continuity (below 0.5). The amount of forest and continuity were high in 1973 and decreased substantially in 2006 (figure 6). Forest edges and perforations have less meaning where the amount of forest is low and percolation theory [14] identifies two critical values of Pf. In a completely forested landscape, for a completely random forest conversion on an infinite grid (and evaluating adjacency in cardinal directions), the residual forest is guaranteed to occur in identifiable patches when Pf falls below a critical value of about 0.4. Below that value, the nonforest pixels form a continuous path across the window. Conversely, as long as the residual forest is above a critical value of about 0.6, the forest pixels form such a path, and forest edges and perforations have more meaning. These values are used to define patch and transitional categories recognising that they are approximate because actual LC pattern is not random.

Table 7: Total forest proportion and forest continuity (Self Compiled)

State of forest fragmentation index   1973 1992 1999 2000 2006
TFP 0.71 0.48 0.47 0.46 0.29
FC 0.45 0.30 0.34 0.30 0.19


Figure 7: Six forest fragmentation conditions based on the values for Total Forest Proportion and Forest Continuity. (Self Compiled)

Forest fragmentation also depends on the scale of analysis (window size) and various consequences of increasing the window size are reported in [11]. The measurements are also sensitive to pixel size. Higher fragmentation occurs when finer grain maps are used over a fixed extent (window size). Finer grain maps identify more nonforest area where forest cover is dominant but not exclusive. The strict criterion for interior forest is more difficult to satisfy over larger areas. Although knowledge of the feasible parameter space is not critical, there are geometric constraints. For example, it is not possible to obtain a low value of Pff when Pf is large. Percolation theory applies strictly to maps resulting from random processes; hence, the critical values of Pf (0.4 and 0.6) are only approximate and may vary with actual pattern. As a practical matter, when Pf > 0.6, nonforest types generally appeared as “islands” on a forest background, and when Pf < 0.4, forests appeared as “islands” on a nonforest background.

4.  Urban Dynamics Characterisation : Input data consisted of multi date images having three classes: urban, non-urban, and water bodies obtained from the supervised classification algorithm. Each single date fragmentation map consisted of three fragmentation classes – interior, perforated and patch. The final step in the urban growth model was to use two date fragmented image to create change map (1973 to 1992, 1992 to 1999, 1999 to 2000 and 2000 to 2006). The model produced urban growth maps, which consisted of developed land, non-developed land, water and the three types of growth (Infill growth, Expansion and Outlying growth) as shown in figure 8 and statistics are given in table 8.


Figure 8: Urban growth map (A) 1973 to 1992, (B) 1992 to 1999, (C) 1999 to 2000, (D) 2000 to 2006. (Self Compiled)


Table 8: Changes in urban growth types from 1973 to 2006 (Self Compiled)

Urban Growth Type 1973 - 1992 1992 - 1999 1999 - 2000 2000 - 2006
Ha % Ha % Ha % Ha %
Developed 6889 1.0 3818 5.6 2876 4.2 32380 4.74
Non-developed 48529 71 44034 64.6 42610 63 37922 55.54
Water 905 1.3 692 1.0 1434 2.1 952 1.39
Infill 720 1.1 2146 3.2 3088 4.5 4232 6.20
Expansion 5720 8.4 10078 14.8 13110 19 16412 24.04
Outlying 11520 17 7453 10.9 5054 7.4 5526 8.09
Total 68082 100 68220 100 68172 100 68282 100

Outlying growth have been further classified as – isolated, linear branching and clustered growth (figure 9). There were no perfect linear growth (linear branching surrounded by non- developed land). In all the cases, linear branching were surrounded by one of the other growth classes such as expansion growth, infill growth etc. Combining several temporal urban growth maps created an informative picture of the urban dynamics. Ward wise, impermeable areas were plotted against the urban population density to understand the increase in population density with the increase in the impermeable area. Increased paved surface and concentrated human activities has led to urban heat island effect, as there is increase in temperature between urban pixels compared to its surroundings. Visualisation helps the Local decision makers to see the implications of earlier decisions and policies and evolve appropriate LU policies for sustainable planning and management.


Figure 9: Types of urban outlying growth highlighted in boxes – (A) isolated growth, (B) linear branching (road/corridor), (C) clustered growth. (Self Compiled)

E-mail    |    Sahyadri    |    GRASS    |    ENVIS    |    Energy    |    CES    |    CST    |    CiSTUP    |    IISc    |    E-mail