Spatial Data Mining and Modeling for Visualisation of Rapid Urbanisation

Abstract

Introduction

Methods

Study area and Data

Results and Discussions

Conclusion

Acknowledgement

References

PDF

Home

Results and discussions

1. Classification of IRS LISS-III (2006) data : IRS LISS-III data acquired on 28^th March, 2006 having 3 spectral bands (Green, Red and NIR) with 23.5 m spatial resolution were georegistered with known GCPs and geometrically corrected and resampled to 25 m (1360 x 1400 pixels) which helped in pixel level comparison of abundance maps obtained by unmixing MODIS data (of 2006) with LISS-III classified map. The class spectral characteristics for four LC categories (buildings/urban, vegetation, water bodies and others) using LISS-III MSS bands 2, 3 and 4 were obtained from the training pixels spectra to assess their inter-class separability and the images were classified using Gaussian Maximum Likelihood classifier with training data uniformly distributed over the study area collected with pre calibrated GPS (figure 2).

Figure 2: LC classification using LISS-III MSS (2006). (Self Compiled)

This was validated with the representative field data (training sets collected covering the entire city and validation covering ~ 10% of the study area) and also using Google Earth image (http://www.earth.google.com). LC statistics, producer’s accuracy, user’s accuracy and overall accuracy computed are listed in table 2. The urban pixels were extracted from the classified image and were used as a benchmark to validate the output obtained from OSP.

Table 2: LC details using LISS -III MSS (Self Compiled)

Class	Area (Ha)	Area (%)	P* (%)	U* (%)	O* (%)
Builtup	29535	43.2	85.2	84.5	84.6
Vegetation	19696	28.8	83.5	88.1
Water bodies	1073	01.6	90.7	80.2
Others	18017	26.4	78.3	77.7

P* - Producer’s Accuracy, U* - User’s Accuracy, O* - Overall Accuracy

2. Unmixing of MODIS data using OSP : MODIS data (of 2006) having 7 bands with band 1 and 2 at 250 m and band 3 to 7 at 500 m spatial resolution were georegistered with GCPs, geometrically corrected and bands 3 to 7 were resampled to 250 m (136 x 140 pixels) so that all 7 bands are at same spatial resolution. Unmixing of MODIS data (250 m) were done to obtain abundance map of the impervious surface. Visual inspection as well as accuracy assessment of abundance image showed that OSP algorithm maps impervious surface spatially and proportionally similar to supervised classified image of LISS-III (of 2006). The proportions of urban pixels in the image (figure 3) range from 0 to 1, with 0 indicating absence and increasing value showing higher abundance.

Figure 3: LC abundance maps (of 2006) for builtup, vegetation, water bodies and others. (Self Compiled)

Accuracy assessment at pixel level: A pixel of MODIS (250 m) corresponds to a kernel of 10 x 10 pixels of LISS-III spatially (resampled to 25 m). The builtup abundance map was validated pixel by pixel with the LISS-III builtup map. The proportion of builtup percentage obtained in these 136 x 140 cells were compared using three measures of performance – Root Mean Square (RMS) error (0.160), Bivariate Distribution Functions (BDFs) between LISS-III classified (real proportion) and MODIS based abundance map (predicted proportions), and Wilcoxon signed rank test. The bivariate plot (figure 4) was helpful in visualising the accuracy of prediction by mixture models. While most of the points fall between the two 20 % difference lines, some points are outside the lines indicating that there could be higher or lower estimation than the actual values. The Pearson's product-moment correlation was 0.84 (p-value < 2.2e^-16). 10 % of the difference pixels between the LISS-III proportion and the MODIS abundance were randomly selected at the same spatial locations in the two images by random number generation. These difference data values failed when subjected to normality test (table 3).

Figure 4: BDF of LISS-III classified and MODIS based abundance map. (Self Compiled)

Table 3: Normality tests for the difference in LISS-III and MODIS proportions (Self Compiled)

Normality test	p-value
Shapiro-Wilk	2.2e^-16
Anderson-Darling	2.2e^-16
Kolmogorov-Smirnov	2.2e^-16
Cramer-von Mises	Infinity
Lilliefors	2.2e^-16

The data points corresponding to builtup proportion in a MODIS pixel and corresponding LISS-III pixels were then subjected to box-plot and histogram analysis. The distribution was symmetrical, hence Wilcoxon signed rank test with continuity correction was performed (r = 0.66, p-value < 2.2e^-16), which showed less evidence against null hypothesis – “true location of median of the difference between two data sets is equal to 0”.

The abundance image was further analysed with a minimum detectable threshold of 10 % and in increments of 10 % (i.e., 20 to 30 %, 30-40 %, …., 90 to 100 %) as shown in figure 5. Objects representing different brightness classes of impervious surface are typically selected to be mapped based on the tonal variations in the builtup landscape, which constitute spectral classes. The results hold great promise for improved accessibility and accuracy of this important urban indictor at the local and city level. Table 4 depicts the percentage impervious surface from abundance map.

Figure 5: Percent impervious surface layer for the Bangalore city. (Self Compiled)

Table 4: Percent Impervious surface from abundance map (Self Compiled)

Percent Impervious surface	Area
Percent Impervious surface	Ha	%
0 – 10 %	7019.57	9.97
11 – 20 %	5853.89	8.31
21 – 30 %	7410.17	10.52
31 – 40 %	10126.27	14.38
41 – 50 %	10199.57	14.48
51 – 60 %	9149.63	12.99
61 – 70 %	9863.94	14.01
71 – 80 %	7050.32	10.01
81 – 90 %	3485.47	4.95
91 – 100 %	274.69	0.39

3. Forest fragmentation indices : Supervised classification was performed on temporal datasets for Bangalore city with the Bayesian classifier using the open source programs (i.gensig, i.class and i.maxlik) of Geographic Resources Analysis Support System (http://wgbis.ces.iisc.ac.in/grass). Accuracy assessment was done using field knowledge, and visual interpretation which showed an overall accuracy of 72 %, 75 %, 71 %, 77 % 85 % and 55 % for the year 1973, 1992, 1999, 2000, 2006 and 2007. The class statistics is given in table 5.

Table 5: Greater Bangalore land cover statistics (Self Compiled)

Class Year		Built up	Vegetation	Water Bodies	Others
1973	Ha	5448	46639	2324	13903
1973	%	7.97	68.27	3.40	20.35
1992	Ha	18650	31579	1790	16303
1992	%	27.30	46.22	2.60	23.86
1999	Ha	23532	31421	1574	11794
1999	%	34.44	45.99	2.30	17.26
2000	Ha	24163	31272	1542	11346
2000	%	35.37	45.77	2.26	16.61
2006	Ha	29535	19696	1073	18017
2006	%	43.23	28.83	1.57	26.37
2007	Ha	30876	17298	1005	19143
2007	%	45.19	25.32	1.47	28.01

From the classified raster data, urban class was extracted and converted to vector representation for computation of precise area in hectares indicating 466 % increase in built up area from 1973 to 2007 as evident from temporal analysis leading to a sharp decline of 61% area in water bodies mostly attributing to intense urbanisation process. Rapid urbanisation and sprawl has many detrimental effects including the loss of valuable agricultural and eco-sensitive (e.g. wetlands, forests) lands, enhanced energy consumption and greenhouse gas emissions from increasing private vehicle use and also from modern glazed architectures. Vegetation has decreased by 63 % from 1973 to 2007. Using the results from classification, forest fragmentation indices were computed as shown in figure 6 and the statistics are presented in table 6.

Applying forest fragmentation indices to a time series LC data provided a quantitative assessment of the pattern of forest fragmentation at each date and a means for tracking trends in forest fragmentation. Patch forest which was only 2.45 % in 1973 has increased up to 9.6 % in 2006 and interior forest has decreased from 54 % in 1972 to 27.7 % in 2006.

Table 6: Composition of forest fragmentation (Self Compiled)

Fragmentation Categories	1973		1992		1999		2000		2006
Fragmentation Categories	Ha	%	Ha	%	Ha	%	Ha	%	Ha	%
Patch	1120	2.45	1950	6.34	1788.6	5.91	1944	6.47	1810	9.57
Transitional	3789	8.28	4259	13.84	3816	12.62	3999	13.31	3956	20.92
Edge	14717	32.16	10709	34.81	9516	31.46	10593	35.25	6050	31.99
Perforated	1453	3.18	1447	4.70	1410	4.66	1219	4.06	1859	9.83
Interior	24675	53.93	12401	40.31	13715	45.35	12291	40.91	5236	27.69
Total	45756	100	30766	100	30244	100	30047	100	18909	100

Figure 6: Forest fragmentation map (1972, 1992, 1999, 2000 and 2006). (Self Compiled)

Using forest fragmentation indices, the state of the fragmentation for the city was developed. While the forest fragmentation map produced valuable information, it was difficult to visualise the state of forest fragmentation for tracking the trends and to identify the areas where forest restoration might prove appropriate to reduce the impact of deforestation. The values of TFP and FC calculated (table 7) for the temporal data were plotted on a graph that specifies six conditions of forest fragmentation in figure 7. The TFP designations revealed that forest fragmentation become more severe as forest cover decreases from 100 percent cover towards 80 percent. Between 60 and 80 percent forest cover, the opportunity for re-introduction of forest to connect forest patches is greatest, and below 60 percent, forest patches become small and more fragmented. The FC regions were evenly split and designated high forest continuity (above 0.5) or low forest continuity (below 0.5). The amount of forest and continuity were high in 1973 and decreased substantially in 2006 (figure 6). Forest edges and perforations have less meaning where the amount of forest is low and percolation theory [14] identifies two critical values of Pf. In a completely forested landscape, for a completely random forest conversion on an infinite grid (and evaluating adjacency in cardinal directions), the residual forest is guaranteed to occur in identifiable patches when Pf falls below a critical value of about 0.4. Below that value, the nonforest pixels form a continuous path across the window. Conversely, as long as the residual forest is above a critical value of about 0.6, the forest pixels form such a path, and forest edges and perforations have more meaning. These values are used to define patch and transitional categories recognising that they are approximate because actual LC pattern is not random.

Table 7: Total forest proportion and forest continuity (Self Compiled)

State of forest fragmentation index	1973	1992	1999	2000	2006
TFP	0.71	0.48	0.47	0.46	0.29
FC	0.45	0.30	0.34	0.30	0.19

Figure 7: Six forest fragmentation conditions based on the values for Total Forest Proportion and Forest Continuity. (Self Compiled)

Forest fragmentation also depends on the scale of analysis (window size) and various consequences of increasing the window size are reported in [11]. The measurements are also sensitive to pixel size. Higher fragmentation occurs when finer grain maps are used over a fixed extent (window size). Finer grain maps identify more nonforest area where forest cover is dominant but not exclusive. The strict criterion for interior forest is more difficult to satisfy over larger areas. Although knowledge of the feasible parameter space is not critical, there are geometric constraints. For example, it is not possible to obtain a low value of Pff when Pf is large. Percolation theory applies strictly to maps resulting from random processes; hence, the critical values of Pf (0.4 and 0.6) are only approximate and may vary with actual pattern. As a practical matter, when Pf > 0.6, nonforest types generally appeared as “islands” on a forest background, and when Pf < 0.4, forests appeared as “islands” on a nonforest background.

4. Urban Dynamics Characterisation : Input data consisted of multi date images having three classes: urban, non-urban, and water bodies obtained from the supervised classification algorithm. Each single date fragmentation map consisted of three fragmentation classes – interior, perforated and patch. The final step in the urban growth model was to use two date fragmented image to create change map (1973 to 1992, 1992 to 1999, 1999 to 2000 and 2000 to 2006). The model produced urban growth maps, which consisted of developed land, non-developed land, water and the three types of growth (Infill growth, Expansion and Outlying growth) as shown in figure 8 and statistics are given in table 8.

Table 8: Changes in urban growth types from 1973 to 2006 (Self Compiled)

Urban Growth Type	1973 - 1992		1992 - 1999		1999 - 2000		2000 - 2006
Urban Growth Type	Ha	%	Ha	%	Ha	%	Ha	%
Developed	6889	1.0	3818	5.6	2876	4.2	32380	4.74
Non-developed	48529	71	44034	64.6	42610	63	37922	55.54
Water	905	1.3	692	1.0	1434	2.1	952	1.39
Infill	720	1.1	2146	3.2	3088	4.5	4232	6.20
Expansion	5720	8.4	10078	14.8	13110	19	16412	24.04
Outlying	11520	17	7453	10.9	5054	7.4	5526	8.09
Total	68082	100	68220	100	68172	100	68282	100

Outlying growth have been further classified as – isolated, linear branching and clustered growth (figure 9). There were no perfect linear growth (linear branching surrounded by non- developed land). In all the cases, linear branching were surrounded by one of the other growth classes such as expansion growth, infill growth etc. Combining several temporal urban growth maps created an informative picture of the urban dynamics. Ward wise, impermeable areas were plotted against the urban population density to understand the increase in population density with the increase in the impermeable area. Increased paved surface and concentrated human activities has led to urban heat island effect, as there is increase in temperature between urban pixels compared to its surroundings. Visualisation helps the Local decision makers to see the implications of earlier decisions and policies and evolve appropriate LU policies for sustainable planning and management.

Figure 9: Types of urban outlying growth highlighted in boxes – (A) isolated growth, (B) linear branching (road/corridor), (C) clustered growth. (Self Compiled)