Machine learning based landslide assessment of the Belgrade metropolitan area: Pixel resolution effects and a cross-scaling concept
Abstract
Improvements of Machine Learning-based landslide prediction models can be made by optimizing scale, customizing training samples to provide sets with the best examples, feature selection, etc. Herein, a novel approach, named Cross-Scaling, is proposed that includes the mixing of training and testing set resolutions. Hypothetically, training on a coarser resolution dataset and testing the model on a finer resolution should help the algorithm to better generalize ambiguous examples of landslide classes and yield fewer over/underestimations in the model. This case study considers the City of Belgrade area for training and its south-eastern suburb for testing. The dataset is exceptionally rich with detailed geological, morphological and environmental data, so 24 landslide predictors were used for multi-class mapping: Class 0 – stable ground, Class 1 - dormant landslides, and Class 2 – active landslides. Two state-of-the-art algorithms were implemented: Support Vector Machines and Random Fo...rest. Additionally, our modelling included variants with an implemented feature selection by using the Information Gain and Correlation Feature Selection. All these variants were modelled across four resolutions - 25, 50, 100 and 200 m, whereby Cross-Scaling was implemented as follows: training on 50 and testing on 25, training on 100 and testing on 25, training on 100 and testing on 50, training on 200 and testing on 25, training on 200 and testing on 50, and finally, training on 200 and testing on 100 m resolution datasets. The results clearly show that Cross-Scaling improves the performance of the model, especially for Class 2, when compared to the performance of their non-Cross-Scaled counterparts; this thereby proves the initial hypothesis. Random Forest models tend to be less sensitive to scale and feature selection effects than the SVM. Class 1 remains the most difficult to discern, leaving some room for even further customization and adjustments. In conclusion, the Cross-Scaling technique is proposed as a method that could become a promising tool for training/testing protocols in landslide assessment.
Keywords:
novel / approach / pixel / terrain / mixed / unstableSource:
Engineering Geology, 2019, 256, 23-38Publisher:
- Elsevier
Funding / projects:
Note:
- This work was supported by the Ministry of Education, Science and Technological Development of the Republic of Serbia Project No TR 36009 and would not have been possible without support from the Urban Planning Institute of Belgrade and colleagues from the Department of Hydrogeology (University of Belgrade, Faculty of Mining and Geology) for providing valuable datasets.
DOI: 10.1016/j.enggeo.2019.05.007
ISSN: 0013-7952
WoS: 000472690300002
Scopus: 2-s2.0-85065440937
Institution/Community
GraFarTY - JOUR AU - Đurić, Uroš AU - Marjanović, Miloš AU - Radić, Zoran AU - Abolmasov, Biljana PY - 2019 UR - https://grafar.grf.bg.ac.rs/handle/123456789/1729 AB - Improvements of Machine Learning-based landslide prediction models can be made by optimizing scale, customizing training samples to provide sets with the best examples, feature selection, etc. Herein, a novel approach, named Cross-Scaling, is proposed that includes the mixing of training and testing set resolutions. Hypothetically, training on a coarser resolution dataset and testing the model on a finer resolution should help the algorithm to better generalize ambiguous examples of landslide classes and yield fewer over/underestimations in the model. This case study considers the City of Belgrade area for training and its south-eastern suburb for testing. The dataset is exceptionally rich with detailed geological, morphological and environmental data, so 24 landslide predictors were used for multi-class mapping: Class 0 – stable ground, Class 1 - dormant landslides, and Class 2 – active landslides. Two state-of-the-art algorithms were implemented: Support Vector Machines and Random Forest. Additionally, our modelling included variants with an implemented feature selection by using the Information Gain and Correlation Feature Selection. All these variants were modelled across four resolutions - 25, 50, 100 and 200 m, whereby Cross-Scaling was implemented as follows: training on 50 and testing on 25, training on 100 and testing on 25, training on 100 and testing on 50, training on 200 and testing on 25, training on 200 and testing on 50, and finally, training on 200 and testing on 100 m resolution datasets. The results clearly show that Cross-Scaling improves the performance of the model, especially for Class 2, when compared to the performance of their non-Cross-Scaled counterparts; this thereby proves the initial hypothesis. Random Forest models tend to be less sensitive to scale and feature selection effects than the SVM. Class 1 remains the most difficult to discern, leaving some room for even further customization and adjustments. In conclusion, the Cross-Scaling technique is proposed as a method that could become a promising tool for training/testing protocols in landslide assessment. PB - Elsevier T2 - Engineering Geology T1 - Machine learning based landslide assessment of the Belgrade metropolitan area: Pixel resolution effects and a cross-scaling concept EP - 38 SP - 23 VL - 256 DO - 10.1016/j.enggeo.2019.05.007 ER -
@article{ author = "Đurić, Uroš and Marjanović, Miloš and Radić, Zoran and Abolmasov, Biljana", year = "2019", abstract = "Improvements of Machine Learning-based landslide prediction models can be made by optimizing scale, customizing training samples to provide sets with the best examples, feature selection, etc. Herein, a novel approach, named Cross-Scaling, is proposed that includes the mixing of training and testing set resolutions. Hypothetically, training on a coarser resolution dataset and testing the model on a finer resolution should help the algorithm to better generalize ambiguous examples of landslide classes and yield fewer over/underestimations in the model. This case study considers the City of Belgrade area for training and its south-eastern suburb for testing. The dataset is exceptionally rich with detailed geological, morphological and environmental data, so 24 landslide predictors were used for multi-class mapping: Class 0 – stable ground, Class 1 - dormant landslides, and Class 2 – active landslides. Two state-of-the-art algorithms were implemented: Support Vector Machines and Random Forest. Additionally, our modelling included variants with an implemented feature selection by using the Information Gain and Correlation Feature Selection. All these variants were modelled across four resolutions - 25, 50, 100 and 200 m, whereby Cross-Scaling was implemented as follows: training on 50 and testing on 25, training on 100 and testing on 25, training on 100 and testing on 50, training on 200 and testing on 25, training on 200 and testing on 50, and finally, training on 200 and testing on 100 m resolution datasets. The results clearly show that Cross-Scaling improves the performance of the model, especially for Class 2, when compared to the performance of their non-Cross-Scaled counterparts; this thereby proves the initial hypothesis. Random Forest models tend to be less sensitive to scale and feature selection effects than the SVM. Class 1 remains the most difficult to discern, leaving some room for even further customization and adjustments. In conclusion, the Cross-Scaling technique is proposed as a method that could become a promising tool for training/testing protocols in landslide assessment.", publisher = "Elsevier", journal = "Engineering Geology", title = "Machine learning based landslide assessment of the Belgrade metropolitan area: Pixel resolution effects and a cross-scaling concept", pages = "38-23", volume = "256", doi = "10.1016/j.enggeo.2019.05.007" }
Đurić, U., Marjanović, M., Radić, Z.,& Abolmasov, B.. (2019). Machine learning based landslide assessment of the Belgrade metropolitan area: Pixel resolution effects and a cross-scaling concept. in Engineering Geology Elsevier., 256, 23-38. https://doi.org/10.1016/j.enggeo.2019.05.007
Đurić U, Marjanović M, Radić Z, Abolmasov B. Machine learning based landslide assessment of the Belgrade metropolitan area: Pixel resolution effects and a cross-scaling concept. in Engineering Geology. 2019;256:23-38. doi:10.1016/j.enggeo.2019.05.007 .
Đurić, Uroš, Marjanović, Miloš, Radić, Zoran, Abolmasov, Biljana, "Machine learning based landslide assessment of the Belgrade metropolitan area: Pixel resolution effects and a cross-scaling concept" in Engineering Geology, 256 (2019):23-38, https://doi.org/10.1016/j.enggeo.2019.05.007 . .