Date of Graduation

5-2025

Document Type

Thesis

Degree Name

Master of Science in Geology (MS)

Degree Level

Graduate

Department

Geosciences

Advisor/Mentor

Aly, Mohamed H.

Committee Member

Befus, Kevin M.

Second Committee Member

Lamb, Andrew P.

Keywords

Geohazards; Geology; Landslides; Machine Learning; Science & Technology

Abstract

Landslides pose significant hazards to human safety, infrastructure, and the environment, particularly in regions of high elevation that experience extended periods of heavy rainfall. This research focuses on preparing and evaluating landslide susceptibility maps (LSMs) for the Blue Ridge Mountains, a portion of the Appalachian Mountains in western North Carolina, utilizing three machine learning algorithms: Logistic Regression, Random Forest, and Gradient Boosting Regression. Sixteen landslide conditioning factors, reflecting topographic, geological, environmental, and anthropogenic influences, were identified for model input. The landslide inventory database, comprising 7,350 locations, was randomly divided into training (80%) and testing (20%) sets. The performance of each model was evaluated and compared using confusion matrices. The Random Forest algorithm had the highest performance in predicting landslide locations, with a strong emphasis on elevation, slope, and proximity to roads as the most influential factors. The Logistic Regression model provided useful insights into the linear relationships between the conditioning factors and landslide susceptibility, performing well in areas where the relationship between predictors and landslide occurrence is more straightforward. In contrast, the Gradient Boosting model, known for its ability to capture complex nonlinear relationships, identified similar critical factors as the Random Forest model but with a higher sensitivity to variations in slope and distance to drainages. The differences in model outcomes can be attributed to the inherent characteristics of the algorithms: Logistic Regression’s simplicity and linearity, Random Forest’s ability to handle complex interactions through ensemble learning, and Gradient Boosting’s strength in optimizing weak learners for more nuanced pattern recognition. Overall, the LSMs produced by these models suggest that 20% of the study area is highly susceptible to landslides. These LSMs can serve as a valuable tool for land use planning, disaster preparedness, and risk mitigation efforts at a large scale.

Available for download on Thursday, June 18, 2026

Included in

Geology Commons

Share

COinS