image: Flowchart of the operation of this experiment.
Credit: Journal of Geographical Sciences
Understanding how cities grow is vital for shaping sustainable urban futures—but mapping the true extent of urban expansion remains a formidable technical hurdle. A new global study takes on this challenge by evaluating the performance of various machine learning and deep learning algorithms for extracting urban footprints (UF) from high-resolution satellite images. Analyzing eight diverse cities, the research reveals that while traditional classification methods are fast and accessible, they falter in complex urban settings. In contrast, deep learning and advanced pre-trained models like Segment Anything Model (SAM) offer greater accuracy, albeit with higher computational costs. This comparative analysis provides a crucial guide for selecting the right algorithm to match the unique structure of each urban landscape.
As urban centers become denser and more intricate, traditional tools for land-use monitoring are reaching their limits. The concept of the urban footprint has evolved to include not just concrete structures, but also smaller green spaces and semi-natural features that reflect the livability and environmental quality of cities. Yet, existing extraction techniques often struggle with challenges like shadows from tall buildings, spectral similarities between natural and built environments, and irregular development patterns. The absence of a unified standard for UF definitions further complicates the picture, limiting the comparability of urban studies. Given these issues, there is an urgent need to reassess how UF are detected—especially across varied and evolving urban forms.
In a study (DOI: 10.1007/s11442-025-2339-y) published in March 2025 in the Journal of Geographical Sciences, researchers from the University of Aberdeen put several leading UF extraction algorithms to the test. Drawing on PlanetScope’s high-resolution satellite imagery, they evaluated performance across eight cities worldwide—ranging from compact historical hubs to rapidly expanding informal settlements. The team examined traditional pixel-based and object-based classification methods, two deep learning models, and the emerging Segment Anything (SAM) model. Their aim was to measure how well each approach handles the trade-offs between precision, scalability, and operational efficiency in different urban contexts.
The results showed a nuanced performance landscape shaped by urban form and algorithm design. Pixel-based methods such as K-means and Support Vector Machine (SVM) performed adequately in cities with simple, well-defined boundaries but stumbled in areas with high-rise buildings or integrated green infrastructure. Object-based methods improved segmentation through texture analysis but were sensitive to classifier selection, with Random Forest and SVM outperforming others inconsistently. Deep learning models—particularly U-Net and DeeplabV3—achieved the highest overall accuracy, exceeding 90% in most cases. U-Net excelled at capturing fine-grained urban details, while DeeplabV3 offered smoother segmentation in large, noisy urban areas. However, both demanded extensive training and computational power. Meanwhile, the SAM model provided an efficient, user-friendly alternative, outperforming traditional methods and approaching deep learning accuracy in cities with moderate complexity. Yet, SAM’s performance declined in fragmented or irregular urban environments. These findings underscore the need for algorithm selection tailored to specific urban scenarios—and highlight the promise of hybrid approaches that blend traditional and modern techniques.
“Urban morphology varies tremendously from one city to another, so no single model fits all,” said Baoling Gui, the study’s lead author and a PhD candidate in remote sensing and urban geography. “Our findings show that algorithm performance hinges on factors like city layout, density, and the integration of green spaces. While deep learning models deliver exceptional precision, they come at a cost. SAM offers a practical balance between ease of use and accuracy. Understanding these trade-offs is key to designing scalable urban mapping strategies for global applications.”
The implications of this research extend far beyond academic benchmarking. City planners, policymakers, and geospatial analysts can use these insights to build smarter, more responsive monitoring systems for tracking urban growth and managing environmental change. As global cities navigate the challenges of sustainability, resilience, and climate adaptation, accurate and efficient UF mapping becomes a foundational tool. Moreover, the integration of object-based methods into deep learning frameworks could open the door to next-generation hybrid models—ones that offer both high performance and practical flexibility across diverse urban settings.
###
References
DOI
Original Source URL
https://doi.org/10.1007/s11442-025-2339-y
About Journal of Geographical Sciences
Journal of Geographical Sciences is an international and multidisciplinary peer-reviewed journal focusing on human-nature relationships. It publishes papers on physical geography, natural resources, environmental sciences, geographic information, remote sensing and cartography. Manuscripts come from different parts of the world.
Journal
Journal of Geographical Sciences
Subject of Research
Not applicable
Article Title
Comparative analysis of different machine learning algorithms for urban footprint extraction in diverse urban contexts using high-resolution remote sensing imagery
Article Publication Date
8-Mar-2025
COI Statement
The authors declare that they have no competing interests