TY - GEN
T1 - ADVANCED CRACK DETECTION IN BUILDING STRUCTURES USING PIX2PIX AND U-NET ARCHITECTURES
AU - Ogun, Emmanuella
AU - Kim, Jinki
AU - Lee, Doyun
N1 - Publisher Copyright:
Copyright © 2025 by ASME.
PY - 2025/9/8
Y1 - 2025/9/8
N2 - Traditional crack detection methods, including manual inspection and classical computer vision techniques, often suffer from inconsistencies due to variations in lighting, surface texture, and environmental noise, resulting in unreliable and inefficient evaluations for large-scale inspections. This study presents a deep learning-based crack detection system designed to enhance segmentation accuracy and structural coherence in building inspections. The approach integrates a Pix2Pix conditional GAN framework with a U-Net generator, trained on the CrackSeg9k dataset comprising 9,255 crack images collected under diverse conditions. While U-Net provides strong baseline segmentation through its encoder-decoder structure and skip connections, its outputs can suffer from discontinuities and misclassification. To address these limitations, Pix2Pix adversarial learning refines the segmentation by enforcing continuity and edge sharpness through a discriminator-guided loss. The training pipeline includes a tailored augmentation strategy and was optimized using TensorFlow with GPU acceleration. Through staged experimentation, the best performance was observed at 100 epochs with a batch size of 8, achieving a mean IoU of 74.9% and F1-score of 71.1%. The proposed method demonstrates the viability of GAN-based refinement for crack segmentation and provides a robust foundation for integration with robotic inspection systems.
AB - Traditional crack detection methods, including manual inspection and classical computer vision techniques, often suffer from inconsistencies due to variations in lighting, surface texture, and environmental noise, resulting in unreliable and inefficient evaluations for large-scale inspections. This study presents a deep learning-based crack detection system designed to enhance segmentation accuracy and structural coherence in building inspections. The approach integrates a Pix2Pix conditional GAN framework with a U-Net generator, trained on the CrackSeg9k dataset comprising 9,255 crack images collected under diverse conditions. While U-Net provides strong baseline segmentation through its encoder-decoder structure and skip connections, its outputs can suffer from discontinuities and misclassification. To address these limitations, Pix2Pix adversarial learning refines the segmentation by enforcing continuity and edge sharpness through a discriminator-guided loss. The training pipeline includes a tailored augmentation strategy and was optimized using TensorFlow with GPU acceleration. Through staged experimentation, the best performance was observed at 100 epochs with a batch size of 8, achieving a mean IoU of 74.9% and F1-score of 71.1%. The proposed method demonstrates the viability of GAN-based refinement for crack segmentation and provides a robust foundation for integration with robotic inspection systems.
KW - Computer Vision
KW - Crack Detection
KW - Deep Learning
KW - Generative Adversarial Networks
KW - Structural Health Monitoring
UR - https://www.scopus.com/pages/publications/105023064912
U2 - 10.1115/SMASIS2025-166103
DO - 10.1115/SMASIS2025-166103
M3 - Conference article
AN - SCOPUS:105023064912
SN - 9780791889275
T3 - ASME 2025 Conference on Smart Materials, Adaptive Structures and Intelligent Systems
BT - Proceedings of ASME 2025 Conference on Smart Materials, Adaptive Structures and Intelligent Systems, SMASIS 2025
PB - American Society of Mechanical Engineers (ASME)
T2 - 18th Annual Conference of the Smart Materials, Adaptive Structures and Intelligent Systems, SMASIS 2025
Y2 - 8 September 2025 through 10 September 2025
ER -