TY - JOUR
T1 - Few-shot crack detection based on image processing and improved YOLOv5
AU - Hu, Na
AU - Yang, Jingjing
AU - Jin, Xiaochao
AU - Fan, Xueling
N1 - Publisher Copyright:
© 2022, Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2023/1
Y1 - 2023/1
N2 - The outstanding generalization ability of deep learning methods, such as You Only Look Once Version 5 (YOLOv5), is acquired by training with a large amount of labeled data. However, sufficient data is difficult to be obtained in industrial practices. There are some effective ways for the few-shot object detection such as data augmentation and optimization of the model. This research attempts a method for promoting detection performance of crack detector with only 80 images, that is, running two-type image processing initially, aiming at improving the quality of samples and expanding the quantity of data. All images are sharpened through a Laplacian filter before they are input into the network. Additionally, offline and online data augmentations are used simultaneously for increasing the diversity of the training set. The optimal online data augmentation strategy is obtained in fewer experiments by using a proposed search algorithm. After image processing, improved YOLOv5 is achieved via integrating Convolutional Block Attention Modules and transformer blocks into the YOLOv5l model which can refine the feature maps for prediction. The results show that Recall is improved from 0.560 to 0.742 by 39%, and F(0.8), a variant of the F1-score for crack detection, is improved by 28%. It can be seen that the preceding procedures successfully improve performance of the crack detector. Also, this research can provide a reference and guide for object detection tasks with small dataset.
AB - The outstanding generalization ability of deep learning methods, such as You Only Look Once Version 5 (YOLOv5), is acquired by training with a large amount of labeled data. However, sufficient data is difficult to be obtained in industrial practices. There are some effective ways for the few-shot object detection such as data augmentation and optimization of the model. This research attempts a method for promoting detection performance of crack detector with only 80 images, that is, running two-type image processing initially, aiming at improving the quality of samples and expanding the quantity of data. All images are sharpened through a Laplacian filter before they are input into the network. Additionally, offline and online data augmentations are used simultaneously for increasing the diversity of the training set. The optimal online data augmentation strategy is obtained in fewer experiments by using a proposed search algorithm. After image processing, improved YOLOv5 is achieved via integrating Convolutional Block Attention Modules and transformer blocks into the YOLOv5l model which can refine the feature maps for prediction. The results show that Recall is improved from 0.560 to 0.742 by 39%, and F(0.8), a variant of the F1-score for crack detection, is improved by 28%. It can be seen that the preceding procedures successfully improve performance of the crack detector. Also, this research can provide a reference and guide for object detection tasks with small dataset.
KW - Convolutional Block Attention Modules (CBAM)
KW - Crack detection
KW - Data augmentation
KW - Few shot
KW - Laplacian filter
KW - Transformer block
UR - https://www.scopus.com/pages/publications/85138683971
U2 - 10.1007/s13349-022-00632-x
DO - 10.1007/s13349-022-00632-x
M3 - 文章
AN - SCOPUS:85138683971
SN - 2190-5452
VL - 13
SP - 165
EP - 180
JO - Journal of Civil Structural Health Monitoring
JF - Journal of Civil Structural Health Monitoring
IS - 1
ER -