Road Damage Detection with Improved YOLOv8
Author:
  • Article
  • | |
  • Metrics
  • |
  • Reference [30]
  • |
  • Related [16]
  • | | |
  • Comments
    Abstract:

    This study proposes an algorithm for road damage detection based on an improved YOLOv8 to address challenges in road damage detection, including multi-scale targets, complex target structures, uneven sample distribution, and the impact of hard and easy samples on bounding box regression. The algorithm introduces dynamic snake convolution (DSConv) to replace some of the Conv modules in the original faster implementation of CSP bottleneck with 2 convolutions (C2f) module, aiming to adaptively focus on small and intricate local features, thereby enhancing the perception of geometric structures. By incorporating an efficient multi-scale attention (EMA) module before each detection head, the algorithm achieves cross-dimensional interaction and captures pixel-level relationships, improving its generalization capability for complex global features. Additionally, an extra small object detection layer is added to enhance the precision of small object detection. Finally, a strategy termed Flex-PIoUv2 is proposed, which alleviates sample distribution imbalance and anchor box inflation through linear interval mapping and size-adaptive penalty factors. Experimental results demonstrate that the improved model increases the F1 score, mAP50, and mAP50-95 on the RDD2022 dataset by 1.5%, 2.1%, and 1.2%, respectively. Additionally, results on the GRDDC2020 and China road damage datasets validate the strong generalization of the proposed algorithm.

    Reference
    [1] Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. Columbus: IEEE, 2013. 580–587.
    [2] Ren SQ, He KM, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137–1149.
    [3] Redmon J, Divvala SK, Girshick RB, et al. You only look once: Unified, real-time object detection. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas: IEEE, 2015. 779–788.
    [4] Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector. Proceedings of the 14th European Conference on Computer Vision. Amsterdam: Springer, 2015. 21–37.
    [5] Zhang Z, Xu Y, Yang J, et al. A survey of sparse representation: Algorithms and applications. IEEE Access, 2015, 3: 490–530.
    [6] Tarabalka Y, Chanussot J, Benediktsson JA. Segmentation and classification of hyperspectral images using watershed transformation. Pattern Recognition, 2010, 43(7): 2367–2379.
    [7] Zhang L, Yang F, Zhang YD, et al. Road crack detection using deep convolutional neural network. Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP). Phoenix: IEEE, 2016. 3708–3712.
    [8] Pereira V, Tamura S, Hayamizu S, et al. A deep learning-based approach for road pothole detection in timor leste. Proceedings of the 2018 IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI). IEEE, 2018. 279–284.
    [9] Feng H, Xu GS, Guo YH. Multi-scale classification network for road crack detection. IET Intelligent Transport Systems, 2019, 13(2): 398–405.
    [10] Nie MX, Wang K. Pavement distress detection based on transfer learning. Proceedings of the 5th International Conference on Systems and Informatics (ICSAI). Nanjing: IEEE, 2018. 435–439.
    [11] Du YC, Pan N, Xu ZH, et al. Pavement distress detection and classification based on YOLO network. International Journal of Pavement Engineering, 2021, 22(13): 1659–1672.
    [12] Katsaliros A, Panagos II, Sfikas G, et al. Road crack detection using quaternion neural networks. Proceedings of the 14th IEEE Image, Video, and Multidimensional Signal Processing Workshop (IVMSP). Nafplio: IEEE, 2022. 1–5.
    [13] Parcollet T, Morchid M, Linarès G. A survey of quaternion neural networks. Artificial Intelligence Review, 2020, 53(4): 2957–2982.
    [14] 李松, 史涛, 井方科. 改进YOLOv8的道路损伤检测算法. 计算机工程与应用, 2023, 59(23): 165–174.
    [15] 魏陈浩, 杨睿, 刘振丙, 等. 具有双层路由注意力的YOLOv8道路场景目标检测方法. 图学学报, 2023, 44(6): 1104–1111.
    [16] Dai JF, Qi HZ, Xiong YW, et al. Deformable convolutional networks. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV). Venice: IEEE, 2017. 764–773.
    [17] 武兵, 田莹. 基于注意力机制的多尺度道路损伤检测算法研究. 图学学报, 2024, 45(4): 770–778.
    [18] 胥铁峰, 黄河, 张红民, 等. 基于改进YOLOv8的轻量化道路病害检测方法. 计算机工程与应用, 2024, 60(14): 175–186.
    [19] 王海群, 王炳楠, 葛超. 重参数化YOLOv8路面病害检测算法. 计算机工程与应用, 2024, 60(5): 191–199.
    [20] Zeng JY, Zhong H. YOLOv8-PD: An improved road damage detection algorithm based on YOLOv8n model. Scientific Reports, 2024, 14(1): 12052.
    [21] Qi YL, He YT, Qi XM, et al. Dynamic snake convolution based on topological geometric constraints for tubular structure segmentation. Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision (ICCV). Paris: IEEE, 2023. 6047–6056.
    [22] Ouyang DL, He S, Zhang GZ, et al. Efficient multi-scale attention module with cross-spatial learning. Proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes: IEEE, 2023. 1–5.
    [23] Hu J, Shen L, Sun G, et al. Squeeze-and-excitation networks. Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2017. 7132–7141.
    [24] Yu J, Jiang YN, Wang ZY, et al. UnitBox: An advanced object detection network. Proceedings of the 24th ACM International Conference on Multimedia. Amsterdam: Association for Computing Machinery, 2016. 516–520.
    [25] Rezatofighi SH, Tsoi N, Gwak J, et al. Generalized intersection over union: A metric and a loss for bounding box regression. Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach: IEEE, 2019. 658–666.
    [26] Zheng Z, Wang P, Liu W, et al. Distance-IoU loss: Faster and better learning for bounding box regression. Proceedings of the 38th AAAI Conference on Artificial Intelligence. Vancouver: AAAI Press, 2019. 12993–13000.
    [27] Zhang H, Zhang SJ. Focaler-IoU: More focused intersection over union loss. arXiv:2401.10525, 2024.
    [28] Liu C, Wang KG, Li Q, et al. Powerful-IoU: More straightforward and faster bounding box regression loss with a nonmonotonic focusing mechanism. Neural Networks, 2024, 170: 276–284.
    [29] Arya D, Maeda H, Ghosh SK, et al. RDD2022: A multi-national image dataset for automatic road damage detection. arXiv:2209.08538, 2022.
    [30] Arya D, Maeda H, Ghosh SK, et al. RDD2020: An annotated image dataset for automatic road damage detection using deep learning. Data in Brief, 2021, 36: 107133.
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

王瀚毅,李春彪,宋衡.改进YOLOv8的道路损伤检测.计算机系统应用,2025,34(1):179-189

Copy
Share
Article Metrics
  • Abstract:640
  • PDF: 528
  • HTML: 138
  • Cited by: 0
History
  • Received:June 12,2024
  • Revised:July 18,2024
  • Online: November 15,2024
Article QR Code
You are the first992104Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063