Small Target Detection in Video Surveillance Based on Improved YOLOv7

DOI: 10.15888/j.cnki.csa.009523

Computer Systems & Applications, 2024, Vol. 33, No. 7, pp. 52-62

CSTR: 32024.14.csa.009523
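Authors:
XIA Xiang
School of Information Science and Technology, University of Science and Technology of China, Hefei 230026, China
ZHU Ming
School of Information Science and Technology, University of Science and Technology of China, Hefei 230026, China

Fund Project: Science and Technology Innovation Special Zone Program (20-163-14-LZ-001-004-01)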

Abstract:

Small target detection is one of the most challenging tasks in object detection, and small targets are common in everyday scenes. In video surveillance, for example, a pedestrian's face about 20 m from the camera can be regarded as a small target. Because faces may occlude one another and are easily affected by noise, weather, and lighting conditions, existing detection models perform worse on such small targets than on medium and large ones. To address these problems, this study proposes an improved YOLOv7 model. A high-resolution detection head is added, and the backbone network is rebuilt based on GhostNetV2. The PANet structure is replaced with one based on BiFPN and the SA attention module to strengthen multi-scale feature fusion. The original CIoU loss function is improved by incorporating the Wasserstein distance, which reduces the sensitivity of small targets to positional offsets of the anchor box. Comparative experiments are conducted on the public VisDrone2019 dataset and a self-built video surveillance dataset. The results show that the proposed method raises mAP to 50.1% on VisDrone2019 and outperforms existing methods by 1.6 percentage points on the self-built video surveillance dataset, effectively improving small target detection while achieving good real-time performance on a GTX 1080 Ti.

Keywords: small target detection; attention mechanism; feature fusion; loss function
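The abstract states that the CIoU loss is combined with a Wasserstein distance to make small targets less sensitive to box offsets, but it does not give the formula. The sketch below is only an illustration of the general idea, not the authors' loss: each box is treated as a 2D Gaussian, the distance between two such Gaussians is mapped to a similarity in (0, 1], and that similarity is blended with an overlap term. Plain IoU stands in for CIoU to keep the example short. The function names, the normalizing constant `c`, and the blending weight `alpha` are all assumptions introduced here.

```python
import math


def iou_xywh(a, b):
    """Plain IoU of two boxes given as (cx, cy, w, h)."""
    ax1, ax2 = a[0] - a[2] / 2, a[0] + a[2] / 2
    ay1, ay2 = a[1] - a[3] / 2, a[1] + a[3] / 2
    bx1, bx2 = b[0] - b[2] / 2, b[0] + b[2] / 2
    by1, by2 = b[1] - b[3] / 2, b[1] + b[3] / 2
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0


def wasserstein_similarity(a, b, c=12.8):
    """Similarity in (0, 1] from the 2-Wasserstein distance between boxes.

    Each (cx, cy, w, h) box is modeled as a Gaussian with mean (cx, cy)
    and standard deviations (w / 2, h / 2). For such axis-aligned
    Gaussians the squared distance has the closed form below.
    `c` is an assumed, dataset-dependent scale (hypothetical value).
    """
    d2 = (
        (a[0] - b[0]) ** 2
        + (a[1] - b[1]) ** 2
        + ((a[2] - b[2]) / 2) ** 2
        + ((a[3] - b[3]) / 2) ** 2
    )
    return math.exp(-math.sqrt(d2) / c)


def blended_loss(pred, target, alpha=0.5, c=12.8):
    """Hypothetical blend: alpha * (1 - IoU) + (1 - alpha) * (1 - similarity)."""
    overlap_term = 1.0 - iou_xywh(pred, target)
    distance_term = 1.0 - wasserstein_similarity(pred, target, c)
    return alpha * overlap_term + (1.0 - alpha) * distance_term


if __name__ == "__main__":
    # Same 3-pixel horizontal offset applied to a small and a large box.
    pairs = {
        "small": ((13, 10, 6, 6), (10, 10, 6, 6)),
        "large": ((103, 100, 60, 60), (100, 100, 60, 60)),
    }
    for name, (pred, gt) in pairs.items():
        print(name, iou_xywh(pred, gt), wasserstein_similarity(pred, gt))
```

With the same 3-pixel shift, IoU falls to 1/3 for the 6×6 box but stays at about 0.90 for the 60×60 box, while the Wasserstein-based similarity is about 0.79 in both cases because it depends only on the offset and size difference. This is the sense in which a distance-based term is less sensitive to small positional errors on small targets.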

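Cite this article

XIA Xiang, ZHU Ming. Small Target Detection in Video Surveillance Based on Improved YOLOv7. Computer Systems & Applications, 2024, 33(7): 52-62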

History

Received: 2023-12-14
Last revised: 2024-01-17
Published online: 2024-05-31
