基于历史信息及改进SimSiam的道路目标检测

doi:10.15888/j.cnki.csa.009520

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年4月24日 2:52 星期四

首页 > 过刊浏览>2024年第33卷第6期 >192-200. DOI:10.15888/j.cnki.csa.009520

PDF HTML阅读 XML下载导出引用引用提醒

基于历史信息及改进SimSiam的道路目标检测
DOI:
                        10.15888/j.cnki.csa.009520
                    
CSTR:
                        32024.14.csa.009520
                    
作者:
                        姜世豪姜世豪
中国科学技术大学 信息科学技术学院, 合肥 230026
在期刊界中查找
在百度中查找
在本站中查找
朱明朱明
中国科学技术大学 信息科学技术学院, 合肥 230026
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:科技创新特区计划 (20-163-14-LZ-001-004-01)

Road Object Detection Based on Historical Information and Improved SimSiam

Author:

JIANG Shi-Hao
JIANG Shi-Hao
School of Information Science and Technology, University of Science and Technology of China, Hefei 230026, China
在期刊界中查找
在百度中查找
在本站中查找
ZHU Ming
ZHU Ming
School of Information Science and Technology, University of Science and Technology of China, Hefei 230026, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

视觉导航旨在通过环境中的视觉信息提供导航依据, 其中关键任务之一就是目标检测. 传统的目标检测方法需要大量的标注, 且只关注图像本身, 并未充分利用视觉导航任务中的数据相似性. 针对以上问题, 本文提出一种基于历史图像信息的自监督训练任务. 该方法聚合同一位置的多时刻图像, 通过信息熵区分前景与背景, 将图像增强后传入SimSiam自监督范式进行训练. 并改进SimSiam投影层和预测层中的MLP为卷积注意力模块和卷积模块, 改进损失函数为多维向量间损失, 以提取图像中的多维特征. 最后, 将自监督预训练所得模型用于下游任务的训练. 实验表明, 在处理后的nuScenes数据集上, 本文提出的方法有效提高了下游分类及检测任务的精度, 在下游分类任务上Top5准确率达到66.95%, 检测任务上mAP达到40.02%.

关键词:历史信息;自监督学习;目标检测

Abstract:

Visual navigation uses the visual information in the environment as the navigation basis, and one of the key tasks of visual navigation is object detection. Traditional object detection methods require a large number of annotations and only focus on the image itself, failing to fully utilize the data similarity in visual navigation tasks. To solve the above problem, this paper proposes a self-supervised training task based on historical image information. In this method, multi-moment images at the same location are aggregated. Furthermore, the foreground and the background are distinguished by information entropy, and the images are enhanced and then sent into the simple siamese (SimSiam) self-supervised paradigm for training. In addition, the multi-layer perception (MLP) networks in the projection and prediction layers of the SimSiam paradigm are upgraded into a convolutional attention module and a convolution module, and the loss function is improved into one of the losses among multi-dimensional vectors, thereby extracting multi-dimensional features from the images. Finally, the model pre-trained by the self-supervised paradigm is used to train the model for downstream tasks. Experiments reveal that the proposed method effectively improves the precision of downstream classification and detection tasks on the processed nuScenes dataset. Its Top5 precision on downstream classification tasks reaches 66.95%, and its mean average precision (mAP) on downstream detection tasks reaches 40.02%.

Key words:historical information;self-supervised learning;object detection

引用本文

姜世豪,朱明.基于历史信息及改进SimSiam的道路目标检测.计算机系统应用,2024,33(6):192-200

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2023-12-14
最后修改日期:2024-01-17
录用日期:
在线发布日期: 2024-04-30
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码