Abstract: With the continuous advancement of smart city development, safety issues at building edges have become increasingly severe, as accidental falls and falling objects occur frequently, creating an urgent need for more intelligent and efficient monitoring solutions. To address the limited temporal modeling capability of current object detection methods, particularly in recognizing small, occluded, and fast-moving targets, this study proposes a video detection framework that integrates multiple temporal semantic enhancement mechanisms for the unified detection of both people and falling objects. The proposed method is built on a Faster R-CNN backbone and incorporates three temporal-aware modules: a Motion-Aware Module (MAM), Temporal Region of Interest Align (TRoI Align), and a Sequence-Level Semantic Aggregation (SELSA) head. These modules enhance the model's perception of dynamic objects in complex temporal scenes from three perspectives: motion saliency modeling, spatial alignment, and semantic aggregation. To support model training and evaluation, a dedicated video dataset covering multiple building-edge scenarios and various types of risk targets is constructed. Experimental results demonstrate that the proposed method performs strongly on both the "detection of personnel behavior at building edges" and "falling object detection" tasks, exhibiting excellent cross-task robustness and practical application potential.