基于循环卷积网络和逆变换贝叶斯损失的室内人群计数

doi:10.15888/j.cnki.csa.009930

AIPUB归智期刊联盟

微信公众号

网站二维码

首页 > 过刊浏览>2025年第34卷第9期 >253-263. DOI:10.15888/j.cnki.csa.009930

PDF HTML阅读 XML下载导出引用引用提醒

基于循环卷积网络和逆变换贝叶斯损失的室内人群计数
DOI:
                        10.15888/j.cnki.csa.009930
                    
CSTR:
                        32024.14.csa.009930
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金(62202235)

Circular Convolution Network with Inverse Transform Bayesian Loss for Indoor Crowd Counting

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

本文关注室内人群计数这一具有挑战性的任务. 在室内场景中, 人们经常聚集在一起并在有限空间内执行相似的任务. 因此, 室内人群的大多数行为都非常相似. 获取全局感受野并找出室内人群特征中的相似性是很重要的. 本文设计了一种循环卷积网络, 该网络结合了卷积神经网络和Transformer的优势, 以获取人群特征的局部和全局相关性. 与基于Transformer的方法相比, 采用了更简单且高效的循环卷积模块. 此外, 提出了一种逆变换贝叶斯损失函数, 该函数适用于具有大尺度变化的稀疏和拥挤的室内场景. 最后, 为了减轻标注偏差问题的影响, 提出了一种标签扩散策略来扩大标注区域, 假设每个原始标注点的相邻像素也有一定概率成为头部中心. 在Class A、Class B、Canteen和Mall数据集上与次优方法相比, MAE/RMSE分别提高了4.1%/4.4%、5.8%/8.0%、3.9%/1.6%和3.9%/1.6%.

Abstract:

This study focuses on the challenging task of indoor crowd counting. In indoor scenes, people often get together and perform similar tasks in constrained spaces. As most behaviors of indoor crowds are consequently quite similar, it is important to acquire a global receptive field and identify similarities in indoor crowd features. To address this problem, this study designs a circular convolution network, which combines the advantages of convolution neural networks and Transformer, to obtain both local and global correlations of the crowd features. Compared with the Transformer-based methods, this network adopts a much simpler and more efficient circular convolutional module. Moreover, a novel inverse-transform Bayesian loss function, which suits both sparse and crowded indoor scenes with large-scale variations, is proposed. Finally, to alleviate the influence of the annotation deviation, a label diffusion strategy that expands annotation areas by assuming adjacent pixels of each original annotation point may also potentially represent head centers. Compared with the second-best method on Class A, Class B, Canteen, and Mall datasets, this method improves MAE/RMSE by 4.1%/4.4%, 5.8%/8.0%, 3.9%/1.6%, and 3.9%/1.6%, respectively.

参考文献

相似文献

引证文献

引用本文

刘永文,孙君宇,凌妙根,苏健.基于循环卷积网络和逆变换贝叶斯损失的室内人群计数.计算机系统应用,2025,34(9):253-263

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2025-01-26
最后修改日期:2025-02-17
录用日期:
在线发布日期: 2025-07-25
出版日期:

微信公众号

网站二维码

引用本文

分享

相关视频

文章指标

历史

文章二维码