###
计算机系统应用英文版:2023,32(10):140-146
本文二维码信息
码上扫一扫!
基于层次信息融合的声学场景分类
(江苏大学 计算机科学与通信工程学院, 镇江 212013)
Acoustic Scene Classification Based on Hierarchical Information Fusion
(School of Computer Science and Communication Engineering, Jiangsu University, Zhenjiang 212013, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 478次   下载 1361
Received:March 22, 2023    Revised:April 28, 2023
中文摘要: 声学场景分类技术可以通过在公共区域中录制的音频分析出它的录制环境, 在日常生活中发挥着重要的作用. 与传统分类问题类与类之间没有关系不同, 声学场景分类的类别间存在着层次结构关系(父类与子类), 如机场和购物中心的父类为室内. 而现有的方法在设计时并未考虑声学场景分类任务的这一特性, 忽略了父类和子类间的依赖关系. 因此, 本文利用声学场景类别间的层次结构关系, 提出了一种基于层次信息融合的声学场景分类方法. 该方法为父类和子类分别设计了单独的分类器, 在子类分类的过程中融合了父类的信息, 并设计了层次依赖损失来对预测的父类和子类不匹配的情况进行惩罚. 在TAU城市声学场景2020移动开发数据集上的实验结果表明, 基于层次信息融合的方法有效地提升了声学场景分类模型的性能, 分类准确率提升了1.1%.
Abstract:Acoustical scene classification technology plays an important role in daily life by analyzing its recording environment through the audio recorded in public areas. Different from the traditional classification problem in which there is no relationship between classes, there is an implicit hierarchical structure relationship between the classes of acoustic scene classification (parent class and subclass). For example, the parent class of the airport and shopping mall is indoor. However, the existing methods do not consider this characteristic of acoustic scene classification task and ignore the dependency relationship between the parent class and the subclass. Therefore, an acoustic scene classification method is proposed, which is based on hierarchical information fusion by using the hierarchical structure relationship between acoustic scene classes. In this method, two separate classifiers are designed to classify the parent class and the subclass respectively. The information of the parent class is fused in the process of the subclass classification, and the hierarchical dependency loss is designed to punish the predicted mismatch between the parent class and the subclass. The experimental results on TAU urban acoustic scenes 2020 mobile development dataset show that the method based on hierarchical information fusion effectively improves the performance of the acoustic scene classification model with an increase of 1.1% in classification accuracy.
文章编号:     中图分类号:    文献标志码:
基金项目:国家自然科学基金(62006098); 中国博士后科学基金(2020M681515)
引用文本:
江港,马忠臣.基于层次信息融合的声学场景分类.计算机系统应用,2023,32(10):140-146
JIANG Gang,MA Zhong-Chen.Acoustic Scene Classification Based on Hierarchical Information Fusion.COMPUTER SYSTEMS APPLICATIONS,2023,32(10):140-146