跨库语音情感识别研究进展

doi:10.15888/j.cnki.csa.008811

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年5月2日 17:53 星期五

首页 > 过刊浏览>2022年第31卷第11期 >31-48. DOI:10.15888/j.cnki.csa.008811

PDF HTML阅读 XML下载导出引用引用提醒

跨库语音情感识别研究进展
DOI:
                        10.15888/j.cnki.csa.008811
                    
CSTR:
                        
                    
作者:
                        张石清张石清
浙江科技学院 理学院, 杭州 310023;台州学院 智能信息处理研究所, 台州 317000
在期刊界中查找
在百度中查找
在本站中查找
刘瑞欣刘瑞欣
浙江科技学院 理学院, 杭州 310023
在期刊界中查找
在百度中查找
在本站中查找
赵小明赵小明
台州学院 智能信息处理研究所, 台州 317000
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金 (61976149); 浙江省自然科学基金 (LZ20F020002)

Research Advance of Cross-corpus Speech Emotion Recognition

Author:

ZHANG Shi-Qing
ZHANG Shi-Qing
School of Science, Zhejiang University of Science and Technology, Hangzhou 310023, China;Institute of Intelligent Information Processing, Taizhou University, Taizhou 317000, China
在期刊界中查找
在百度中查找
在本站中查找
LIU Rui-Xin
LIU Rui-Xin
School of Science, Zhejiang University of Science and Technology, Hangzhou 310023, China
在期刊界中查找
在百度中查找
在本站中查找
ZHAO Xiao-Ming
ZHAO Xiao-Ming
Institute of Intelligent Information Processing, Taizhou University, Taizhou 317000, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

语音情感识别在人机交互过程中发挥极为重要的作用, 近年来备受关注. 目前, 大多数的语音情感识别方法主要在单一情感数据库上进行训练和测试 . 然而, 在实际应用中训练集和测试集可能来自不同的情感数据库. 由于这种不同情感数据库的分布存在巨大差异性, 导致大多数的语音情感识别方法取得的跨库识别性能不尽人意. 为此, 近年来不少研究者开始聚焦跨库语音情感识别方法的研究. 本文系统性综述了近年来跨库语音情感识别方法的研究现状与进展, 尤其对新发展起来的深度学习技术在跨库语音情感识别中的应用进行了重点分析与归纳. 首先, 介绍了语音情感识别中常用的情感数据库, 然后结合深度学习技术, 从监督、无监督和半监督学习角度出发, 总结和比较了现有基于手工特征和深度特征的跨库语音情感识别方法的研究进展情况, 最后对当前跨库语音情感识别领域存在的挑战和机遇进行了讨论与展望.

关键词:语音情感识别;跨库;深度学习;手工特征;深度特征;语音情感

Abstract:

Speech emotion recognition (SER) plays an extremely important role in the process of human-computer interaction (HCI), which has attracted much attention in recent years. At present, most SER approaches are mainly trained and tested on a single emotion corpus. In practical applications, however, the training set and testing set may come from different emotion corpora. Due to the huge difference in the distribution of different emotion corpora, the cross-corpus recognition performance achieved by most SER methods is unsatisfactory. To address this issue, many researchers have started focusing on the studies of cross-corpus SER methods in recent years. This study systematically reviews the research status and progress of cross-corpus SER methods in recent years. In particular, the application of the newly developed deep learning techniques on cross-corpus SER tasks is analyzed and summarized. Firstly, the emotion corpora commonly used in SER are introduced. Then, on the basis of deep learning techniques, the research progress of existing cross-corpus SER methods based on hand-designed features and deep features is summarized and compared from the perspectives of supervised, unsupervised, and semi-supervised learning. Finally, the challenges and opportunities in the field of cross-corpus SER are discussed and predicted.

Key words:speech emotion recognition;cross-corpus;deep learning;hand-designed features;deep features;speech emotion

引用本文

张石清,刘瑞欣,赵小明.跨库语音情感识别研究进展.计算机系统应用,2022,31(11):31-48

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2022-03-05
最后修改日期:2022-04-02
录用日期:
在线发布日期: 2022-07-14
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码