基于注意力融合网络的方面级多模态情感分类

doi:10.15888/j.cnki.csa.009385

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年4月23日 20:48 星期三

首页 > 过刊浏览>2024年第33卷第2期 >94-104. DOI:10.15888/j.cnki.csa.009385

PDF HTML阅读 XML下载导出引用引用提醒

基于注意力融合网络的方面级多模态情感分类
DOI:
                        10.15888/j.cnki.csa.009385
                    
CSTR:
                        32024.14.csa.009385
                    
作者:
                        冼广铭冼广铭
华南师范大学 软件学院, 佛山 528225
在期刊界中查找
在百度中查找
在本站中查找
招志锋招志锋
华南师范大学 软件学院, 佛山 528225
在期刊界中查找
在百度中查找
在本站中查找
阳先平阳先平
华南师范大学 软件学院, 佛山 528225
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金(61070015)

Aspect-level Multimodal Sentiment Classification Based on Attention Fusion Network

Author:

XIAN Guang-Ming
XIAN Guang-Ming
School of Software, South China Normal University, Foshan 528225, China
在期刊界中查找
在百度中查找
在本站中查找
ZHAO Zhi-Feng
ZHAO Zhi-Feng
School of Software, South China Normal University, Foshan 528225, China
在期刊界中查找
在百度中查找
在本站中查找
YANG Xian-Ping
YANG Xian-Ping
School of Software, South China Normal University, Foshan 528225, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

方面级多模态情感分类任务的一个关键是从文本和视觉两种不同模态中准确地提取和融合互补信息, 以检测文本中提及的方面词的情感倾向. 现有的方法大多数只利用单一的上下文信息结合图片信息来分析, 存在对方面和上下文信息、视觉信息的相关性的识别不敏感, 对视觉中的方面相关信息的局部提取不够精准等问题, 此外, 在进行特征融合时, 部分模态信息不全会导致融合效果一般. 针对上述问题, 本文提出一种注意力融合网络AF-Net模型去进行方面级多模态情感分类, 利用空间变换网络STN学习图像中目标的位置信息来帮助提取重要的局部特征; 利用基于Transformer的交互网络对方面和文本以及图像之间的关系进行建模, 实现多模态交互; 同时补充了不同模态特征间的相似信息以及使用多头注意力机制融合多特征信息, 表征出多模态信息, 最后通过Softmax层取得情感分类的结果. 在两个基准数据集上进行实验和对比, 结果表明AF-Net能获得较好的性能, 提升方面级多模态情感分类的效果.

关键词:多模态;情感分类;空间变换网络;交互网络;相似信息;注意力融合网络

Abstract:

One of the key tasks of aspect-level multimodal sentiment classification is to accurately extract and fuse complementary information from two different modals of text and vision, so as to detect the sentiment orientation of the aspect words mentioned in the text. Most of the existing methods only use single context information combined with image information for analysis, revealing the problems such as insensitive to the recognition of the correlation between aspect-, context- and visual-information, and imprecise in local extraction of aspect-related information in vision. In addition, when performing feature fusion, insufficient partial modal information will lead to mediocre fusion effect. To solve the above problems, an attention fusion network AF-Net model is proposed to perform aspect-level multimodal sentiment classification in this study. The spatial transformation network (STN) is used to learn the location information of objects in images to help extract important local features. The Transformer based interaction network is used to model the relationship between aspects, texts and images, and realize multi-modal interaction. At the same time, the similar information between different modal features is supplemented and the multi-feature information is fused by multi-attention mechanism to represent the multi-modal information. Finally, the result of sentiment classification is obtained through Softmax layer. Experiments and comparisons carried out on the two benchmark datasets show that AF-Net can achieve better performance and improve the effect of aspect-level multimodal sentiment classification.

Key words:multimodal;sentiment classification;spatial transformation network (STN);interaction network;similar information;attention fusion network

引用本文

冼广铭,招志锋,阳先平.基于注意力融合网络的方面级多模态情感分类.计算机系统应用,2024,33(2):94-104

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2023-08-01
最后修改日期:2023-09-01
录用日期:
在线发布日期: 2023-12-25
出版日期: 2023-02-05

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码