Abstract: In current multi-modal emotion analysis of videos, the influence of modality representation learning on modality fusion and the final classification results has not been adequately considered. To address this, this study proposes a multi-modal emotion analysis model that integrates cross-modal representation learning. First, BERT and LSTM are used to extract internal information from the text, audio, and visual modalities separately; cross-modal representation learning is then applied to obtain more information-rich unimodal features. In the modality fusion stage, a gating mechanism is incorporated into the traditional Transformer fusion mechanism to control the flow of information more precisely. Experimental results on the publicly available CMU-MOSI and CMU-MOSEI datasets show that the model improves accuracy and F1 score over traditional models, validating its effectiveness.
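The abstract does not give the exact formulation of the gated Transformer fusion, so the following PyTorch sketch illustrates one common way to combine a sigmoid gate with cross-modal attention: a target modality attends to a source modality, and the gate decides how much attended information to admit. The module and variable names (e.g. GatedCrossModalFusion) are hypothetical, not taken from the paper.

```python
import torch
import torch.nn as nn

class GatedCrossModalFusion(nn.Module):
    """Hypothetical sketch: gated cross-modal Transformer fusion.

    The target modality (e.g. text) attends to a source modality
    (e.g. audio); a sigmoid gate then controls how much of the
    attended information flows into the target representation.
    This is an assumed formulation, not the paper's exact one.
    """

    def __init__(self, dim: int, num_heads: int = 4):
        super().__init__()
        self.cross_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.gate = nn.Linear(2 * dim, dim)  # gate computed from [target; attended]
        self.norm = nn.LayerNorm(dim)

    def forward(self, target: torch.Tensor, source: torch.Tensor) -> torch.Tensor:
        # target: (batch, T_t, dim); source: (batch, T_s, dim)
        attended, _ = self.cross_attn(target, source, source)
        g = torch.sigmoid(self.gate(torch.cat([target, attended], dim=-1)))
        # The gate g scales the cross-modal signal element-wise.
        return self.norm(target + g * attended)

# Usage: fuse audio information into text features.
text = torch.randn(8, 50, 128)    # (batch, text steps, dim)
audio = torch.randn(8, 200, 128)  # (batch, audio steps, dim)
fused = GatedCrossModalFusion(dim=128)(text, audio)
print(fused.shape)  # torch.Size([8, 50, 128])
```

In this sketch the gate plays the role the abstract ascribes to the gating mechanism: rather than adding the attention output unconditionally, as a standard Transformer layer would, the learned gate modulates it per dimension before the residual connection.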