###
计算机系统应用英文版:2023,32(2):63-74
本文二维码信息
码上扫一扫!
基于全连接张量网络的多模态与多样性推荐算法
(1.北京师范大学 人工智能学院, 北京 100875;2.北京师范大学 数学科学学院, 北京 100875;3.上海交通大学, 上海 200240;4.中国国家博物馆, 北京 100006)
Multimodal and Diverse Recommendation Algorithm Based on Fully-connected Tensor Networks
(1.School of Artificial Intelligence, Beijing Normal University, Beijing 100875, China;2.School of Mathematical Sciences, Beijing Normal University, Beijing 100875, China;3.Shanghai Jiao Tong University, Shanghai 200240, China;4.National Museum of China, Beijing 100006, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 758次   下载 1551
Received:June 20, 2022    Revised:July 08, 2022
中文摘要: 在全媒体时代下, 基于多模态数据的推荐具有重要意义. 本文使用文本、音频、图像3种模态数据进行推荐, 通过两个阶段进行张量融合: 第1阶段通过3个平行分支对任意两个模式的相关性进行建模和融合, 第2阶段再将3个分支的结果进行融合, 不仅考虑了两模态之间的局部交互作用, 并且消除了模态融合顺序对结果的影响; 在推荐模块中, 将融合特征通过堆叠降噪自编码器作为协同过滤的辅助特征进行推荐. 本文所构建的推荐系统中模态融合与推荐采用端到端的训练过程. 同时, 为了解决推荐结果中存在的相似度高、多样性差的问题, 我们基于二阶段的张量模态融合特征构建相似度矩阵, 在已有推荐结果的基础上进一步精化结果, 实现快速的多样性推荐. 实验证明, 基于本文提出的多模态融合特征的推荐模型不仅能够有效地提升推荐性能, 并且能够增强推荐结果的多样性.
Abstract:In the all-media era, recommendation based on multimodal data is of great significance. This study proposes recommendation based on data in three modalities: text, audio, and image. Tensor fusion is implemented in two stages: The correlation between any two modes is modeled and fused by three parallel branches in the former stage, and the results of the three branches are then fused in the latter stage. This approach not only considers the local interaction between two modalities but also eliminates the influence of the modality fusion order on the result. In the recommen-dation module, the fused features are input to the stacked denoising auto-encoder and are then used as auxiliary features of collaborative filtering for recommendation. In the recommendation system constructed, an end-to-end training process is adopted for modality fusion and recommendation. Moreover, to overcome the high similarity and poor diversity of the recommendation results, this study also constructs a similarity matrix with the fused features of the tensor modalities in the two stages to further refine the results on the basis of the available recommendation results and thereby achieve rapid diversified recommendation. The experimental results show that the recommendation model based on the proposed multimodal fused features can not only effectively improve recommendation performance but also enhance the diversity of recommendation results.
文章编号:     中图分类号:    文献标志码:
基金项目:国家重点研发计划 (2019YFC1521100); 国家自然科学基金(61977063); 国家自然科学基金重大项目(72192821); 上海市科委重大项目(21511101200)
引用文本:
孟诗蓓,郑睿,常亮,陈玉珑,孟睿伟,程诺.基于全连接张量网络的多模态与多样性推荐算法.计算机系统应用,2023,32(2):63-74
MENG Shi-Bei,ZHENG Rui,CHANG Liang,CHEN Yu-Long,MENG Rui-Wei,CHENG Nuo.Multimodal and Diverse Recommendation Algorithm Based on Fully-connected Tensor Networks.COMPUTER SYSTEMS APPLICATIONS,2023,32(2):63-74