Abstract: Unlike images captured in controlled laboratory environments, facial expression images in real life involve complex scenes, and local occlusion, the most common problem, causes significant changes in facial appearance. As a result, the global feature extracted by a model contains redundant information unrelated to emotion, which reduces the model's discriminative ability. To address this problem, this study proposes a facial expression recognition method that combines contrastive learning with a channel-spatial attention mechanism, learning locally salient emotion features while modeling the relationship between local and global features. First, contrastive learning is introduced: a new positive and negative sample selection strategy is designed through a specific data augmentation method, and the model is pre-trained on a large amount of easily accessible unlabeled emotion data to learn an occlusion-aware representation. This representation is then transferred to the downstream facial expression recognition task to improve recognition performance. In the downstream task, expression analysis of each face image is reformulated as emotion detection over multiple local regions. Fine-grained attention maps for the different local facial regions are learned with the channel-spatial attention mechanism, and the weighted features are fused to suppress the noise introduced by occluded content. Finally, a constraint loss for joint training is proposed to optimize the final fused feature for classification. Experimental results indicate that the proposed method achieves results comparable to existing state-of-the-art methods on both public non-occluded facial expression datasets (RAF-DB and FER2013) and synthetically occluded facial expression datasets.
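
To make the channel-spatial attention and region-fusion step concrete, the following is a minimal sketch in PyTorch. It is not the authors' implementation: the abstract does not specify the exact architecture, so a CBAM-style attention block is assumed, and the module and function names (ChannelSpatialAttention, fuse_regions, region_feats) are hypothetical.

    # Minimal sketch (assumptions noted above): channel-spatial attention
    # applied to per-region features, then averaged into one fused feature.
    import torch
    import torch.nn as nn

    class ChannelSpatialAttention(nn.Module):
        def __init__(self, channels, reduction=16):
            super().__init__()
            # Channel attention: squeeze spatial dims, re-weight channels.
            self.channel_mlp = nn.Sequential(
                nn.Linear(channels, channels // reduction),
                nn.ReLU(inplace=True),
                nn.Linear(channels // reduction, channels),
            )
            # Spatial attention: 7x7 conv over pooled channel maps.
            self.spatial_conv = nn.Conv2d(2, 1, kernel_size=7, padding=3)

        def forward(self, x):                      # x: (B, C, H, W) region feature
            b, c, _, _ = x.shape
            avg = x.mean(dim=(2, 3))               # (B, C) average-pooled descriptor
            mx = x.amax(dim=(2, 3))                # (B, C) max-pooled descriptor
            ca = torch.sigmoid(self.channel_mlp(avg) + self.channel_mlp(mx))
            x = x * ca.view(b, c, 1, 1)            # channel-weighted feature
            sa_in = torch.cat(
                [x.mean(1, keepdim=True), x.amax(1, keepdim=True)], dim=1
            )                                      # (B, 2, H, W)
            sa = torch.sigmoid(self.spatial_conv(sa_in))
            return x * sa                          # channel- and spatially-weighted

    def fuse_regions(region_feats, attn):
        # region_feats: list of (B, C, H, W) local-region features from a backbone.
        # Attention down-weights occluded content before the features are fused.
        weighted = [attn(f).mean(dim=(2, 3)) for f in region_feats]   # (B, C) each
        return torch.stack(weighted, dim=1).mean(dim=1)               # fused (B, C)

In this sketch, averaging the attention-weighted region descriptors stands in for the paper's weighted fusion; the fused feature would then be optimized with the classification and constraint losses described above.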