Received: February 1, 2021; Revised: March 10, 2021
Abstract: To quickly construct a large-scale, high-quality face recognition dataset of Chinese subjects, this study proposes a semi-automatic construction method. Compared with existing dataset construction methods, it can rapidly build a large-scale dataset of Chinese celebrity faces, named CCFace (Chinese Celebrities Face). CCFace contains 506,874 face images of 431 people, an average of 1,176 images per person covering different ages and poses. The method goes some way toward relieving the current shortage of Chinese face image datasets available to the face recognition community. In the experiments, the effectiveness of the dataset is tested on several models, and the results indicate that it can serve as a training set for state-of-the-art (SOTA) models. The authors expect that the method and the dataset will draw more researchers into face recognition and promote face recognition applications in China.
Keywords: face recognition dataset; semi-automatic construction method; Chinese Celebrities Face (CCFace); computer vision
Funding: Young Scientists Fund of the National Natural Science Foundation of China (61602505)
Citation:
杜潘飞,李雄伟,贾永杰.中国名人人脸数据集.计算机系统应用,2021,30(12):326-331
DU Pan-Fei,LI Xiong-Wei,JIA Yong-Jie.Chinese Celebrities Face Dataset.COMPUTER SYSTEMS APPLICATIONS,2021,30(12):326-331