Voiceprint Recognition Method Based on ResNet-LSTM

doi:10.15888/j.cnki.csa.007934


Volume 30, Issue 6, 2021, pp. 215-219


Author:
LIU Yong, LIANG Hong-Tao, LIU Guo-Zhu, HU Qiang
College of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266061, China

                    

Abstract:

To address the complex process and low recognition rate of traditional methods, this study proposes a voiceprint recognition method based on ResNet-LSTM. ResNet extracts the spatial features of voiceprints and LSTM extracts their temporal features, yielding deep voiceprint features that combine both. Experimental results show that the proposed method achieves an equal error rate of 1.196%, which is 3.68% and 1.95% lower than that of the baseline methods d-vector and VGGNet, respectively, and a recognition accuracy of 98.8%.
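The equal error rate (EER) quoted above is the operating point at which the false acceptance rate equals the false rejection rate. The following is a minimal sketch of how such a value can be estimated from verification scores, not the authors' evaluation code. The function name `equal_error_rate` and the example scores are hypothetical, and it assumes that higher scores mean "same speaker".

```python
from typing import List, Tuple


def equal_error_rate(genuine: List[float], impostor: List[float]) -> Tuple[float, float]:
    """Estimate (eer, threshold) from similarity scores.

    Assumption: a higher score means the two utterances are more likely
    to come from the same speaker.
    """
    if not genuine or not impostor:
        raise ValueError("need at least one genuine and one impostor score")

    best_gap, best_eer, best_thr = float("inf"), 1.0, 0.0
    # Treat every observed score as a candidate decision threshold.
    for thr in sorted(set(genuine) | set(impostor)):
        # False rejection rate: genuine trials scored below the threshold.
        frr = sum(s < thr for s in genuine) / len(genuine)
        # False acceptance rate: impostor trials scored at or above it.
        far = sum(s >= thr for s in impostor) / len(impostor)
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if gap < best_gap:
            best_gap, best_eer, best_thr = gap, (far + frr) / 2, thr
    return best_eer, best_thr


# Hypothetical scores, for illustration only.
genuine_scores = [0.9, 0.8, 0.7, 0.4]
impostor_scores = [0.6, 0.3, 0.2, 0.1]
eer, threshold = equal_error_rate(genuine_scores, impostor_scores)
print(f"EER = {eer:.3f} at threshold {threshold}")
```

With finite trial lists the two error rates rarely cross exactly, so the sketch reports their average at the closest threshold. Evaluation toolkits often interpolate along the ROC curve instead.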

Key words: voice recognition; ResNet-LSTM; spatial features; temporal features

Get Citation

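LIU Yong, LIANG Hong-Tao, LIU Guo-Zhu, HU Qiang. Voiceprint Recognition Method Based on ResNet-LSTM. Computer Systems & Applications (计算机系统应用), 2021, 30(6): 215-219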



History

Received: September 25, 2020
Revised: October 21, 2020
Online: June 05, 2021

Copyright: Institute of Software, Chinese Academy of Sciences