###

DOI:

计算机系统应用英文版:2012,21(4):189-192,178

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

基于时频分布与MFCC的说话人识别

金银燕, 于凤芹, 何艳

(江南大学物联网工程学院, 无锡 214122)

Speaker Recognition Based on Time-Frequency Distribution and MFCC

JIN Yin-Yan, YU Feng-Qin, HE Yan

(School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 2012次下载 3998次
Received:July 14, 2011 Revised:September 07, 2011

中文摘要: 针对MFCC不能得到高效的说话人识别性能的问题，提出了将时频特征与MFCC相结合的说话人特征提取方法。首先得到语音信号的时频分布，然后将时频域转换到频域再提取MFCC+MFCC作为特征参数，最后通过支持向量机来进行说话人识别研究。仿真实验比较了MFCC、MFCC+MFCC分别作为特征参数时语音信号与各种时频分布的识别性能，结果表明基于CWD分布的MFCC和MFCC的识别率可提高到95.7%。

中文关键词: 短时傅里叶变换 Wigner-Ville分布 Choi-Williams分布 Mel频率倒谱系数说话人识别

Abstract:Because MFCC can't reflect the dynamic characteristics of speech signal and their own non-stationary, a feature extraction method by combining time-frequency distribution with MFCC is proposed. First get time-frequency distribution of speech signal, and convert time-frequency domain into frequency domain, then extract MFCC+MFCC as characteristic parameters. Finally speaker recognition uses the support vector machine. The simulation experiment compares recognition performance when MFCC and MFCC+MFCC are respectively as characteristic parameters by speech signal and all kinds of time-frequency distribution. Results show that the speaker recognition performance using MFCC+MFCC based on the CWD time-frequency distribution can be improved to 95.7%.

keywords: STFT WVD CWD MFCC speaker recognition

文章编号： 中图分类号： 文献标志码：

基金项目:国家自然科学基金(61075008)

Author Name	Affiliation
JIN Yin-Yan	School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China
YU Feng-Qin	School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China
HE Yan	School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China

Author Name	Affiliation
JIN Yin-Yan	School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China
YU Feng-Qin	School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China
HE Yan	School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China

引用文本：
金银燕,于凤芹,何艳.基于时频分布与MFCC的说话人识别.计算机系统应用,2012,21(4):189-192,178
JIN Yin-Yan,YU Feng-Qin,HE Yan.Speaker Recognition Based on Time-Frequency Distribution and MFCC.COMPUTER SYSTEMS APPLICATIONS,2012,21(4):189-192,178