Digital Speech Recognition System Based on HTK
CSTR:
Author:
  • Article
  • | |
  • Metrics
  • |
  • Reference [6]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    Digital speech recognition is an extremely important branch of speech recognition. Its application in real life is used more and more widely. HTK is a C language-based toolkit developed by CUED mainly used for speech signal reorganization, speech synthesis, character reorganization, DNA compositor and so on. From HTK's general principles and software architecture, this paper designs a digital speech recognition system based on HTK, and verifies its recognition efficiency. Then by changing the identification unit and MFCC dimension, and by increasing the number of gaussian mixture components, it considers effects of different factors on the performance of the system. Finally, through the comparing test, it verifies the right combination of the identification unit and the number of gaussian mixture components, and also proves that MFCC dimension can enhance the system's correct rate.

    Reference
    1 马峻.语音识别技术研究.哈尔滨:哈尔滨工程大学,2004.
    2 Young S, Evermann G, Gales M. The HTK Book. Cambridge University Engineering Department. Version3.3, 2005.
    3 http://htk.eng.cam.ac.uk
    4 石现峰,张学智,张峰.基于HTK的语音识别系统设计.计算机技术与展,2006,(10):16-10.
    5 侯周国.基于HMM的汉语数字语音识别系统研究.长沙:湖南师范大学,2006.
    6 江官星.非特定人孤立词语音识别系统的研究.成都:西南交通大学,2006.
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

魏巍,张海涛.一种基于HTK的数字语音识别系统.计算机系统应用,2011,20(9):17-21

Copy
Share
Article Metrics
  • Abstract:2015
  • PDF: 5840
  • HTML: 0
  • Cited by: 0
History
  • Received:December 16,2010
  • Revised:April 10,2011
Article QR Code
You are the first990500Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063