中国民航大学学报 ›› 2019, Vol. 37 ›› Issue (4): 36-40.

• 民用航空 • 上一篇    下一篇

基于DNN-HMM的陆空通话声学模型构建方法#br#

杨金锋,李凯涛,贾桂敏,师一华
  

  1. (中国民航大学天津市智能信号与图像处理重点实验室,天津300300)
  • 出版日期:2019-08-23 发布日期:2020-04-01
  • 作者简介:杨金锋(1971—),男,河南周口人,教授,博士,研究方向为图像处理、生物识别、计算机视觉.
  • 基金资助:
    国家自然科学基金项目(U1433120,61502498,61379102);中央高校基本科研业务费专项(3122017001)

Acoustic model building of radiotelephony communication based on DNN-HMM#br#

YANG Jinfeng, LI Kaitao, JIA Guimin, SHI Yihua#br#   

  1. (Intelligent Signal and Image Processing Key Lab of Tianjin, CAUC, Tianjin 300300, China)
  • Online:2019-08-23 Published:2020-04-01

摘要: 由于陆空通话特殊的语法结构与发音,通用语音识别声学模型不适用于陆空通话的声学建模。提出一种基于深度学习的民航陆空通话声学模型构建方法。基于建立的陆空通话语料库数据,利用DNN-HMM 模型对陆空通话语音特征进行声学建模,并通过语音特征增强方法提高模型输入特征的鲁棒性。通过实验对比分析不同语音特征、特征维数和连接帧数对陆空通话声学模型的影响。实验结果表明,提出的基于DNNHMM的陆空通话声学模型可使音素错误率降低至5.62%。

关键词: 陆空通话, 声学模型, DNN-HMM, 特征增强

Abstract: Due to the grammatical structure and pronunciation of radiotelephony communication, the acoustic model of generic speech recognition is not applicable to radiotelephony communication. An acoustic model constructing method of radiotelephony communication is proposed based on deep learning. DNN-HMM is used to extract the acoustic features. A professional database is built for model training, speech feature enhancing method is applied to improve the robustness of input speech features. Experiments are conducted to compare the influence of different speech features, feature dimensions and frame link sizes on the proposed model. Results show that the proposed acoustic model based on DNN-HMM is effective and the phoneme error rate can be reduced to 5.62%.

Key words: radiotelephony communication, acoustic model, DNN-HMM, feature enhancement

中图分类号: