系统仿真学报 ›› 2017, Vol. 29 ›› Issue (2): 255-263.doi: 10.16182/j.issn1004731x.joss.201702004

• 仿真建模理论与方法 • 上一篇    下一篇

基于DIVA模型的中文复合元音发音方法研究

张少白, 陈燕俐, 何利文   

  1. 南京邮电大学计算机学院,江苏 南京 210046
  • 收稿日期:2016-03-15 修回日期:2016-08-09 出版日期:2017-02-08 发布日期:2020-06-01
  • 作者简介:张少白(1953-),男,河北定县,博士,教授,研究方向为智能系统与模式识别;陈燕俐(1969-),女,湖北襄阳,博士,教授,研究方向为智能系统与模式识别。
  • 基金资助:
    国家自然科学基金(61271334, 61373065)

Research of Chinese Diphthongs Pronunciation Based on DIVA Model

Zhang Shaobai, Chen Yanli, He Liwen   

  1. Computer Department, Nanjing University of Posts and Telecommunications, Nanjing 210046, China
  • Received:2016-03-15 Revised:2016-08-09 Online:2017-02-08 Published:2020-06-01

摘要: DIVA(Directions Into Velocities of Articulators)模型是一种被用来对涉及大脑中有关语音生成和理解区域的功能进行仿真和描述的自适应神经网络模型,其依赖的语言背景是英文29个基本音素。由于汉语与英语发音区别很大,且加工脑机制也颇为不同,要想将汉语者大脑思维过程“阅读”出来,需要对模型汉语背景的适应性进行专门研究。在DIVA模型的基础上研究汉语复合元音的发音方法,探讨汉语者脑区语音生成与获取的相关问题。通过调节模型的共振峰以及模拟声道对应器官的参数,新构建的模型能很好地辨识汉语与英语元音的区别。该研究为DIVA模型汉语语音生成与获取奠定了良好的基础。

关键词: DIVA模型, 语音发音, 汉语复合元音, LPMCC

Abstract: DIVA (Directions Into Velocities of Articulators) is a kind of adaptive neural network model which is used to simulate and describe some associative functions in brain regions involved speech production and understanding. DIVA takes 29 essential English phonemes as its language background. Since the number of Chinese pronunciation phonemes is much larger than English and the pronunciation brain mechanisms of both also make a big difference, the adaptability of DIVA model for Chinese background has to be studied specially, in order that the model can “read out” the thinking processes in Chinese brain. Based on DIVA, the Chinese pronunciation of diphthongs was explored and related issues on Chinese brain regions involved speech production and acquisition were discussed. The new modified model can distinguish Chinese vowels from English vowels clearly by adjusting formant and the parameters of the corresponding pronunciation organs in DIVA's simulative vocal tract. This research lays a solid foundation for further comprehensive Chinese speech production and acquisition on DIVA model.

Key words: DIVA model, speech sound, Chinese compound vowel, LPMCC

中图分类号: