This work studies a new method for synthesizing formant-targeted sounds, based on the acoustic model of speech production and a reflection-type line analog (RTLA) articulatory synthesis model. The inverse problem of speech production is solved using a codebook together with a dynamic interpolation constraint. The synthesis model is implemented as a scattering process derived from the reflection-type line analog of the vocal tract, in accordance with the acoustic model of speech production. The vocal-tract area function that controls the synthesis model is derived from the trajectories of the first three formants via the inverse solution of speech production. The proposed method gives good naturalness and dynamic smoothness, and it allows speech timbre to be controlled or modified easily and flexibly. It also requires fewer control parameters and a very low system sampling rate.
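The scattering process mentioned above can be illustrated with a minimal sketch. It assumes the vocal tract is approximated by equal-length lossless tube sections and uses the pressure-wave sign convention of the standard Kelly-Lochbaum junction; the function names `reflection_coefficients` and `scatter` are hypothetical and are not taken from the project itself.

```python
import numpy as np

def reflection_coefficients(areas):
    """Reflection coefficient at each junction between adjacent tube sections.

    Assumes pressure waves: r_k = (A_k - A_{k+1}) / (A_k + A_{k+1}).
    """
    a = np.asarray(areas, dtype=float)
    return (a[:-1] - a[1:]) / (a[:-1] + a[1:])

def scatter(forward_in, backward_in, r):
    """One scattering junction with reflection coefficient r.

    forward_in  : wave arriving from the glottis side
    backward_in : wave arriving from the lip side
    Returns (forward_out, backward_out).
    """
    forward_out = (1.0 + r) * forward_in - r * backward_in
    backward_out = r * forward_in + (1.0 - r) * backward_in
    return forward_out, backward_out

# Hypothetical area function in arbitrary units, glottis to lips.
areas = [2.0, 1.0, 1.0, 3.0]
r = reflection_coefficients(areas)
fwd, bwd = scatter(1.0, 0.0, r[0])
```

A uniform tube yields zero reflection coefficients, so waves pass through every junction unchanged; any narrowing or widening of the area function produces partial reflection, which is what shapes the formants.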
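This project studies the inverse problem of speech production: based on speech acoustics and vocal-tract perturbation theory, the dynamic vocal-tract shape is determined from features of the speech signal. It also studies a speech synthesis method based on an articulatory model, in which the vocal-tract parameters obtained from the inverse solution serve as the synthesizer's control parameters, so that speech with controllable timbre can be synthesized from a small number of parameters. The work offers new methods and approaches for improving the dynamic transitions and naturalness of synthesized speech and the quality of text-to-speech conversion. It has scientific and practical value for developing speech technology in multimedia communication and the information superhighway.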
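One way to picture a codebook-based inverse solution is the sketch below. It is not the project's algorithm: it swaps in a plain dynamic-programming search that picks one codebook entry per frame, trading formant match against frame-to-frame smoothness of the area function. The function name `inverse_by_codebook`, the `smooth_weight` parameter, and the toy codebook values are all assumptions made for illustration.

```python
import numpy as np

def inverse_by_codebook(formant_track, cb_formants, cb_areas, smooth_weight=1.0):
    """Choose one codebook area function per frame.

    formant_track : (T, 3) first three formants per frame
    cb_formants   : (K, 3) formants of each codebook entry
    cb_areas      : (K, N) area function of each codebook entry
    Returns a (T, N) array of area functions.
    """
    F = np.asarray(formant_track, dtype=float)
    CF = np.asarray(cb_formants, dtype=float)
    CA = np.asarray(cb_areas, dtype=float)
    # Local cost: squared formant distance between each frame and each entry.
    local = ((F[:, None, :] - CF[None, :, :]) ** 2).sum(axis=2)
    # Transition cost: squared area distance between pairs of entries.
    trans = smooth_weight * ((CA[:, None, :] - CA[None, :, :]) ** 2).sum(axis=2)
    T, K = local.shape
    cost = local[0].copy()
    back = np.zeros((T, K), dtype=int)
    for t in range(1, T):
        total = cost[:, None] + trans  # total[i, j]: entry i at t-1, entry j at t
        back[t] = total.argmin(axis=0)
        cost = total.min(axis=0) + local[t]
    # Trace the cheapest path backwards from the last frame.
    path = [int(cost.argmin())]
    for t in range(T - 1, 0, -1):
        path.append(int(back[t][path[-1]]))
    path.reverse()
    return CA[path]

# Toy codebook with two hypothetical entries (formants in Hz, areas in arbitrary units).
cb_formants = [[500.0, 1500.0, 2500.0], [300.0, 2300.0, 3000.0]]
cb_areas = [[1.0, 1.0], [2.0, 0.5]]
track = [[500.0, 1500.0, 2500.0], [300.0, 2300.0, 3000.0]]
areas = inverse_by_codebook(track, cb_formants, cb_areas, smooth_weight=0.0)
```

With `smooth_weight=0.0` the search reduces to nearest-entry lookup per frame; larger values favor gradual changes in the area function, which is the role a dynamic constraint plays in keeping the recovered vocal-tract shape smooth over time.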
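Data last updated: 2023-05-31
An approximate high-dimensional optimization method based on a multi-level design space reduction strategy
Uyghur morphological segmentation incorporating string features
Current status of water resources development and utilization and eco-environmental problems in the Fen River basin, a tributary of the Yellow River
Research on transactional data management techniques in new non-volatile memory environments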
A Fast Algorithm for Computing Dominance Classes
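Computational modeling of speech production and its application to the rehabilitation of speech disorders
Research on modular neural networks and their application to speech processing
A speaking-rate adaptive parametric model and its application to speech recognition
Time-frequency signal analysis theory and its application to speech processing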