基于MRI 数据的个性化发音机理研究

基本信息

批准号：61573254

项目类别：面上项目

资助金额：65.00

负责人：本多清志

学科分类：

依托单位：天津大学

批准年份：2015

结题年份：2019

起止时间：2016-01-01 - 2019-12-31

项目状态：已结题

项目参与者：方强,路文焕,曹金鑫,张句,张聪聪,张敬姝,刘立成,李静,关雯丹

关键词：

语音生成发音运动分析语音生理模型生理语音

结项摘要

This project aims at expanding our scientific knowledge on human mechanisms for voice and speech production so as to facilitate technological development and medical application in future. The principal mean for the project is the magnetic resonance imaging (MRI) technique to visualize anatomical structures and physic-physiological mechanisms involved in voice and speech production. High-quality static and motion images will be obtained for this purpose by solving technical difficulties that are known to date regarding MRI. Newer analysis techniques will be established to visualize smaller structures, explore unknown mechanisms, and elucidate acoustic phenomena. The structures and functions for speech production by articulation are recorded by both static and motion imaging techniques. The static images are collected and analyzed so as to distinguish muscles and cartilages from the surrounding structures. The motion images are acquired for the entire articulatory organs and analyzed for tissue-contact detection and marker-tracking analysis, which will describe time-space patterns of speech articulation numerically. Real-time MRI during sentence production is used for cross-linguistic studies in this project, which will contribute to the studies for language learning and clinical examination. Exploring individualized mechanism of voice and speech production is another topic in this project. Three-dimensional shapes of the vocal tract are visualized with MRI, and they are analyzed to discover certain unknown features in speech signals, such as the causal process of deriving individual vocal characteristics, or articulatory normalization of vowels cross male and female speakers. New knowledge thus obtained by MRI-based analysis is applied to refine a physiological articulatory model. Anatomical findings are used to revise muscle geometry in the model, and kinematic data are used to evaluate the performance of the model in speech articulation and synthesis. The project team is formed by the principal investigator having a 20-year experience in the research field and his colleagues at Tianjin University and the Chinese Academy of Social Science. His international colleagues in China, Japan, USA, and France will also support this project as volunteers or consultants. The MRI systems used for this project are modern powerful ones with a 3-Tesla static magnetic field. Both researchers’ experience and advanced research systems promote rapid advancement of the related fields in China and further stimulate speech science worldwide.

该项目旨在揭示人类语音产生机制，尤其是发音运动中的个性化语音特征的生理机理。通过利用MRI作为观测手段，结合cine-MRI和tagged-MRI的优点实现对于完整发音器官的观测及标志点追踪研究。通过MRI图像对于鼻腔、下咽腔等发音器官的几何形态观测及声学仿真，来揭示人的个性化发音特征的生理学机理。通过对女性发音人的下咽腔形态分析及相应的声学仿真，揭示女性下咽腔对于语音频域的影响。通过建立固态机械发音模型来定量研究发音器官与独立元音的声学特征对应关系。进而利用数字声学模型对动态发音过程进行仿真。利用本团队已有的生理发音运动模型，对个性化发音运动过程进行仿真研究。利用MRI观测数据及声学计算模型，从生理层、发音运动层到声学特征深入揭示人的个性化发音机理。

项目摘要

发音人的声道结构影响着语音的声学特性，本项目主要研究说话人个性化语音产生的生理机制。基于共振成像（MRI）以及快速扫描技术，我们建立了高分辨率的口腔、喉腔、咽腔以及鼻腔的三维中文发音数据库，包括静态和部分动态发音器官数据，实现了语音生成过程的可视化。对于说话人的静态特性研究，主要基于数据库中女性的元音数据，建立固体声道模型并进行声学实验，更进一步使用时域有限差分方法（FDTD）建立声学计算模型，并与已有的男性说话人研究结果进行比较。研究结果揭示出下咽腔结构在语音生成过程中对个性化语音特征的贡献，同时表明语音中男性和女性说话人在频谱特性上的明显差异。对于说话人的动态特性研究，我们使用MRI数据库中的静态数据定义了相对舌体大小（RTS），作为不会被控制的生理结构来表征说话人的个性化信息，并使用动态MRI图像计算元音到元音的舌体移动速度。结果表明相对舌体大小通过影响说话人的舌体运动速率改变了共振峰的变化速率。本项目的研究成果扩展了语音个性化的表征方式，进一步完善了语音生成的基础理论。

项目成果

DOI：{{i.doi}}

发表时间：{{i.publish_year}}

暂无此项成果

数据更新时间：2023-05-31

其他相关文献

DOI：10.1051/jnwpu/20213920292

发表时间：2021

DOI：

发表时间：

DOI：10.11842/wst.20190724002

发表时间：2020

DOI：10.16383/j.aas.c180673

发表时间：2021

DOI：

发表时间：2020

本多清志的其他基金

相似国自然基金

基于多模态观测的跨语言语音发音机理研究

批准号：61862054

批准年份：2018

负责人：安见才让

学科分类：F02

资助金额：37.00

项目类别：地区科学基金项目

基于大数据的个性化动态定价策略研究

批准号：91646115

批准年份：2016

负责人：毕文杰

学科分类：G0103

资助金额：43.00

项目类别：重大研究计划

发音阈压（PTP）和发音阈气流（PTF）在启动发音机制中的探讨

批准号：81070773

批准年份：2010

负责人：庄佩耘

学科分类：H1402

资助金额：31.50

项目类别：面上项目

基于网上弱标注数据的个性化图像标注研究

批准号：61303184

批准年份：2013

负责人：李锡荣

学科分类：F0211

资助金额：28.00

项目类别：青年科学基金项目

基于MRI 数据的个性化发音机理研究

{{i.achievement_title}}

暂无此项成果

其他相关文献

一种基于多层设计空间缩减策略的近似高维优化方法

基于LS-SVM香梨可溶性糖的近红外光谱快速检测

基于文献计量学和社会网络分析的国内高血压病中医学术团队研究

二维FM系统的同时故障检测与控制

扶贫资源输入对贫困地区分配公平的影响

本多清志的其他基金

相似国自然基金