多点同步激光侦听系统语音重构理论研究及实验研究

基本信息

批准号：61571435

项目类别：面上项目

资助金额：57.00

负责人：郑成诗

学科分类：

依托单位：中国科学院声学研究所

批准年份：2015

结题年份：2019

起止时间：2016-01-01 - 2019-12-31

项目状态：已结题

项目参与者：李晓东,江峰,Antoine Deleforge,彭任华,厉剑,丁建策,崔洋,于光喜

关键词：

激光侦听系统非声学传感器语音重构有限元分析语音去混响

结项摘要

Due to advantages in strong concealment and long-distance voice detection, laser listening systems have very important potential applications for national security purpose. Our recent experimental study reveals that there are two critical problems resulting in both decreasing speech quality and reducing speech intelligibility for multi-point synchronous laser listening systems, where one is missing middle and high frequency components for the speech signal and the other is the noise and the reverberation in adverse environments. To solve these two problems, we need to study in three scientific fields. First, to physically model the system, we introduce a statistical acoustic model to study the impact of different types of sound sources on both the acoustic transfer function (ATF) and the vibration transfer function (VTF). This can give us some clear guidelines on choosing detection objects and developing suitable speech processing methods. Second, to reduce the noise and the reverberation in a robust way, we study how to calibrate the non-acoustic sensor array automatically and propose new speech dereverberation and new noise reduction algorithms under mismatched signal models. We further study the robustness and the stability of these new proposed algorithms by using experimental data. Third, in order to dramatically improve both speech quality and speech intelligibility, we propose to extend the recently proposed phase-optimized K-SVD (PO-K-SVD) algorithm to reconstruct speech from the enhanced speech. To solve the phase continuity problem in the speech reconstruction stage, we focus on investigating the impact of spectral estimators and the frame length/overlap on the overcomplete dictionary of multi-channel spectra and that of instantaneous phases, respectively. This study can significantly improve both the theory and the method of multi-point synchronous laser listening systems, and we expect to make them become more applicable in our lives.

激光侦听系统具有隐蔽性强和拾音距离远的突出优点，对国家安全具有潜在的重要价值。本项目研究先期实验数据分析表明：语音中高频能量缺失和干扰噪声混响引起的语音声品质和可懂度下降是多点同步激光侦听系统的焦点问题，因此需重点解决三个科学问题。首先，针对系统的物理建模这一首要问题，引入统计声学方法研究不同特性声源的声学传递函数和振动系统响应，为优选探测目标和研究语音信号处理方法提供物理依据；其次，针对去噪去混响这一核心问题，研究信号模型失配的校准方案和稳健去噪去混响新理论和新方法，同时建立语音数据库验证去噪去混响新方法的有效性和稳定性；最后，针对语音中高频能量缺失这一关键问题，拓展基于相位优化的K-SVD理论以实现语音重构，研究谱估计对过完备幅度字典训练的影响并研究分帧策略实现连续相位。开展本项目研究将极大的完善多点同步激光侦听系统的理论和方法，推动其逐步走向实用化。

项目摘要

激光侦听系统具有隐蔽性强和拾音距离远的突出优点，对国家安全具有潜在的重要价值。本项目立项之前的研究先期实验数据分析表明：语音中高频能量缺失和干扰噪声混响引起的语音声品质和可懂度下降是多点同步激光侦听系统的焦点问题，因此需重点解决三个科学问题。首先，针对系统的物理建模这一首要问题；其次，针对去噪去混响这一核心问题；最后，针对语音中高频能量缺失这一关键问题。通过对这三个问题的研究，将极大的完善多点同步激光侦听系统的理论和方法，推动其逐步走向实用化。 . 本项目主要在以下三个方面进行了为期四年的项目研究：首先，研究不同特性声源的声学传递函数和振动系统响应，为优选探测目标和研究语音信号处理方法提供物理依据；其次，研究信号模型失配的校准方案和稳健去噪去混响新理论和新方法，同时建立了语音数据库验证了去噪去混响新方法的有效性和稳定性；最后，研究了基于深度学习的语音重构和去噪去混响方法。主要成果包括：1)搭建了双点激光语音侦听系统平台，可用于实验和实际应用；2) 建立了激光侦听系统大型数据库，包括语音库和噪声库；3)研究了适用于激光语音侦听系统的传统去噪去混响方法，并得到了应用；4)研究了适用于激光语音侦听系统的深度学习去噪去混响方法和语音重构方法，并得到了应用。本项目的理论研究和算法研究都在实验平台上获得了实验验证，同时发表了11篇杂志论文和6篇会议论文，其中SCI论文5篇，EI论文11篇；授权发明专利4项，还有多项专利已提交申请；算法研究成果应用于国家安全领域，得到了专家组的一致肯定。

项目成果

DOI：{{i.doi}}

发表时间：{{i.publish_year}}

暂无此项成果

数据更新时间：2023-05-31

其他相关文献

DOI：10.16796/j.cnki.1000-3770.2022.03.003

发表时间：2022

DOI：10.12202/j.0476-0301.2022178

发表时间：2022

DOI：

发表时间：2020

DOI：10.3788/CJL201946.0801003

发表时间：2019

DOI：

发表时间：2021

郑成诗的其他基金

批准号：61201403

批准年份：2012

资助金额：24.00

项目类别：青年科学基金项目

相似国自然基金

人类语音复杂系统的量化及理论研究

批准号：11505071

批准年份：2015

负责人：邓为炳

学科分类：A2503

资助金额：18.00

项目类别：青年科学基金项目

基于激光致热效应的水下声信息相干侦听技术研究

批准号：61905220

批准年份：2019

负责人：张烈山

学科分类：F0510

资助金额：24.00

项目类别：青年科学基金项目

具有同步误差方差约束的混沌系统多目标同步理论研究

批准号：60404005

批准年份：2004

负责人：卢俊国

学科分类：F0301

资助金额：24.00

项目类别：青年科学基金项目

基于同步的混沌动力系统重构技术研究

批准号：10602026

批准年份：2006

负责人：禹东川

学科分类：A0702

资助金额：23.00

项目类别：青年科学基金项目

多点同步激光侦听系统语音重构理论研究及实验研究

{{i.achievement_title}}

暂无此项成果

其他相关文献

EBPR工艺运行效果的主要影响因素及研究现状

复杂系统科学研究进展

奥希替尼治疗非小细胞肺癌患者的耐药机制研究进展

基于腔内级联变频的0.63μm波段多波长激光器

长链基因间非编码RNA 00681竞争性结合miR-16促进黑素瘤细胞侵袭和迁移

郑成诗的其他基金

多传感信息融合语音分离和虚拟声合成研究

相似国自然基金