语音识别可信测度和拒识模型新方法研究

基本信息

批准号：69975007

项目类别：面上项目

资助金额：12.00

负责人：刘加

学科分类：

依托单位：清华大学

批准年份：1999

结题年份：2002

起止时间：2000-01-01 - 2002-12-31

项目状态：已结题

项目参与者：肖熙,郝杰,张昊天,刘镜,汪鹏,单翼翔

关键词：

说话人自适应语音识别可信测度和拒识

结项摘要

This project is a basic research in the applications of speech recognition technology. Confidence measures and rejection models have become important parts in speech recognition systems. In this project, a novel garbage model based on Gaussian mixtures is described; a new utterance verification method based on the on-line garbage model is proposed; the different training methods of the filler model, the competition model, the anti-model and the impostor model are presented; the new algorithms of the supervised and unsupervised speaker adaptation combined with the confidence measures are developed; an method of the confidence estimation of posterior probability based on Multi-Layer Perceptrons (MLP) is proposed; the method of the hierachical averaging and normalized score of the confidence measure based on Chinese pronunciation characteristic are described; a new kind of Viterbi beam searching and pruning algorithm based the confidence measure is proposed; an integrated model based on the multiple confidence information sources is described. Some valuable results are obtained and used for 863 High-Tech project and international cooperation projects. The system robustness and the performance of rejecting noises are improved by using confidence measures in the practical speech recognition systems. . Thirty papers have been published, and one patent applied. One PhD student and 7 MSc have graduated.

语音识别可信测度和拒识模型是口语对话和命令控制系统的关键技术之一。本申请从可信测度估值方法、不同层次结构上可信测度和拒识模型构成及其规一化方法、结合可信测度的说话人自适应方法、可信测度和拒识模型评估方法等方面入手，进行创新性研究，结合汉语特点提出一个完整的新型可信测度和拒识模型算法。该研究具有重要的理论意义和实用价值。

项目摘要

项目成果

DOI：{{i.doi}}

发表时间：{{i.publish_year}}

暂无此项成果

数据更新时间：2023-05-31

其他相关文献

DOI：10.16383/j.aas.2016.c150880

发表时间：2016

DOI：10.11821/dlyj201810008

发表时间：2018

DOI：10.3969/j.issn.1003-0077.2018.11.009

发表时间：2018

DOI：10.3724/sp.j.1089.2022.19009

发表时间：2022

DOI：10.19783/j.cnki.pspc.200521

发表时间：2021

刘加的其他基金

批准号：61273268

批准年份：2012

资助金额：83.00

项目类别：面上项目

批准号：60272016

批准年份：2002

资助金额：24.00

项目类别：面上项目

批准号：69772020

批准年份：1997

资助金额：10.00

项目类别：面上项目

批准号：60572083

批准年份：2005

资助金额：23.00

项目类别：面上项目

批准号：60776800

批准年份：2007

资助金额：28.00

项目类别：联合基金项目

相似国自然基金

基于听觉感知模型的说话人识别和语音语种识别新方法研究

批准号：60572083

批准年份：2005

负责人：刘加

学科分类：F0111

资助金额：23.00

项目类别：面上项目

基于非平稳测度与置信权的动态选择语音识别模型

批准号：10571103

批准年份：2005

负责人：葛余博

学科分类：A0403

资助金额：28.00

项目类别：面上项目

多语言语音识别声学建模理论和容错识别新方法研究

批准号：61273268

批准年份：2012

负责人：刘加

学科分类：F0605

资助金额：83.00

项目类别：面上项目

稳健（抗噪）语音识别新方法研究

批准号：69772020

批准年份：1997

负责人：刘加

学科分类：F0111

资助金额：10.00

项目类别：面上项目

语音识别可信测度和拒识模型新方法研究

{{i.achievement_title}}

暂无此项成果

其他相关文献

基于SSVEP 直接脑控机器人方向和速度研究

居住环境多维剥夺的地理识别及类型划分——以郑州主城区为例

基于细粒度词表示的命名实体识别研究

基于协同表示的图嵌入鉴别分析在人脸识别中的应用

适用于带中段并联电抗器的电缆线路的参数识别纵联保护新原理

刘加的其他基金

多语言语音识别声学建模理论和容错识别新方法研究

高鉴别特性的汉语非特定人连续语音识别声学模型研究

稳健（抗噪）语音识别新方法研究

基于听觉感知模型的说话人识别和语音语种识别新方法研究

基于内容的跨语言语音检索方法研究

相似国自然基金