Construction and Application of the Dynamic Visual Cognition Subspace

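Basic Information

Grant No.: 61876167

Project type: General Program

Funding amount: 64.00

Principal investigator: 张剑华

Discipline classification:

Host institution: Tianjin University of Technology (天津理工大学)

Approval year: 2018

Completion year: 2022

Project period: 2019-01-01 - 2022-12-31

Project status: Completed

Project participants: 张剑华, 刘盛, 王振华, 郭东岩, 刘儒瑜, 王其超, 吴佳鑫, 潘志颖, 王曾媛, 贵梦萍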

Keywords:

subspace learning; image parsing; topic model; semantic attributes; zero-shot learning

Completion Abstract

Dynamic visual category hierarchies play a vital role in human cognition. Simulating such hierarchies computationally can also improve the performance of many computer vision methods. The goal of this project is to construct a “visual cognition subspace”, and from it a dynamic category hierarchy, by combining dynamic cognition with subspace theory. To this end, we will carry out the following studies. First, drawing on sparse representation, subspace clustering, and multi-label learning, we will propose a novel real-valued representation of semantic attributes and a model that learns and predicts multiple attributes simultaneously. Second, describing objects by real-valued semantic attributes, we will propose a novel distance-dependent supervised hierarchical topic model that uses a nonparametric probabilistic model, continuous probability distributions, and reconfigured dependency relationships. This topic model yields the dynamic category hierarchy; at the same time, we will construct probability subspaces in the semantic attribute space according to the category hierarchy, forming a novel representation of the dynamic “visual cognition subspace”. Finally, we will apply the dynamic “visual cognition subspace” to improve the accuracy of object recognition and discovery. Using the probability subspaces, we will also relate the probability distributions of semantic attributes of seen and unseen categories, study how to extract the patterns shared among categories and the patterns distinctive to each, and thereby improve zero-shot learning performance.

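Dynamic visual category hierarchies play an important role in human cognition. Likewise, having a computer simulate the human cognitive process to build a dynamic visual category hierarchy and construct a dynamic “visual cognition subspace” can greatly improve the performance of many computer vision methods. Starting from the perspective of dynamic cognition and drawing on subspace theory, this project constructs a dynamic “visual cognition subspace” representation. The main research content is as follows. Drawing on sparse representation, subspace clustering, and multi-label multi-sample learning, the project studies representations of large-scale real-valued semantic attributes and models that learn and predict multiple semantic attributes. Describing objects by semantic attributes, it introduces nonparametric probabilistic models, models with continuous probability distributions, and reconstructs node dependencies in order to study a new supervised hierarchical topic model based on distance dependence and continuous distributions. This model is used to build a dynamic hierarchical category structure which, represented in a novel way by probability subspaces of semantic attributes, forms the dynamic “visual cognition subspace”. Finally, the dynamic “visual cognition subspace” is applied to improve the accuracy of object recognition and discovery; links are established between the probability distributions of semantic attributes of seen and unseen categories, and the extraction of shared and category-specific patterns is studied, so as to effectively improve zero-shot learning performance.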

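Project Abstract

Dynamic visual category hierarchies play a very important role in many vision tasks. This project investigated how to construct category hierarchies effectively and how to apply them. It completed research on nonparametric probabilistic topic models; on semantic attribute representation; on semi-/weakly supervised deep learning for semantic segmentation with zero or few samples, based on semantic features; on 3D environment perception based on multimodal information fusion; and on 3D environment perception based on multi-terminal collaboration. The project proposed a new semantic attribute learning method and a new hierarchical representation of semantic objects based on a hierarchical nonparametric probabilistic model. Building on semantic attributes and the hierarchical structure representation, it realized weakly supervised and semi-supervised semantic segmentation methods, a semantic SLAM system, and an environment perception system with multimodal fusion and multi-terminal collaboration. The results advance the field's understanding of dynamic visual category hierarchies and of methods for applying them to environment perception, and provide a theoretical basis for intelligent mobile terminals to explore complex environments autonomously.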

Project Outputs

No outputs of this type are available.

Data last updated: 2023-05-31

Other Related Literature

DOI: 10.3778/j.issn.1002-8331.1911-0012

Publication year: 2020

DOI: 10.1051/jnwpu/20213920292

Publication year: 2021

DOI: 10.6041/j.issn.1000-1298.2022.07.022

Publication year: 2022

DOI:

Publication year: 2019

DOI: 10.16383/j.aas.c180673

Publication year: 2021

Other Grants Held by 张剑华

Grant No.: 61305021

Approval year: 2013

Funding amount: 23.00

Project type: Young Scientists Fund

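Similar NSFC Grants

Research on the Dynamic Cognitive Matching Mechanism and Visual-Space Distortion-Field Compensation Theory in 3D Micro-Vision Measurement

Grant No.: 60605013

Approval year: 2006

Principal investigator: 刘盛

Discipline classification: F0604

Funding amount: 23.00

Project type: Young Scientists Fund

Research on NAM-Based Methods for Cognitive Understanding of Dynamic Visual Information

Grant No.: 60973085

Approval year: 2009

Principal investigator: 陈传波

Discipline classification: F0210

Funding amount: 29.00

Project type: General Program

Contextual Video Semantics Capture Based on the Dynamic Cognitive Characteristics of Vision

Grant No.: 61071180

Approval year: 2010

Principal investigator: 姚鸿勋

Discipline classification: F0116

Funding amount: 34.00

Project type: General Program

Visual Cognition Mechanism of Safety Signs in Underground Cavern Groups for Gestalt Space

Grant No.: 51878385

Approval year: 2018

Principal investigator: 郑霞忠

Discipline classification: E0806

Funding amount: 60.00

Project type: General Program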

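Other Related Literature

Left Ventricle Image Segmentation Algorithm for Weak Edge Information

An Approximate High-Dimensional Optimization Method Based on a Multi-Level Design Space Reduction Strategy

River Recognition Method for Remote Sensing Images of Cold and Arid Regions Based on Improved LinkNet

Differences in Sense of Place in Historic Districts from the Subject's Perspective: A Case Study of Nanluoguxiang, Beijing

Simultaneous Fault Detection and Control for Two-Dimensional FM Systems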

Other Grants Held by 张剑华

Category-Independent Object Detection for Exploration of Unknown Environments
