Developing a Many-Body Energy Model for Protein Systems Using Bayesian Fields and Deep Learning Algorithms

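Basic Information
Grant number: 21903061
Project category: Young Scientists Fund
Funding amount: 24.00
Principal investigator: Zheng Zheng (郑铮)
Discipline classification:
Host institution: Wuhan University of Technology (武汉理工大学)
Year approved: 2019
Year concluded: 2022
Project period: 2020-01-01 - 2022-12-31
Project status: Completed
Project participants:
Keywords:
data mining; deep learning; molecular free energy simulation; Bayesian network; molecular many-body energy
Final Report Abstract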

A central task in computational chemistry is the accurate and efficient simulation of molecular free energy, which provides a theoretical basis and quantitative support for many fields, such as structure-based drug design and protein engineering. Achieving that goal requires reliable molecular potential energy models that balance computational accuracy against operational simplicity. Quantum mechanical (QM) methods achieve high prediction accuracy but are extremely expensive for complex biomolecular systems. Force fields have long been the most widely used potential energy models for molecular free energy simulation, owing to their well-trained parameter sets and easy-to-compute functional forms. However, pairwise potential functions based on single atoms or fragments have inherent difficulty simulating quantum-level phenomena caused by many-body effects, such as electric polarizability and charge transfer, which has brought force-field development to a bottleneck. Inspired by ideas from graph theory and machine learning, we propose a new protocol for generating molecular potential energy models using Bayesian Field Theory (BFT) and artificial neural network (ANN) based machine learning. The project aims to introduce an ANN-based potential model for protein systems and protein-ligand complex systems that includes multi-dimensional configurational effects in addition to atom pairwise potentials. We plan to use BFT to set up isolated short-range many-body systems centered on every atom under study within a molecule, so that the structural environment of each atom descriptor can be tracked across all molecules in the training database.
We then plan to use deep learning to train a multi-layer ANN as the biomolecular energy model against a large number of molecular single-point energies calculated with high-level QM methods, together with high-quality molecular global-minimum structures. The project will provide new insight into the benefit of using machine learning to simulate and interpret complicated configurational effects that molecular mechanics cannot describe, without requiring high-cost quantum-level computation. Finally, the project plans to embed the new ANN-based potential model in the commercialized “Movable Type” free energy method, initially developed by the applicant, to achieve both high speed and high accuracy in free energy simulation of biomolecular systems.
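
The atom-centered many-body idea in this abstract can be illustrated with a minimal sketch. It is not the project's BFT descriptor or its trained network, neither of which is specified here. As stand-ins, it assumes generic radial (two-body) and angular (three-body) features inside a cutoff sphere around each atom, feeds them to a small untrained network with random weights, and sums the per-atom outputs into a total energy. All function names, parameter values, and the water-like test geometry are hypothetical, and element types are ignored for brevity.

```python
import numpy as np

def cutoff_fn(r, r_cut):
    """Smooth cosine cutoff: 1 at r = 0, falling to 0 at r >= r_cut."""
    return np.where(r < r_cut, 0.5 * (np.cos(np.pi * r / r_cut) + 1.0), 0.0)

def atom_descriptors(coords, r_cut=4.0, centers=(1.0, 2.0, 3.0), width=0.5):
    """Per-atom features built only from neighbors inside r_cut (assumed form).

    Columns 0..len(centers)-1 hold two-body radial terms; the last column
    holds a three-body angular term summed over neighbor pairs (j, k).
    """
    n = len(coords)
    diff = coords[:, None, :] - coords[None, :, :]   # diff[i, j] = r_i - r_j
    dist = np.linalg.norm(diff, axis=-1)
    fc = cutoff_fn(dist, r_cut)
    np.fill_diagonal(fc, 0.0)                        # an atom is not its own neighbor
    feats = np.zeros((n, len(centers) + 1))
    for m, mu in enumerate(centers):
        gauss = np.exp(-((dist - mu) ** 2) / (2 * width**2))
        feats[:, m] = np.sum(gauss * fc, axis=1)
    for i in range(n):
        nbrs = np.nonzero(fc[i])[0]
        for a in range(len(nbrs)):
            for b in range(a + 1, len(nbrs)):
                j, k = nbrs[a], nbrs[b]
                # Cosine of the j-i-k angle at the central atom i.
                cos_t = diff[i, j] @ diff[i, k] / (dist[i, j] * dist[i, k])
                feats[i, -1] += (1.0 + cos_t) ** 2 * fc[i, j] * fc[i, k]
    return feats

def init_mlp(sizes, seed=0):
    """Random (untrained) weights for a small fully connected network."""
    rng = np.random.default_rng(seed)
    return [(rng.normal(0.0, 1.0 / np.sqrt(a), (a, b)), np.zeros(b))
            for a, b in zip(sizes[:-1], sizes[1:])]

def total_energy(coords, params):
    """Sum of per-atom network outputs; one shared network for all atoms."""
    x = atom_descriptors(coords)
    for w, b in params[:-1]:
        x = np.tanh(x @ w + b)
    w, b = params[-1]
    return float(np.sum(x @ w + b))

# Hypothetical water-like geometry in angstroms (illustration only).
coords = np.array([[0.0, 0.0, 0.0],
                   [0.9572, 0.0, 0.0],
                   [-0.2400, 0.9266, 0.0]])
params = init_mlp((4, 8, 1))
e0 = total_energy(coords, params)
print("untrained model energy (arbitrary units):", e0)
```

Because the features depend only on interatomic distances and angles and the per-atom contributions are summed, the output does not change under translation, rotation, or reordering of the atoms. In a trained model of this general kind, the weights would be fitted to reference QM single-point energies rather than drawn at random.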

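Molecular free energy simulation is an important research direction in computational chemistry and can provide a theoretical basis and methodological guidance for research in frontier fields such as drug discovery and protein engineering. Energy models that combine computational accuracy with efficiency are the cornerstone of free energy simulation. Quantum mechanical models are highly accurate, but their computational burden is too heavy for complex macromolecular systems. Molecular mechanics models are comparatively cheap to compute, but they lack an accurate description of the energies arising from many-body effects, which leads to large cumulative errors. Building on our recent research practice and results, this project proposes to study many-body energies within protein systems by combining Bayesian fields with deep learning, and to build a molecular energy model based on a multi-layer neural network. The Bayesian field can effectively transform the structural and energy data of complex molecules and reduce their dimensionality, generating the input variables required for machine learning. Training then proceeds layer by layer on the conformational energy levels of small organic molecules and the global optimum conformations of protein macromolecules, combining molecular structure and energy information to build a multi-layer network model. The mechanism of interatomic many-body energy is studied by analyzing the data structure of the model. Finally, the model is combined with the “Movable Type” (活字印刷) free energy algorithm invented by our research group to achieve free energy simulation of biological macromolecules with both high accuracy and high efficiency.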

Project Outcomes

No outcomes of this type are listed.

Data last updated: 2023-05-31

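Similar NSFC Grants

1

Research on Models, Theory, and Algorithms for Bayesian Deep Tensor Learning

Grant number: 61773129
Year approved: 2017
Principal investigator: Zhao Qibin (赵启斌)
Discipline classification: F0605
Funding amount: 16.00
Project category: General Program
2

Stereoscopic Image Quality Assessment Based on Bayesian Theory and Deep Learning

Grant number: 61906118
Year approved: 2019
Principal investigator: Ma Jian (马健)
Discipline classification: F0604
Funding amount: 24.00
Project category: Young Scientists Fund
3

Fine-Grained Simulation of Urban Development Boundaries Coupling Multi-Agent Systems and Deep Learning Algorithms

Grant number: 41871318
Year approved: 2018
Principal investigator: Zhang Honghui (张鸿辉)
Discipline classification: D0114
Funding amount: 57.50
Project category: General Program
4

Bayesian Network Prediction Methods Based on Multi-Source Transfer Learning and Their Applications

Grant number: 71801044
Year approved: 2018
Principal investigator: Bai Yun (白云)
Discipline classification: G0104
Funding amount: 18.00
Project category: Young Scientists Fund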