面向GPU的体系结构敏感型数值算法优化技术研究

基本信息

批准号：61202094

项目类别：青年科学基金项目

资助金额：24.00

负责人：张纪林

学科分类：

依托单位：杭州电子科技大学

批准年份：2012

结题年份：2015

起止时间：2013-01-01 - 2015-12-31

项目状态：已结题

项目参与者：任永坚,张伟,游新冬,张建惠,袁俊峰,陈恂,张剑亮,宣慧嘉,洪晓龙

关键词：

数值算法非规则计算性能模型GPU体系结构自动调优

结项摘要

GPU has been well recognized as the main platform for high performance computing along with the development of multi-core techniques. Its essential features in multilevel memory hierarchy and various thread management have distinguished GPU from any other previous computing platforms, such that the traditional optimization techniques cannot meet the high performance demand of arthitecture-sensitive numerical algorithms in real practices.To take full advantage of GPU's unique architecture, we must research the new optimization methods to make better the parallelism, locality and irregularity feature of numerical algorithms. In this work, we take quantitative measurement and analysis of various factors that affect the performance of arthitecture-sensitive numerical algorithms on GPU, and build the GPU performance model being described by these factors. Furthermore, targeting on the arthitecture-sensitive numerical algorithms, we breaking through traditional concept to study the autotuner technology to optimize the parallelism on multi-layers of the architecture in order to improve the locality of data access, thread load balancing and better data stream processing. We would like to study the unified optimization technique that serves both the regular and irregular computations, and guide the optimization of architecture-sensitive numerical algorithms with our GPU performance model. Our study can be applied to generate highly efficient codes for architecture-sensitive numerical algorithms, which are of significant importance both in scientific computing theory and real practices.

随着多核技术的不断发展，GPU已经成为高性能计算的主流平台，与以往相比，该系统的体系结构具有更多的存储层次和多样化的线程管理，传统的优化技术已经不能满足体系结构敏感型数值算法的性能要求。该类算法必须重构以实现深度挖掘自身的并行性、局部性和非规则计算特性,通过充分发挥GPU的体系结构优势,提高程序性能。为此，本课题面向GPU体系结构,通过定量的测试和分析影响体系结构敏感型数值算法执行效率的各种因素，形成GPU性能模型，刻画体系结构敏感性指标。在此基础上，研究体系结构敏感型数值算法的多层次优化方法及自动调优策略,改善访存局部性、线程间负载均衡、数据读写和流处理方式。研究规则计算和非规则计算统一的性能优化方法。并且将GPU性能模型用于指导体系结构敏感型数值算法的调优方法和策略。本项目研究成果可以很好地提高体系结构敏感型数值算法执行效率，具有重要的理论意义和应用价值。

项目摘要

随着多核技术的不断发展,GPU 已经成为高性能计算的主流平台,与以往相比该系统的体系结构具有更多的存储层次和多样化的线程管理,传统的优化技术已经不能满足体系结构敏感型数值算法的性能要求。该类算法必须重构以实现深度挖掘自身的并行性、局部性和非规则计算特性,通过充分发挥 GPU 的体系结构优势,提高程序性能。为此,本课题面向 GPU 体系结构,通过定量的测试和分析影响体系结构敏感型数值算法执行效率的各种因素形成 GPU 性能模型,刻画体系结构敏感性指标。在此基础上,研究体系结构敏感型数值算法的多层次优化方法及自动调优策略,改善访存局部性、线程间负载均衡、数据读写和流处理方式。研究规则计算和非规则计算统一的性能优化方法。并且将 GPU 性能模型用于指导体系结构敏感型数值算法的调优方法和策略。本项目研究成果可以很好地提高体系结构敏感型数值算法执行效率,具有重要的理论意义和应用价值。

项目成果

DOI：{{i.doi}}

发表时间：{{i.publish_year}}

暂无此项成果

数据更新时间：2023-05-31

其他相关文献

DOI：10.16368/j.issn.1674-8999.2018.12.569

发表时间：2018

DOI：

发表时间：2021

DOI：10.1051/jnwpu/20213920292

发表时间：2021

DOI：

发表时间：2020

DOI：10.13336/j.1003-6520.hve.20200528028

发表时间：2021

张纪林的其他基金

批准号：61672200

批准年份：2016

资助金额：65.00

项目类别：面上项目

相似国自然基金

面向GPU的实时系统时间分析与优化技术研究

批准号：61772123

批准年份：2017

负责人：吕鸣松

学科分类：F0202

资助金额：61.00

项目类别：面上项目

面向不规则GPU应用的分析与优化技术研究

批准号：61672048

批准年份：2016

负责人：梁云

学科分类：F0204

资助金额：63.00

项目类别：面上项目

面向数值优化的迁移演化算法及其应用

批准号：61573324

批准年份：2015

负责人：龚文引

学科分类：F0305

资助金额：66.00

项目类别：面上项目

搜索引擎在线算法的GPU优化关键技术研究

批准号：61070014

批准年份：2010

负责人：刘晓光

学科分类：F0204

资助金额：31.00

项目类别：面上项目

面向GPU的体系结构敏感型数值算法优化技术研究

{{i.achievement_title}}

暂无此项成果

其他相关文献

肥胖型少弱精子症的发病机制及中医调体防治

基于铁路客流分配的旅客列车开行方案调整方法

一种基于多层设计空间缩减策略的近似高维优化方法

奥希替尼治疗非小细胞肺癌患者的耐药机制研究进展

带有滑动摩擦摆支座的500 kV变压器地震响应

张纪林的其他基金

基于特性分析的大规模机器学习性能优化关键技术研究

相似国自然基金