
A tensor intermediate representation for machine learning systems
Abstract: With the wide deployment of various machine learning algorithms, highly energy-efficient customized machine learning systems have gained popularity. The key to deploying such systems efficiently lies in their programming and compilation environments. The intermediate representation is the core of these environments, connecting high-level programming languages with low-level instruction set architectures. Current state-of-the-art intermediate representations are oriented either to high-level algorithms or to classical processors based on scalar processing, and therefore cannot be applied effectively to tensor-based machine learning systems. To address this problem, we propose a tensor intermediate representation for machine learning systems to improve programming productivity and performance. Concretely, we define a series of tensor types, tensor operations, and tensor memories, and optimize tensor processing based on these definitions. To validate our proposal, we extend the low-level scalar intermediate representation of TVM with the proposed tensor intermediate representation and perform experiments with Tensor Core on a typical machine learning system. Experimental results show that we explore optimizations not discovered by the original intermediate representation, achieving a 1.62× to 2.85× performance improvement; the tensor intermediate representation also improves programming efficiency by 5.46× on average.
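The page carries no code, but as a rough illustration of the scalar baseline the paper extends, the sketch below (assuming TVM's Python tensor expression API, tvm.te; the operator, names, and tile sizes are illustrative, not taken from the paper) declares a small half-precision matrix multiply and prints the per-element scalar loop nest that TVM's default lowering produces. The proposed tensor intermediate representation would instead expose such a region through tensor types and tensor operations that can be mapped directly onto units such as Tensor Core, rather than through scalar loops.

    # A minimal sketch, not from the paper: assumes TVM's Python tensor
    # expression API (tvm.te) and only shows the scalar loop-nest IR that
    # a default TVM schedule lowers a small matrix multiply to.
    import tvm
    from tvm import te

    N, M, K = 16, 16, 16  # tile sizes comparable to a Tensor Core fragment

    # Declare the computation as a tensor expression.
    A = te.placeholder((N, K), name="A", dtype="float16")
    B = te.placeholder((K, M), name="B", dtype="float16")
    k = te.reduce_axis((0, K), name="k")
    C = te.compute(
        (N, M),
        lambda i, j: te.sum(
            A[i, k].astype("float32") * B[k, j].astype("float32"), axis=k
        ),
        name="C",
    )

    # Default schedule: lowering yields nested scalar loops over i, j, k,
    # with no first-class notion of a tensor operation.
    s = te.create_schedule(C.op)
    print(tvm.lower(s, [A, B, C], simple_mode=True))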
Authors: Yimin ZHUANG, Yuanbo WEN, Wei LI, Qi GUO (State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China; School of Computer Science and Technology, University of Science and Technology of China, Hefei 230026, China; Cambricon Technologies Corporation Limited, Shanghai 201308, China)
Source: Scientia Sinica Informationis (《中国科学:信息科学》), 2022, No. 6, pp. 1040-1052 (13 pages). Indexed in CSCD and the Peking University core journals list.
Funding: National Natural Science Foundation of China (Grant No. 61925208), Beijing Natural Science Foundation (Grant No. JQ18013), Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDB32050200), CAS Project for Young Scientists in Basic Research (Grant No. YSBR-029), and the Youth Innovation Promotion Association of the Chinese Academy of Sciences.
Keywords: machine learning systems, programming and compiling, tensor processing, intermediate representation, programming efficiency