甘霖
副教授
单位: 清华丘成桐数学科学中心 , 北京雁栖湖应用数学研究院
团队: 人工智能和机器学习
邮箱: lingan@tsinghua.edu.cn
研究方向: 并行计算,高性能计算应用,AI for Science
教育经历
- 2011 - 2016 清华大学计算机系 Doctor
- 2007 - 2011 北京邮电大学 Bachelor
工作经历
- 2025 - BIMSA Associate Professor
- 2025 - 清华大学丘成桐数学科学中心 Tenured Associate Professor
- 2019 - 2025 清华大学计算机系 Assistant Professor, Associate Professor
- 2016 - 2019 清华大学计算机系 Postdoc
荣誉与奖项
- 2024 算力中国·最佳学术论文
- 2023 中国青年五四奖章
- 2023 英特尔中国学术英才计划
- 2023 算力中国·青年先锋人物
- 2021 中国电子学会科技进步一等奖
- 2018 IEEE高性能计算专委会杰出新人奖
- 2017 戈登·贝尔提名奖
- 2016 戈登·贝尔奖
- 2015 25年最具影响力论文奖
出版物
- [1] Y Liu, Y Chen, C Guo, J Song, X Shi, L Gan, W Wu, W Wu, H Fu, X Liu et al., Verifying quantum advantage experiments with multiple amplitude tensor network contraction, Physical Review Letters, 132(3) (2024)
- [2] J Guo, Y Lai, J Zhang, J Zheng, H Fu, L Gan, L Hu, G Xu, X Che, CDA: A Universal Domain Adaptation Method for Scene Classification From Remote Sensing Imagery, IEEE Geoscience and Remote Sensing Letters (2024)
- [3] J Xu, J Fu, L Gan, Y Chen, Z Sun, Z Huang, G Yang, Leveraging the Hardware Resources to Accelerate cryo-EM Reconstruction of RELION on the New Sunway Supercomputer, ACM Transactions on Architecture and Code Optimization (2024)
- [4] M Yuan, Q Liu, L Gan, G Yang, ESFLOW: Mapping Large-Scale Earthquake Simulation to Spatial Computing Systems, IEEE International Symposium on Circuits and Systems (ISCAS), 1-5 (2024)
- [5] Q Deng, Q Liu, M Yuan, X Duan, L Gan, J Yang, W Zhao, Z Zhang, G Wu et al., Acceleration of Multi-body Molecular Dynamics with Customized Parallel Dataflow, IEEE Transactions on Parallel and Distributed Systems (2024)
- [6] M Li, C Liu, J Liao, X Zheng, H Yang, R Sun, J Xu, L Gan, G Yang, Z Luan et al., Towards optimized tensor code generation for deep learning on sunway many-core processor, Frontiers of Computer Science, 18(2) (2024)
- [7] Z Zhang, Z Wang, Y Guo, W Wang, Z Sun, W Wan, L Gan, R Han, Y Wang, A Low Overhead Heterogeneous Parallel Optimization Method Based on Three-Dimensional Elastic Wave Numerical Simulation, IEEE Transactions on Geoscience and Remote Sensing (2024)
- [8] J Guo, J Zheng, Y Xu, H Fu, W Xue, L Wang, L Gan, P Gao, W Wan, X Wu et al., LB-SCAM: A learning-based method for efficient large-scale sensitivity analysis and tuning of the Single Column Atmosphere Model (SCAM), Geoscientific Model Development, 17(9), 3975-3992 (2024)
- [9] Z Song, L Gan, S Xiang, Y Wang, X Duan, G Yang, Enabling High-Performance Physical Based Rendering on New Sunway Supercomputer, 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS … (2024)
- [10] H Lin, L Yan, Q Chang, H Lu, C Li, Q He, Z Song, X Duan, Z Yin, Y Li et al., O2ath: an OpenMP offloading toolkit for the sunway heterogeneous manycore platform, CCF Transactions on High Performance Computing, 6(3), 274-286 (2024)
- [11] Y Chen, Y Liu, X Shi, J Song, X Liu, L Gan, C Guo, H Fu, J Gao, D Chen et al., Lifetime-based optimization for simulating quantum circuits on a new sunway supercomputer, ACM SIGPLAN Annual Symposium (2023)
- [12] X Duan, Q Shao, J Weng, B Schmidt, L Gan, G Li, H Fu, W Xue, W Liu et al., Bio-esmd: A data centric implementation for large-scale biological system simulation on sunway taihulight supercomputer, IEEE Transactions on Parallel and Distributed Systems, 34(3), 881-893 (2023)
- [13] P Gao, X Duan, B Schmidt, W Wan, J Guo, W Zhang, L Gan, H Fu, W Xue et al., Redesign and accelerate the airebo bond-order potential on the new sunway supercomputer, IEEE Transactions on Parallel and Distributed Systems, 34(12), 3117-3132 (2023)
- [14] J Guo, Y Xu, H Fu, W Xue, L Gan, M Tan, T Wu, Y Shen, X Wu, L Hu et al., GEO-WMS: an improved approach to geoscientific workflow management system on HPC, CCF Transactions on High Performance Computing, 5(4), 360-373 (2023)
- [15] R Dong, L Zhang, W Li, S Yuan, L Gan, J Zheng, H Fu, L Mou, XX Zhu, An adaptive image fusion method for sentinel-2 images and high-resolution images with long-time intervals, International Journal of Applied Earth Observation and Geoinformation, 121 (2023)
- [16] W Wan, L Gan, W Wang, Z Yin, H Tian, Z Zhang, Y Wang, M Hua, X Liu et al., 7-pflops extreme scale earthquake simulation with crossing multi-faults and topography on Sunway, Proceedings of the International Conference for High Performance Computing (2023)
- [17] YH Deng, YC Gu, HL Liu, SQ Gong, H Su, ZJ Zhang, HY Tang, MH Jia et al., Gaussian boson sampling with pseudo-photon-number-resolving detectors and quantum computational advantage, Physical review letters, 131(15) (2023)
- [18] X Duan, J Wang, P Gao, M Ma, L Gan, X Liu, H Fu, W Xue, D Chen et al., Enabling Real World Scale Structural Superlubricity All-Atom Simulation on the Next-Generation Sunway Supercomputer, Proceedings of the International Conference for High Performance Computing (2023)
- [19] M Yuan, Q Liu, Q Deng, S Xiang, L Gan, J Yang, X Duan, H Fu, G Yang, FPGA-accelerated tersoff multi-body potential for molecular dynamics simulations, International Symposium on Applied Reconfigurable Computing (2022)
- [20] J Xu, J Fu, L Gan, Y Chen, Z Huang, G Yang, Accelerating cryo-em reconstruction of relion on the new Sunway supercomputer, 2022 IEEE Intl Conf on Parallel & Distributed Processing with Applications … (2022)
- [21] Y Li, X Duan, L Gan, W Wan, Y Chen, K Xu, J Yang, W Liu, W Xue, H Fu et al., Enabling large-scale simulation of cam on the sunway taihulight supercomputer, IEEE Transactions on Computers, 71(4), 824-837 (2021)
- [22] Y Li, L Gan, M Chen, Y Chen, H Lu, C Lu, J Pan, H Fu, G Yang, Benchmarking 50-photon gaussian boson sampling on the sunway TaihuLight, IEEE Transactions on Parallel and Distributed Systems, 33(6), 1357-1372 (2021)
- [23] Q Sun, Y Liu, H Yang, M Dun, Z Luan, L Gan, G Yang, D Qian, Input-aware sparse tensor storage format selection for optimizing MTTKRP, IEEE Transactions on Computers, 71(8), 1968-1981 (2021)
- [24] L Gan, H Fu, G Yang, Translating novel HPC techniques into efficient geoscience solutions, Journal of Computational Science, 52 (2021)
- [25] P Gao, X Duan, B Schmidt, W Zhang, L Gan, H Fu, W Xue, W Liu, G Yang, Optimization of reactive force field simulation: Refactor, parallelization, and vectorization for interactions, IEEE Transactions on Parallel and Distributed Systems, 33(2), 359-373 (2021)
- [26] M Dun, Y Li, Q Sun, H Yang, W Li, Z Luan, L Gan, G Yang, D Qian, Towards efficient canonical polyadic decomposition on sunway many-core processor, Information Sciences, 549, 221-248 (2021)
- [27] Q Han, H Yang, M Dun, Z Luan, L Gan, G Yang, D Qian, Towards efficient tile low-rank GEMM computation on sunway many-core processors, The Journal of Supercomputing, 77(5), 4533-4564 (2021)
- [28] R Dong, W Fang, H Fu, L Gan, J Wang, P Gong, High-resolution land cover mapping through learning with noise correction, IEEE Transactions on Geoscience and Remote Sensing, 60, 1-13 (2021)
- [29] J Lai, L Gan, L Wang, Mixed-precision Methods to Reconstruct Numerical Ocean Simulations, IEEE Intl Conf on Parallel & Distributed Processing with Applications (2021)
- [30] B Chen, M Li, H Yang, Z Luan, L Gan, G Yang, D Qian, swRodinia: A Benchmark Suite for Exploiting Architecture Properties of Sunway Processor, Benchmarking, Measuring, and Optimizing: Third BenchCouncil International … (2021)
- [31] HS Zhong, YH Deng, J Qin, H Wang, MC Chen, LC Peng, YH Luo, D Wu et al., Phase-programmable gaussian boson sampling using stimulated squeezed light, Physical review letters, 127(18) (2021)
- [32] X Duan, P Gao, M Zhang, T Zhang, H Meng, Y Li, B Schmidt, H Fu, L Gan et al., Cell-list based molecular dynamics on many-core processors: A case study on sunway TaihuLight supercomputer, International Conference for High Performance Computing, Networking … (2020)
- [33] HS Zhong, H Wang, YH Deng, MC Chen, LC Peng, YH Luo, J Qin, D Wu et al., Quantum computational advantage using photons, Science, 370(6523), 1460-1463 (2020)
- [34] X Duan, M Zhang, W Liu, H Fu, L Gan, W Xue, G Yang, Tuning a general purpose software cache library for TaihuLight’s SW26010 processor, CCF Transactions on High Performance Computing, 2, 164-182 (2020)
- [35] L Gan, M Yuan, J Yang, W Zhao, W Luk, G Yang, High performance reconfigurable computing for numerical simulation and deep learning, CCF Transactions on High Performance Computing, 2, 196-208 (2020)
- [36] L Li, J Fang, J Jiang, L Gan, W Zheng, H Fu, G Yang, Efficient AES implementation on Sunway TaihuLight supercomputer: A systematic approach, Journal of Parallel and Distributed Computing, 138, 178-189 (2020)
- [37] M Li, Y Liu, X Liu, Q Sun, X You, H Yang, Z Luan, L Gan, G Yang, D Qian, The deep learning compiler: A comprehensive survey, IEEE Transactions on Parallel and Distributed Systems, 32(3), 708-727 (2020)
- [38] S Zhang, H Fu, L Wu, Y Li, H Wang, Y Zeng, X Duan, W Wan, L Wang et al., Optimizing high-resolution Community Earth System Model on a heterogeneous many-core supercomputing platform, Geoscientific Model Development, 13(10), 4809-4829 (2020)
- [39] R Dong, W Li, H Fu, L Gan, L Yu, J Zheng, M Xia, Oil palm plantation mapping from high-resolution remote sensing images using deep learning, International Journal of Remote Sensing, 41(5), 2022-2046 (2020)
- [40] MC Chen, R Li, L Gan, X Zhu, G Yang, CY Lu, JW Pan, Quantum-teleportation-inspired algorithm for sampling large random quantum circuits, Physical review letters, 124(8) (2020)
- [41] R Dong, C Li, H Fu, J Wang, W Li, Y Yao, L Gan, L Yu, P Gong, Improving 3-m resolution land cover mapping through efficient learning from an imperfect 10-m resolution map, Remote Sensing, 12(9) (2020)
- [42] Q Sun, Y Liu, M Dun, H Yang, Z Luan, L Gan, G Yang, D Qian, Sptfs: Sparse tensor format selection for mttkrp via deep learning, International Conference for High Performance Computing, Networking (2020)
- [43] P Gao, X Duan, T Zhang, M Zhang, B Schmidt, X Zhang, H Sun, W Zhang et al., Millimeter-scale and billion-atom reactive force field simulation on sunway taihulight, IEEE Transactions on Parallel and Distributed Systems, 31(12), 2954-2967 (2020)
- [44] Y Hu, H Yang, Z Luan, L Gan, G Yang, D Qian, Massively scaling seismic processing on sunway taihulight supercomputer, IEEE Transactions on Parallel and Distributed Systems, 31(5), 1194-1208 (2019)
- [45] O Li, W Zhao, X Huang, Y Chen, L Gan, H Yu, J Zhang, Y Liu, H Fu et al., Scaling the Training of Recurrent Neural Networks on Sunway TaihuLight Supercomputer, Computational Science–ICCS 2019 (2019)
- [46] X Zhong, H Yang, Z Luan, L Gan, G Yang, D Qian, : accelerating tensor decomposition on Sunway architecture, CCF Transactions on High Performance Computing, 1(3), 161-176 (2019)
- [47] W Gao, J Fang, W Zhao, J Yang, L Wang, L Gan, H Fu, G Yang, swatop: Automatically optimizing deep learning operators on sw26010 many-core processor, Proceedings of the 48th International Conference on Parallel Processing, 1-10 (2019)
- [48] T Zhang, Y Li, P Gao, Q Shao, M Shao, M Zhang, J Zhang, X Duan, Z Liu et al., SW_GROMACS: accelerate GROMACS on Sunway TaihuLight, Proceedings of the International Conference for High Performance Computing (2019)
- [49] L Gan, J Xu, X Wang, S Wu, X Duan, Y Li, H Fu, G Yang, Million-core-scalable simulation of the elastic migration algorithm on Sunway TaihuLight supercomputer, 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid (2019)
- [50] C Liu, H Yang, R Sun, Z Luan, D Qian, swTVM: exploring the automated compilation for deep learning on sunway architecture, arXiv preprint arXiv:1904.07404 (2019)
- [51] M Li, Y Liu, H Yang, Z Luan, L Gan, G Yang, D Qian, Accelerating sparse cholesky factorization on sunway manycore architecture, IEEE Transactions on Parallel and Distributed Systems, 31(7), 1636-1650 (2019)
- [52] J Xu, H Fu, W Luk, L Gan, W Shi, W Xue, C Yang, Y Jiang, C He, G Yang, Optimizing finite volume method solvers on Nvidia GPUs, IEEE Transactions on Parallel and Distributed Systems, 30(12), 2790-2805 (2019)
- [53] W Zhao, H Fu, J Fang, W Zheng, L Gan, G Yang, Optimizing convolutional neural networks on the sunway taihulight supercomputer, ACM Transactions on Architecture and Code Optimization, 15(1), 1-26 (2018)
- [54] J Xu, H Fu, W Shi, L Gan, Y Li, W Luk, G Yang, Performance tuning and analysis for stencil-based applications on POWER8 processor, ACM Transactions on Architecture and Code Optimization, 15(4), 1-25 (2018)
- [55] X Duan, P Gao, T Zhang, M Zhang, W Liu, W Zhang, W Xue, H Fu, L Gan et al., Redesigning LAMMPS for peta-scale and hundred-billion-atom simulation on Sunway TaihuLight, International conference for high performance computing, networking (2018)
- [56] B Chen, H Fu, Y Wei, C He, W Zhang, Y Li, W Wan, W Zhang, L Gan et al., Simulating the Wenchuan earthquake with accurate surface topography on Sunway TaihuLight, International Conference for High Performance Computing, Networking … (2018)
- [57] M Hu, H Yu, K Gu, Z Wang, H Ruan, K Wang, S Ren, B Li, L Gan, S Xu et al., A particle-filter framework for robust cryo-EM 3D reconstruction, Nature methods, 15(12), 1083-1089 (2018)
- [58] X Wang, L Gan, J Xu, J Yang, M Xia, H Fu, X Huang, G Yang, PLZMA: a parallel data compression method for cloud computing, Algorithms and Architectures for Parallel Processing: 18th International … (2018)
- [59] X Wang, P Xu, W Xue, Y Ao, C Yang, H Fu, L Gan, G Yang, W Zheng, A fast sparse triangular solver for structured-grid problems on sunway many-core processor SW26010, International Conference on Parallel Processing, 1-11 (2018)
- [60] L Gan, H Fu, O Mencer, W Luk, G Yang, Data flow computing in geoscience applications, Advances in Computers, 104, 125-158 (2017)
- [61] H Fu, J Liao, N Ding, X Duan, L Gan, Y Liang, X Wang, J Yang, Y Zheng et al., Redesigning CAM-SE for peta-scale climate modeling performance and ultra-high resolution on Sunway TaihuLight, Proceedings of the international conference for high performance computing (2017)
- [62] P Gong, J Wang, C Li, L Ji, H Huang, N Clinton, Y Cheng, W Li, M Zhang et al., Automated Global Land Cover Mapping–FROM-GLC Version 2: The Production of the 30 M Circa 2015 Global Land Cover Map, Tsinghua University: Beijing, China (2017)
- [63] L Gan, H Fu, W Luk, C Yang, W Xue, G Yang, Solving mesoscale atmospheric dynamics using a reconfigurable dataflow architecture, IEEE Micro, 37(4), 40-50 (2017)
- [64] L Li, J Fang, J Jiang, L Gan, W Zheng, H Fu, G Yang, SW-AES: accelerating AES algorithm on the sunway taihulight, IEEE International Symposium on Parallel and Distributed Processing (2017)
- [65] Y Ao, C Yang, X Wang, W Xue, H Fu, F Liu, L Gan, P Xu, W Ma, 26 pflops stencil computations for atmospheric modeling on sunway taihulight, IEEE International Parallel and Distributed Processing Symposium (2017)
- [66] H Fu, L Gan, C Yang, W Xue, L Wang, X Wang, X Huang, G Yang, Solving global shallow water equations on heterogeneous supercomputers, PloS One, 12(3) (2017)
- [67] H Fu, J Xu, L Gan, C Yang, W Xue, W Zhao, W Shi, X Wang, G Yang, Unleashing the performance potential of CPU-GPU platforms for the 3D atmospheric Euler solver, IEEE 27th International Conference on Application-specific Systems (2016)
- [68] C Yang, W Xue, H Fu, H You, X Wang, Y Ao, F Liu, L Gan, P Xu, L Wang et al., 10M-core scalable fully-implicit solver for nonhydrostatic atmospheric dynamics, Proceedings of the International Conference for High Performance Computing, SC'16 (2016)
- [69] H Fu, L Gan, R Clapp, G Alves, E Biondi, G Yang, B Biondi, GPU Accelerations on the 3D Elastic RTM Method, EAGE Conference and Exhibition, 78, 1-5 (2016)
- [70] J Xu, H Fu, L Gan, C Yang, W Xue, G Yang, Accelerating the 3D euler atmospheric solver through heterogeneous CPU-GPU platforms, ACM International Conference on Computing Frontiers, 353-356 (2016)
- [71] J Xu, H Fu, L Gan, Y Song, H Peng, W Shi, G Yang, Performance optimization of Jacobi stencil algorithms based on POWER8 architecture, IEEE 27th International Conference on Application-specific Systems (2016)
- [72] J Xu, H Fu, L Gan, Y Song, H Peng, W Shi, G Yang, Evaluating the POWER8 architecture through optimizing stencil-based algorithms, IEEE Trustcom/BigDataSE/ISPA, 1374-1381 (2016)
- [73] J Xu, H Fu, L Gan, C Yang, W Xue, S Xu, W Zhao, X Wang, B Chen et al., Generalized GPU acceleration for applications employing finite-volume methods, 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid (2016)
- [74] J Fang, H Fu, H Zhang, W Wu, N Dai, L Gan, G Yang, Optimizing complex spatially-variant coefficient stencils for seismic modeling on GPU, IEEE 21st International Conference on Parallel and Distributed Systems (2015)
- [75] B Liu, H Fu, L Gan, W Zhao, G Yang, Optimizing Residue Number Reverse Converters through Bitwise Arithmetic on FPGAs, IEEE 23rd Annual International Symposium on Field-Programmable Custom (2015)
- [76] L Gan, H Fu, W Luk, C Yang, W Xue, X Huang, Y Zhang, G Yang, Solving the global atmospheric equations through heterogeneous reconfigurable platforms, ACM Transactions on Reconfigurable Technology and Systems (TRETS), 8(2), 1-16 (2015)
- [77] W Xue, C Yang, H Fu, X Wang, Y Xu, J Liao, L Gan, Y Lu, R Ranjan et al., Ultra-scalable CPU-MIC acceleration of mesoscale atmospheric modeling on Tianhe-2, IEEE Transactions on Computers, 64(8), 2382-2393 (2014)
- [78] L Gan, H Fu, W Xue, Y Xu, C Yang, X Wang, Z Lv, Y You, G Yang, K Ou, Scaling and analyzing the stencil performance on multi-core and many-core architectures, 20th IEEE International Conference on Parallel and Distributed Systems (2014)
- [79] L Gan, H Fu, C Yang, W Luk, W Xue, O Mencer, X Huang, G Yang, A highly-efficient and green data flow engine for solving euler atmospheric equations, 24th International Conference on Field Programmable Logic and (2014)
- [80] W Xue, C Yang, H Fu, X Wang, Y Xu, L Gan, Y Lu, X Zhu, Enabling and scaling a global shallow-water atmospheric model on Tianhe-2, IEEE 28th International Parallel and Distributed Processing Symposium (2014)
- [81] Y You, H Fu, S Song, M Mehri Dehanavi, L Gan, X Huang, G Yang, Evaluating Multi-core Architectures through Accelerating the Three-Dimensional Lax–Wendroff Correction, International Journal of High Performance Computing Applications, 28(3) (2014)
- [82] Y You, H Fu, SL Song, MM Dehnavi, L Gan, X Huang, G Yang, Evaluating multi-core and many-core architectures through accelerating the three-dimensional Lax–Wendroff correction stencil, The International journal of high performance computing applications, 28(3) (2014)
- [83] L Gan, H Fu, W Luk, C Yang, W Xue, G Yang, Global Atmospheric Simulation on a Reconfigurable Platform, IEEE 21st Annual International Symposium on Field-Programmable Custom Integrated Circuits (2013)
- [84] Y You, H Fu, X Huang, G Song, L Gan, W Yu, G Yang, Accelerating the 3D elastic wave forward modeling on GPU and MIC, IEEE International Symposium on Parallel & Distributed Processing (2013)
- [85] C Yang, W Xue, H Fu, L Gan, L Li, Y Xu, Y Lu, J Sun, G Yang, W Zheng, A peta-scalable CPU-GPU algorithm for global atmospheric simulations, ACM SIGPLAN Notices, 48(8), 1-12 (2013)
- [86] L Gan, H Fu, W Luk, C Yang, W Xue, X Huang, Y Zhang, G Yang, Accelerating solvers for global atmospheric equations through mixed-precision data flow engine, 23rd International Conference on Field programmable Logic and (2013)
- [87] H Fu, L Gan, RG Clapp, H Ruan, O Pell, O Mencer, M Flynn, X Huang et al., Scaling reverse time migration performance through reconfigurable dataflow engines, IEEE Micro, 34(1), 30-40 (2013)
- [88] L Gan, Y Wang, W Xue, T Chau, Applied Reconfigurable Computing
- [89] HS Zhong, H Wang, YH Deng, MC Chen, LC Peng, YH Luo, J Qin, D Wu et al., 粗读使用 “九章” 光量子计算优势
- [90] L Gan, GPU Acceleration on the 3D Elastic RTM Method
更新时间: 2025-06-06 11:16:15