Journal of System Simulation ›› 2022, Vol. 34 ›› Issue (10): 2107-2118.doi: 10.16182/j.issn1004731x.joss.21-1365
Chunye Gong1, Jie Liu1, Weimin Bao2, Dongmei Pan1, Xinbiao Gan1, Shengguo Li1, Xuguang Chen1, Tiaojie Xiao1, Bo Yang1, Ruibo Wang1
Received: 2021-12-30
Revised: 2022-02-25
Online: 2022-10-30
Published: 2022-10-18
Contact: Bo Yang
E-mail: gongchunye@nudt.edu.cn; yangbo78@nudt.edu.cn
CLC Number:
Chunye Gong, Jie Liu, Weimin Bao, Dongmei Pan, Xinbiao Gan, Shengguo Li, Xuguang Chen, Tiaojie Xiao, Bo Yang, Ruibo Wang. Review on Ecological Construction of Domestic High-performance Parallel Application Software in Post Moore Era[J]. Journal of System Simulation, 2022, 34(10): 2107-2118.
1 | Koenig M E D. The Convergence of Moore's/Mooers' Laws[J]. Information Processing & Management (S0306-4573), 1987, 23(6): 583-592. |
2 | Kim N S, Austin T, Blaauw D, et al. Leakage Current: Moore's Law Meets Static Power[J]. Computer (S1460-2067), 2003, 36(12): 68-75. |
3 | Waldrop M M. The Chips are Down for Moore's Law [J]. Nature (S0028-0836), 2016, 530(7589): 144-147. DOI: 10.1038/530144a. |
4 | Yang Xuejun, Liao Xiangke, Lu Kai, et al. The TianHe-1A Supercomputer: Its Hardware and Software [J]. Journal of Computer Science and Technology (S1000-9000), 2011, 26(3): 344-351. |
5 | Liao Xiangke, Xiao Liquan, Yang Canqun, et al. MilkyWay-2 Supercomputer: System and Application [J]. Frontiers of Computer Science (S2095-2228), 2014, 8(3): 345-356. |
6 | Wang R, Lu K, Chen J, et al. Brief Introduction of TianHe Exascale Prototype System [J]. Tsinghua Science and Technology(S1007-0214), 2021, 26(3): 361-369. DOI: 10.26599/TST.2020.9010009 . |
7 | Fu H, Liao J, Yang J, et al. The Sunway TaihuLight Supercomputer: System and Applications [J]. Science China Information Sciences(S1674-733X), 2016, 59(7): 072001. |
8 | Chen D, Fang J, Xu C, et al. Characterizing Scalability of Sparse Matrix-Vector Multiplications on Phytium FT-2000+[J]. International Journal of Parallel Programming(S0885-7458), 2020, 48(1): 80-97. |
9 | Huawei. Kunpeng Server Motherboard S920X01 (2U) Technical White Paper 01 [R]. Guizhou: Huawei, 2020. |
10 | Liu Sheng, Lu Kai, Guo Yang, et al. A Self-Designed Heterogeneous Accelerator for Exascale High Performance Computing[J]. Journal of Computer Research and Development, 2021, 58(6): 1234-1237. |
11 | Xu Z, Lin J, Matsuoka S. Benchmarking SW26010 Many-Core Processor [C]//2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). Orlando, USA. United States: IEEE, 2017: 743-752. |
12 | Yang Canqun. National SuperComputer Center in Tianjin[EB/OL]. (2021-06-14)[2021-11-14]. |
13 | Lu Yutong. National SuperComputer Center in Guangzhou[EB/OL]. (2021-06-12)[2021-07-11]. |
14 | Li Kenli. National Super Computer Center in Changsha [EB/OL]. (2021-02-12)[2021-06-24]. . |
15 | Gong Chunye, Liu Jie, Chi Lihua, et al. GPU Accelerated Simulations of 3D Deterministic Particle Transport Using Discrete Ordinates Method [J]. Journal of Computational Physics(S0021-9991), 2011, 230(15): 6010-6022. |
16 | Gong Chunye, Liu Jie, Huang Haowei, et al. Particle Transport with Unstructured Grid on GPU [J]. Computer Physics Communications(S0010-4655), 2012, 183(3): 588-593. |
17 | Yang B, Lu K, Liu J, et al. GPU Accelerated Monte Carlo Simulation of Deep Penetration Neutron Transport[C]//2012 2nd IEEE International Conference on Parallel, Distributed and Grid Computing. Solan, India: IEEE, 2012: 899-904. |
18 | Gong Chunye. Parallel Algorithms Research of Particle Transport on Heterogeneous Architecture[D]. Changsha: National University of Defense Technology, 2011. |
19 | Xu Chuanfu, Deng Xiaogang, Zhang Lilun, et al. Collaborating CPU and GPU for Large-scale High-order CFD Simulations with Complex Grids on the TianHe-1A Supercomputer[J]. Journal of Computational Physics (S0021-9991), 2014, 278: 275-297. |
20 | Heinecke A, Breuer A, Rettenberger S, et al. Petascale High Order Dynamic Rupture Earthquake Simulations on Heterogeneous Supercomputers[C]//International Conference for High Performance Computing, Networking, Storage and Analysis. New Orleans, LA: IEEE, 2014: 3-14. DOI: 10.1109/SC.2014.6 . |
21 | Li Zhe, Wu Chengkun, Li Yishui, et al. FEP-Based Large-Scale Virtual Screening for Effective Drug Discovery Against COVID-19[J/OL].[2021-07-11] journalsSAGE, Received. |
22 | Gan X, Zhang Y, Wang R,et al. TianheGraph: Customizing Graph Search for Graph500 on Tianhe Supercomputer [J]. IEEE Transactions on Parallel and Distributed Systems (S1045-9219), 2022, 33(4): 941-951. DOI: 10.1109/TPDS.2021.3100785 . |
23 | Yang Chao, Xue Wei, Fu Haohuan, et al. 10M-Core Scalable Fully-implicit Solver for Nonhydrostatic Atmospheric Dynamics [C]//International Conference for High Performance Computing, Networking, Storage and Analysis. Salt Lake City, Utah, USA:IEEE Press, 2016: 57-68. DOI: 10.1109/SC.2016.5 . |
24 | Fu Haohuan, Yin Wanwang, Yang Guangwen, et al. 18.9-Pflops Nonlinear Earthquake Simulation on Sunway TaihuLight[C]//International Conference for High Performance Computing, Networking, Storage and Analysis (SC'17). Denver, CO, USA: IEEE Computer Society, 2017: 1-12. DOI: 10.1145/3126908.3126910. |
25 | Liu Yong, Liu Xin, Li Fang, et al. Closing the “Quantum Supremacy” Gap: Achieving Real-Time Simulation of a Random Quantum Circuit Using a New Sunway Supercomputer [C]//International Conference for High Performance Computing, Networking, Storage and Analysis. New York, NY, USA: Association for Computing Machinery, 2021: 1-12. |
26 | Open Cascade SAS. Open Cascade [EB/OL]. (2021-02-01)[2021-08-16]. . |
27 | Lu Fengshun, Qi Long, Jiang Xiong, et al. NNW-GridStar: Interactive Structured Mesh Generation Software for Aircrafts [J]. Advances in Engineering Software (S0965-9978), 2020, 145: 102803. DOI:10.1016/j.advengsoft.2020 . |
28 | Li Haifeng, Zheng Peng, Fang Wei, et al. Parallel Mesh Generation for Large-scale Numerical Simulation[C]//12th China CAE Engineering Analysis Technology Annual Conference. Beijing: Beijing Novotel Machinery Science and Technology Development Center, 2016: 7-12. |
29 | Gao Xiang, Zhang Xiang, Xu Chuanfu, et al. Research on General Mesh Generation Software for Scientific and Engineering Computing[J]. Computer Engineering and Science, 2020, 42(10): 1897-1904. |
30 | Sandia National Laboratory, ParaView, Los Alamos National Laboratory. Paraview [EB/OL]. (2021-03-01)[2021-08-16]. . |
31 | Xu Xiaowen, Mo Zeyao, Yue Xiaoqiang, et al. α Setup-AMG: an Adaptive-Setup-Based Parallel AMG Solver for Sequence of Sparse Linear Systems[J]. CCF Transactions on High Performance Computing (S2524-4922), 2020, 2(2): 98-110. DOI:10.1007/s42514-020-00033-w . |
32 | Li Y, Xie P, Chen X, et al. VBSF: a New Storage Format for SIMD Sparse Matrix-Vector Multiplication on Modern Processors[J]. The Journal of Supercomputing (S0920-8542), 2020, 76(3): 2063-2081. |
33 | Li Z, Jia H, Zhang Y, et al. Automatic Generation of High-performance FFT Kernels on Arm and X86 CPUs[J]. IEEE Transactions on Parallel and Distributed Systems(S1045-9219), 2020, 31(8): 1925-1941. DOI: 10.1109/TPDS.2020.2977629 . |
34 | Corporation Nvidia. Nvidia Developer [EB/OL]. (2021-06-01)[2021-07-21]. . |
35 | Zhang Xianyi, Wang Qian, Zhang Yunquan. OpenBLAS: High Performance BLAS Library of Godson 3A CPU[J]. Journal of Software, 2011, 22(S2): 208-216. |
36 | Chen Jianqiang. National Numerical Windtunnel [EB/OL]. (2021-05-12)[2021-08-03]. |
37 | Shan Fanli, Zhang Dingrui, Hou Lingyun, et al. Partially Premixed Combustion Simulation Using a Novel Transported Multi-Regime Flamelet Model[J]. Acta Astronautica(S1879-2030), 2022, 191: 245-257. |
38 | State Council. Several Policies for Promoting the High-quality Development of the Integrated Circuit Industry and the Software Industry in the New Era [EB/OL]. (2021-09-01)[2021-12-03]. |
39 | Vetter J. Exploring Emerging Technologies in the HPC Co-Design Space[C]//APS March Meeting 2014. USA: American Physical Society, 2014. |
40 | Wang Qinglin, Liu Jie, Gong Chunye, et al. Scalability of 3D Deterministic Particle Transport on the Intel MIC Architecture[J]. Nuclear Science and Techniques (S1001-8042), 2015, 26(5): P050502-1. |
41 | Gong Chunye, Bao Weimin, Liu Jie, et al. An Efficient Wavefront Parallel Algorithm for Structured Three Dimensional LU-SGS[J]. Computers & Fluids (S0045-7930), 2016, 134-135: 23-30. |
42 | Gong Chunye, Bao Weimin, Tang Guojian. A Parallel Algorithm for the Riesz Fractional Reaction-Diffusion Equation with Explicit Finite Difference Method[J]. Fractional Calculus and Applied Analysis (S1311-0454), 2013, 16(3): 654-669. |
43 | Hennessy J L, Patterson D A. A New Golden Age for Computer Architecture[J]. Communications of the ACM (S0001-0782), 2019, 62(2): 48-60. DOI: 10.1145/3282307. |
44 | Zhao Guangli. Academician Li Guojie: The Coming Decades Are the Golden Age of Parallel Computing [EB/OL]. (2020-03-25)[2021-12-01]. |
45 | Ministry of Industry and Information Technology. Development Plan for the Software and Information Technology Services Industry During the 14th Five-Year Plan Period [EB/OL]. (2021-11-30)[2021-12-01]. |
46 | Song Yan. Guiding Opinions on Improving the Evaluation Mechanism for Scientific and Technological Achievements [EB/OL]. (2021-08-02)[2021-12-01]. |
[1] | Jiang Lin, He Feilong, Shan Rui, Wang Shuai, Wu Haoyue, Wu Xin. Design and Implementation of Reconfigurable Video Array Processor Test Platform [J]. Journal of System Simulation, 2020, 32(5): 792-800. |
[2] | Xue Junjie, Shi Guoqiang, Zhou Junhua, Qu Huiyang, Tao Luan, Pu Ruiying. Simplification and Compression Service Construction of 3D model for Complex Products [J]. Journal of System Simulation, 2020, 32(4): 553-561. |
[3] | Tie Ming, Yu Ying, Zhang Xing, Wang Jianlin. HPC-based Virtual Flight Test Method for Multidisciplinary Multiphysical Field Coupling [J]. Journal of System Simulation, 2019, 31(9): 1733-1740. |