工作经历
研究方向
学生招收与培养
在如下方向招收感兴趣的、有才华的本科生、硕士生和博士生(名额充足):
1. 并行编译器优化:针对架构的OpenMP编译器优化;
2. 并行编程模型:针对CPU+GPU、CPU+AI加速器等异构架构,研究并行编程模型;
3. 编译器自动向量化技术:针对simd架构的自动向量化及指令调度;
4. AI编译器研究:针对AI框架研究AI算子的生成和调度;
5. AI4Science方向研究:利用AI技术加速传统HPC应用;
6. GPU编译器优化:基于Mesa研究GPU指令的调度和优化;
7. 程序分析方向:利用编译技术分析程序的可靠性、漏洞以及正确性;
8. 函数式语言编译器:Haskell编译器GHC在RISC-V、国产处理器上的支持;
9. CodeSize代码密度优化:嵌入式领域针对RISC-V/ARM平台的代码密度优化。
以上方向都有充足的课题经费支持,并且与产业界密切相关。已经与华为成立创新实验室,可推荐到华为实习。学生毕业大部分进入百度、华为、阿里等头部企业或进一步深造,成为栋梁之才。
The fruit that I have gained for ever
is that which thou hast accepted.
-- Tagore
论文代表作
Shihan Yuan, Zuoyan Zhang, Guanghui Song, Junhui Peng, Feng Wang, Zhuo Tang, Kenli Li, Jie Zhao: A Decoupled Analytical Model for Tile Size Selection in Affine Programs. ACM Transactions on Architecture and Code Optimization (CCF A), (2026).
Qi Du,Feng Wang, Chengkun Wu: Parallelization Strategies for DeepMD-kit UsingOpenMP: Enhancing Efficiency in MachineLearning-Based Molecular Simulations. IEEE Transactions on Computers.(CCF A), 3534-3545.(2025)
Qi Du, Feng Wang, Chengkun Wu, Han Wang, Yongpeng Liu, Zhaoyin Zhou, Kenli Li: Scaling Deep Learning Molecular Dynamics to 500M Atoms on 4096-Node ARMv8 Clusters. 2025 The IEEE International Conference on Cluster Computing (Cluster), (2025). Best Paper Award.
Qi Du, Feng Wang, Hui Huang, Jinlin Chen: Improving LAMMPS performance for molecular dynamic simulation on large-scale HPC systems. The Computer Journal, 706–716. (2025)
Shaobai Yuan, Jihong He, Yihui Xie, Feng Wang, Jie Zhao: Post-Link Outlining for Code Size Reduction. CC 2025: 154-166 (2025)
Qingyu Gao, Liantao Song, Yan Lei, Feng Wang, Lei Wang, Shize Zong, Yan Ding. Enhancing Consistency in Container Migration via
TEE: A Secure Architecture. 2024 IEEE 23rd International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom): 21-28 (2024)
Qi Du, Feng Wang, Hui Huang, Heng Wan, Xiaoyu Wu, Chengkun Wu: Exploring Natural Language Processing Model Acceleration in Molecular Dynamics Simulation Using High-Performance Computing and Machine Learning. BIBM 2024: 1479-1484 (2024)
Zhijie Yang, Lei Wang, Wei Shi, Yao Wang, Junbo Tie, Feng Wang, Xiang Yu, LingHui Peng, Chao Xiao, Xun Xiao, Yao Yao, Gan Zhou, Xuhu Yu, Rui Gong, Xia Zhao, Yuhua Tang, Weixia Xu: Back to Homogeneous Computing: A Tightly-Coupled Neuromorphic Processor With Neuromorphic ISA. IEEE Trans. Parallel Distributed Syst. 34(11): 2910-2927 (2023)
姜浩, 杜琦, 郭敏, 全哲, 左克, 王锋, 杨灿群. 面向ARMv8 64位多核处理器的QGEMM设计与实现[J]. 计算机学报,2017,40(9):2018-2029.
孙海燕, 陈跃跃, 王锋, 杨灿群, 阳柳, 王霁: TI DSP C语言编译器正确性测试. 计算机科学 42(Z6): 513-515 (2015)
Hao Jiang, Feng Wang, Kuan Li, Canqun Yang, Kejia Zhao, Chun Huang: Implementation of an Accurate and Efficient Compensated DGEMM for 64-bit ARMv8 Multi-Core Processors. ICPADS 2015: 491-498
Feng Wang, Hao Jiang, Ke Zuo, Xing Su, Jingling Xue, Canqun Yang: Design and Implementation of a Highly Efficient DGEMM for 64-Bit ARMv8 Multi-core Processors. ICPP 2015: 200-209
Hao Jiang, Feng Wang, Yunfei Du and Lin Peng: Fast Implementation of Quad-Precision GEMM on ARMv8 64-bit Multi-Core Processor. 16th GAMM-IMACS International Symposium on Scientific Computing, Computer Arithmetic and Validated Numerics (SCAN) (2014)
Xiangke Liao, Canqun Yang, Tao Tang, Huizhan Yi, Feng Wang, Qiang Wu, Jingling Xue: OpenMC: Towards Simplifying Programming for TianHe Supercomputers. J. Comput. Sci. Technol. 29(3): 532-546 (2014)
易会战,王锋,左克,等. 基于内存缓存的异步检查点容错技术[J]. 计算机研究与发展,2014,51(6):1229-1239.
王锋,杜云飞,陈娟. GPGPU性能模型研究[J]. 计算机工程与科学, 2013, 35(12): 1-7.
Canqun Yang, Qiang Wu, Tao Tang, Feng Wang, Jingling Xue. (2013). Programming for scientific computing on peta-scale heterogeneous parallel systems. Journal of Central South University, 20(2013), 1189–1203.
Peng Di, Hui Wu, Jingling Xue, Feng Wang, Canqun Yang: Parallelizing SOR for GPGPUs using alternate loop tiling. Parallel Comput. 38(6-7): 310-328 (2012)
刘勇鹏, 王锋, 卢凯,等.面向异构并行计算系统的流水线式压缩检查点[J].电子学报, 2012, 40(002):223-229.
Qiang Wu, Canqun Yang, Feng Wang, Jingling Xue: A Fast Parallel Implementation of Molecular Dynamics with the Morse Potential on a Heterogeneous Petascale Supercomputer. IPDPS Workshops 2012: 140-149
Feng Wang, Canqun Yang, Yunfei Du, Juan Chen, Huizhan Yi, Weixia Xu: Optimizing Linpack Benchmark on GPU-Accelerated Petascale Supercomputer. J. Comput. Sci. Technol. 26(5): 854-865 (2011)
杨灿群,王锋,杜云飞. Cell处理器上的软件Cache研究[J]. 计算机工程与科学, 2011, 33(2): 46-50.
Canqun Yang, Feng Wang, Yunfei Du, Juan Chen, Jie Liu, Huizhan Yi, Kai Lu: Adaptive Optimization for Petascale Heterogeneous CPU/GPU Computing. CLUSTER 2010: 19-28 (Best Paper Award)
Canqun Yang, Zhen Ge, Juan Chen, Feng Wang, Yunfei Du: Solving 2D Nonlinear Unsteady Convection-Diffusion Equations on Heterogenous Platforms with Multiple GPUs. ICPADS 2009: 961-966
杨灿群,王锋,彭林,杨学军.用表驱动算法在GCC中优化实现指数函数[J].计算机工程与科学,2007,29(5):77-80
王锋, 杨灿群. 编译器前端乘幂运算的实现与优化[J]. 计算机工程与应用, 2004, 40(36): 47-49.
主要项目
主要授权专利