
Zhang Yunquan (张云泉)

Published: June 26, 2018

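Research Staff
Title: Research Professor
Department: State Key Laboratory of Computer Architecture
Research interests: parallel algorithms and parallel software, parallel computing models, performance optimization and performance evaluation
Contact: zyq@ict.ac.cn
Résumé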

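September 1991 - July 1995: Department of Computer Science and Technology, Beijing Institute of Technology, major in Computer Applications; Bachelor of Engineering.
September 1995 - July 2000: Institute of Software, Chinese Academy of Sciences (CAS), combined master's and doctoral program in Computer Software and Theory; Doctor of Engineering.
July 2000 - December 2001: Parallel Software Research and Development Center, Institute of Software, CAS; parallel algorithms and parallel software; Assistant Research Professor.
January 2002 - March 2007: Parallel Software Research and Development Center, Institute of Software, CAS; parallel algorithms and parallel software; Associate Research Professor.
May 2003 - August 2013: Laboratory of Parallel Computing, Institute of Software, CAS; parallel algorithms and parallel software; Executive Deputy Director.
April 2007 - August 2013: Laboratory of Parallel Computing, Institute of Software, CAS; parallel algorithms and parallel software; Research Professor.
June 2007 - August 2013: Laboratory of Parallel Computing, Institute of Software, CAS; parallel algorithms and parallel software; doctoral supervisor.
August 2010 - August 2013: Director of the "APU Software Joint Research and Development Center" run by the Institute of Software, CAS, and AMD.
May 2011 - August 2013: Chinese-side Director of the "PPCT Joint Laboratory" (a Joint Lab for Parallel Processing and Computing Techniques Research) run by the Institute of Software, CAS, and the Mathematics and Computer Science Division (MCS) of Argonne National Laboratory, USA.
August 2013 - present: State Key Laboratory of Computer Architecture, Institute of Computing Technology, CAS; Research Professor and doctoral supervisor.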

 

Professional Service, Awards, and Honors

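1. Director, National Supercomputing Center in Jinan
2. Executive Chairman, China Big Data Industry Application Collaborative Innovation Alliance (中国大数据产业应用协同创新联盟)
3. Executive Vice Chairman, National University Artificial Intelligence and Big Data Innovation Alliance (全国高校人工智能与大数据创新联盟)
4. Standing Council Member, ACM China
5. Standing Council Member, China Software Industry Association
6. Standing Council Member, China Computer Federation (CCF), and Secretary-General of its Technical Committee on High Performance Computing
7. Deputy Secretary-General, Big Data Expert Committee (大数据专家委员会)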
 

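He has received one National Science and Technology Progress Award (Second Prize), one CAS Science and Technology Progress Award (Second Prize), the 2016 CCF Science and Technology Award (Second Prize), the inaugural CCF Qingzhu Award (青竹奖) in 2017, the 2017 CAS Science and Education Achievement Award (First Prize), and the 2017 CAS Outstanding Science and Technology Achievement Prize. Other honors: National Advanced Worker of the China Software Industry Association, 2005 to 2011; Outstanding Supervisor Award of the Institute of Software, CAS, 2010; CAS President's Scholarship (Excellence Prize), 2000; National Science and Technology Progress Award (Second Prize), 2000, ranked 9th among the awardees; CAS Science and Technology Progress Award (Second Prize), 1998, ranked 9th among the awardees.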

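Selected Publications
He has published more than 100 academic papers in China and abroad, along with two translated books and three book chapters. Of these papers, 4 are indexed by SCI, 7 by ISTP, and 38 by EI. More than ten appeared at well-known international conferences, including SC, PPoPP, Cluster, ECIR, IEEE ICPADS, Euro-Par, IEEE ICPP, ICA3PP, and IEEE HPCC.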

  Translated Books

  1. Zhang Yunquan, Zhang Xianyi, Long Guoping, and Yao Jifeng (translators). Benedict R. Gaster, Lee Howes, David R. Kaeli, Perhaad Mistry, and Dana Schaa (authors). 《OpenCL异构计算》 (Heterogeneous Computing with OpenCL). Tsinghua University Press, June 2012.

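  2. Zhang Yunquan and Chen Ying (translators). C. Xavier (India) and S. S. Iyengar (USA) (authors). 《并行算法导论》 (Introduction to Parallel Algorithms). China Machine Press / CITIC Publishing House, Computer Science Series, ISBN 7-111-13390-0/TP.329. First edition, first printing, Beijing, February 2004.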

  Books & Chapters

  1. 张云泉,孙家昶,袁国兴,张林波,“2004年高性能计算机发展趋势分析与展望”, 《2004年中国计算机科学技术发展报告》第一篇《高性能计算机》,pp. 1-22, 中国计算机学会学术工作委员会主编, 清华大学出版社,ISBN 7-302-11420-X,2005年8月。

  2. 张云泉,孙家昶,袁国兴,张林波,“2005年高性能计算机排行榜对比分析”, 《中国计算机科学技术发展报告2005》第一篇《计算机》,pp. 3-25, 中国计算机学会文集(CCFP 0002),中国计算机学会学术工作委员会主编, 清华大学出版社,ISBN 7-302-13503-7/TP 8471,2006年8月。

  3. 张云泉,孙家昶,袁国兴,张林波,“2006年高性能计算机排行榜对比分析”, 《中国计算机科学技术发展报告2006》第一篇《高性能计算机》,pp. 47-67, 中国计算机学会文集(CCFP 0005),中国计算机学会主编, 清华大学出版社,ISBN 978-7-302-16262-9,2007年11月。

  Journal Papers

  1. Zhang Yunquan, Sun Jiachang, Yuan Guoxing, and Zhang Linbo. Perspectives of China’s HPC system development: a view from the 2009 China HPC TOP100 list. China: Frontiers of Computer Science in China, 2010. 437-444. Indexed in: SCI, EI (601112w2r531)

  2. 孙相征, 张云泉, 王宣强, 王磊. 数值软件自适应性能优化搜索过程评价技术研究. 计算机研究与发展, 2010, 47(4): 679-686. EI (201004015)

  3. 刘胜飞, 张云泉, 孙相征. 一种改进的OpenMP指导调度策略研究. 计算机研究与发展, 2010, 47(4): 687-694. EI (201004016)

  4. 余元, 张云泉, 李会元. 一类非张量积区域快速傅立叶变换算法在国产并行机上的可扩展性测试. 数值计算与计算机应用, 2010, 31(2): 123-130.

  5. 孙相征, 张云泉, 王婷, 杨超, 李力刚. 天体大规模数值模拟软件性能优化. 华中科技大学学报(自然科学版), 2010, (1): 51-54.

  6. Chen Shao-Hu, Zhang Yunquan, Zhang Xian-Yi, and Cheng Hao. Performance Testing And Analysis Of BLAS Libraries On Multi-core CPUs. Ruan Jian Xue Bao/Journal of Software, 2010, 21(SUPPL. 1): 214-223. EI (20111013735688)

  7. Yuan Liang, Zhang Yunquan, Long Guo-Ping, Wang Ke, and Zhang Xian-Yi. A GPU Computational Model Based On Latency Hidden Factor. Ruan Jian Xue Bao/Journal of Software, 2010, 21(SUPPL. 1): 251-262. EI (20111013735691)

  8. 王婷, 孙相征, 张云泉, 杨超, 李力刚, 刘芳芳, 管文华, 唐雨新, 姚继峰. 曙光5000A天体大规模数值模拟软件性能测试. 西安交通大学学报, 2009, 43(10): 71-75.

  9. 袁娥, 张云泉, 刘芳芳, 孙相征. SpMV的自动性能优化实现技术及其应用研究. 计算机研究与发展, 2009, 46(7): 1117-1126. EI (200907007)

  10. Tang Yuxin, Zhang Yunquan, and Chen Hu. A Parallel Shortest Path Algorithm Based On Graph-partitioning And Iterative Correcting. Computer Systems Science and Engineering, 2009, 24(5): 351-360. (SCI)

  11. Zhang Jian, Zhang Wenhui, Zhan Naijun, Shen Yidong, Chen Haiming, Zhang Yunquan, Wang Yongji, Wu Enhua, Wang Hongan, and Zhu Xueyang. Basic Research In Computer Science And Software Engineering At SKLCS. Frontiers of Computer Science in China, 2008, 2(1): 1-11. EI (20081511188751)

  12. 陈虎, 张云泉, 柳锴, 李玉成. 基于机群架构的并行数据库中间件系统改进研究. 计算机研究与发展, 2007, 44(z3): 142-146.

  13. Zhang Yunquan, Chen Guoliang, Sun Guangzhong, and Miao Qiankun. Models Of Parallel Computation: A Survey And Classification. Frontiers of Computer Science in China, 2007, 1(2): 156-165. EI (20074811741394)

  14. 陈靖, 张云泉, 张林波, 袁伟. 一种新的MPI Allgather算法及其在万亿次机群系统上的实现与性能分析. 计算机学报, 2006, 29(5): 808-814. EI (200605017)

  15. Chen Guo-Liang, Sun Guang-Zhong, Zhang Yunquan, and Mo Ze-Yao. Study On Parallel Computing. Journal of Computer Science and Technology, 2006, 21(5): 665-673. (SCI) EI (20064310193819)

  16. 袁伟, 张云泉, 孙家昶, 李玉成. 国产万亿次机群系统NPB性能测试分析. 计算机研究与发展, 2005, 42(6): 1079-1084. EI (200506027)

  17. 唐渊, 孙家昶, 张云泉, 张林波. 集群网络评测模型的新探索. 软件学报, 2005, 16(6): 1131-1139. EI (200506012)

  18. 张广治, 张云泉, 李伟华, 李玉成. FM-index算法性能测试及并行化. 计算机工程, 2005, 31(22): 51-53. EI (200522018)

  19. 张云泉. 面向高性能数值计算的并行计算模型DRAM(h). 计算机学报, 2003, 26(12): 1660-1670. EI (200312007)

  20. 张云泉, 孙家昶, 唐志敏, 迟学斌. 数值计算程序的存储复杂性分析. 计算机学报, 2000, 23(4): 363-373.

  21. 张云泉, 施巍松. 负载平衡无关的并行程序最适处理器网格选择. 软件学报, 2000, 11(12): 1674-1680.

  22. 熊玉庆, 张云泉. 并行计算通信库测试方法研究及实践. 软件学报, 2000, 11(12): 1681-1684.

  23. 张云泉, 迟学斌. 在PVM应用程序中调用ScaLAPACK库函数方法. 数值计算与计算机应用, 1999, (4): 274-282.

  24. Yan Li, Yunquan Zhang, Yi-Qun Liu, Guoping Long, Haipeng Jia: MPFFT: An Auto-Tuning FFT Library for OpenCL GPUs. J. Comput. Sci. Technol. 28(1): 90-105 (2013) (SCI)

  25. Wang Weiyan, Zhang Yunquan, Yan Shengen, Zhang Ying, and Jia Haipeng. Parallelization and performance optimization on face detection algorithm with OpenCL: A case study. Tsinghua Science and Technology, Vol. 17, No. 3, pp. 287-295, 2012.

  Conference Papers

  1. Sun Xiangzheng, Zhang Yunquan, Wang Ting, Long Guoping, Zhang Xianyi, and Li Yan. CRSD: Application Specific Auto-tuning Of SpMV For Diagonal Sparse Matrices. In: Euro-Par 2011 Parallel Processing. 2011. 316-327.

  2. Yang Chao, Li Ligang, and Zhang Yunquan. Development of a Scalable Solver for the Earth’s Core Convection. In: High Performance Computing and Applications. Germany: 2010. 497-502. Indexed in: ISTP, EI (11193g703466716r)

  3. Yang Chao, Zhang Yunquan, and Li Ligang. Numerical Simulation Of The Thermal Convection In The Earth's Outer Core. In: Proceedings - 2010 12th IEEE International Conference on High Performance Computing and Communications, HPCC 2010. United States: 2010. 552-555. EI (20104613376545)

  4. Wang Lei, Zhang Yunquan, Zhang Xianyi, and Liu Fangfang. Accelerating Linpack Performance With Mixed Precision Algorithm On CPU+GPGPU Heterogeneous Cluster. In: Proceedings - 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, ScalCom-2010. United States: 2010. 1169-1174. EI (20104613393066)

  5. Wang Jing, Zhang Yunquan, Zhang Xianyi, Sun Xiangzheng, and Sheng Quanhu. QuantWiz: A Scalable Parallel Software Package For Label-free Protein Quantification. In: Proceedings 2010 IEEE 5th International Conference on Bio-Inspired Computing: Theories and Applications, BIC-TA 2010. United States: 2010. 976-980. EI (20105213534210)

  6. Wang Jing, Zhang Yunquan, Zhang Xianyi, Sun Xiangzheng, Hu Zelin, Li Sujun, and Zeng Rong. QuantWiz: A Parallel Software Package For LC-MS-based Label-free Protein Quantification. In: 2009 11th IEEE International Conference on High Performance Computing and Communications, HPCC 2009. United States: 2009. 683-687. ISTP, EI (20094712487667)

  7. Yu Yuan, Zhang Yunquan, Wang Ting, Sun Jiachang, Zhang Xianyi, Tang Yuxin, and Rao Li. Early Performance Evaluation Of Dawning 5000A And DeepComp 7000. In: Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS. United States: 2009. 578-585. EI (20101212791530)

  8. Liu Shengfei, Zhang Yunquan, Sun Xiangzheng, and Qiu RongRong. Performance Evaluation Of Multithreaded Sparse Matrix-vector Multiplication Using OpenMP. In: HPCC 2009: 11th IEEE International Conference on High Performance Computing and Communications. 2009. 659-665.

  9. Zhang Di, Zhang Yunquan, Liu Shengfei, and Huang Xiaodi. Parallelization Of FM-index. In: Proceedings - 10th IEEE International Conference on High Performance Computing and Communications, HPCC 2008. United States: 2008. 169-173. ISTP, EI (20084811737714)

  10. Tang Yuan and Zhang Yunquan. Utilizing The Multi-threading Techniques To Improve The Two-level Checkpoint/rollback System For MPI Applications. In: Proceedings - 10th IEEE International Conference on High Performance Computing and Communications, HPCC 2008. United States: 2008. 864-869. EI (20084811737808)

  11. Tang Yuxin, Zhang Yunquan, and Chen Hu. A Parallel Shortest Path Algorithm Based On Graph-partitioning And Iterative Correcting. In: Proceedings - 10th IEEE International Conference on High Performance Computing and Communications, HPCC 2008. United States: 2008. 155-161. ISTP, EI (20084811737712)

  12. Zhang Yunquan, Sun Jiachang, Yuan Guoxing, and Zhang Linbo. A Brief Introduction To China HPC TOP100: From 2002 To 2006. In: CHINA HPC 2007: Proceedings of the Asian Technology Information Program's (ATIP's) 3rd Workshop on High Performance Computing in China - Solution Approaches to Impediments for High Performance. United States: 2007. 32-36. EI (20085011783002)

  13. Zhang Di, Zhang Yunquan, and Chen Jing. Efficient Construction Of FM-index Using Overlapping Block Processing For Large Scale Texts. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Germany: 2007. 113-123. EI (20075110984449)

  14. Zhang Yunquan, Chen Ying, and Tang Yuan. Block Size Selection Of Parallel LU And QR On PVP-based And RISC-based Supercomputers. In: CHINA HPC 2007: Proceedings of the Asian Technology Information Program's (ATIP's) 3rd Workshop on High Performance Computing in China - Solution Approaches to Impediments for High Performance. United States: 2007. 115-125. EI (20085011783014)

  15. Zhang Di, Zhang Yunquan, and Chen Jing. Efficient Construction Of FM-index Using Overlapping Block Processing For Large Scale Texts. In: Advances in Information Retrieval. Berlin, Germany: 2007. 113-123. ISTP

  16. 张云泉, 孙家昶, 袁国兴, 张林波. 2004年高性能计算机发展趋势分析与展望. 上海: 中国计算机学会, 2005.

  17. Chen Jing, Zhang Linbo, Zhang Yunquan, and Yuan Wei. Performance Evaluation Of Allgather Algorithms On Terascale Linux Cluster With Fast Ethernet. In: Proceedings - Eighth International Conference on High-Performance Computing in Asia-Pacific Region, HPC Asia 2005. United States: 2005. 437-442. EI (20070910439446)

  18. Zhang Yunquan. Performance Characteristics Of Itanium2 And Opteron For Numerical Scientific Computing: A Common User's View. In: DCABES and ICPACE Joint Conference on Distributed Algorithms for Science and Engineering. London, England: 2005. 85-88. ISTP

  19. Tang Yuan, Zhang Yunquan, Sun Jia-Chang, and Li Yu-Cheng. Hardware Impact On Communication Performance Of Beowulf Linux Cluster. In: IASTED International Multi-Conference on Applied Informatics. 2003. 495-500. EI (2004128066617)

  20. Shengen Yan, Guoping Long, Yunquan Zhang: StreamScan: fast scan algorithms for GPUs without global barrier synchronization. PPoPP 2013: 229-238. Shenzhen, China.

  21. Xianyi Zhang, Qian Wang, Yunquan Zhang, Model-driven Level 3 BLAS Performance Optimization on Loongson 3A Processor, ICPADS 2012, Singapore.

  22. Liang Yuan, Yunquan Zhang: A Locality-based Performance Model for Load-and-Compute Style Computation. CLUSTER 2012: 566-571

  23. Haipeng Jia, Yunquan Zhang, Guoping Long, Jianliang Xu, Shengen Yan, Yan Li: GPURoofline: A Model for Guiding Performance Optimizations on GPUs. Euro-Par 2012: 920-932

  24. Haipeng Jia, Yunquan Zhang, Weiyan Wang, Jianliang Xu: Accelerating Viola-Jones Face Detection Algorithm on GPUs. HPCC 2012: 396-403

  25. Haipeng Jia, Yunquan Zhang, Guoping Long, Shengen Yan: An Insightful Program Performance Tuning Chain for GPU Computing. ICA3PP (1) 2012: 502-516

  26. Liang Yuan, Chen Ding, Daniel Štefankovič, Yunquan Zhang: Modeling the Locality in Graph Traversals. ICPP 2012: 138-147

  27. Chao Li, Yunquan Zhang, Changwen Zheng, Xiaohui Hu: Implementing High-performance Intensity Model with Blur Effect on GPUs for Large-scale Star Image Simulation. IPDPS Workshops 2012: 1879-1888
