文章、专利及软件著作权(部分)


编程与编译优化方向


  • 并行编程


1.      刘艳娜,陈莉,唐生林. 一个面向任务图并行程序的错误检查工具, 计算机科学, 2016
2.      刘颖,吕方,王蕾,陈莉,崔慧敏,冯晓兵. 异构并行编程模型研究与进展, [J];软件学报, 2014 (7) :1459-1475
3.      王蕾,崔慧敏,陈莉,冯晓兵.任务并行编程模型研究与进展. [J];软件学报,2013,24(1):77-90
4.      李恒杰,何文婷,陈莉,刘雷,吴承勇.支持算法组件自动替换的编程范式及编译框架.高技术通讯, 2013 , 23 (11) :1131-1138
5.      Li Chen, Baojiang Shou, Xionghui Hou, and Lei Huang. A Compiler-Assisted Runtime-Prefetching Scheme for Heterogeneous Platforms. In Proceedings of the 8thinternational workshop on OpenMP (IWOMP'12), LNCS 7312,Barbara M. Chapman et al (Eds.):Springer-Verlag, Berlin, Heidelberg, 116–129.
6.      Shixiong Xu, Li Chen. Shared Work List: Hacking Amorphous Data Parallelism in UPC. 2012 International Workshop on Programming Models and Applications for Multicores and Manycores. Held in conjunction with PPoPP 2012. Orleans, LA, February 26, 2012.
7.      Dongni Han, Shixiong Xu, Li Chen, Lei Huang. PADS: A Pattern-Driven Stencil Compiler-Based Tool for Reuse of Optimizations on GPGPUs. The 17th IEEE International Conference on Parallel and Distributed Systems (ICPADS'2011), Tainan, Taiwan, 10th-12th December, 2011.
8.      Shixiong Xu, Dongni Han, Li Chen. Computation Pattern Driven Reuse of Manual Optimizations for GPGPUs, The 12th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2011), Gwangju, South Korea. Oct. 2011.
9.      Li Chen, Lei Liu, Shenglin Tang, Lei Huang, Zheng Jing, Shixiong Xu, Dingfei Zhang, Baojiang Shou. Unified Parallel C for GPU Clusters: Language Extensions and Compiler Implementation. The 23rd International Workshop on Languages and Compilers for Parallel Computing (LCPC2010), Houston, USA. Oct. 2010. LNCS 6548, pp. 151–165, 2011.
10.   Mi Wei, Li Yuxiang, Chen Li, etc. A source-to-source translation method with type restoration in a compiler, Journal of Computer Research and Development,47(7):1145-1155, 2010
11.   Yuxiang Li, Hui Shi, Li Chen. Vectorization Oriented Local Data Reorganization, Journal of Chinese Computer Systems, Vol. 8, pp.1528-1534, 2009
12.   Yuxiang Li, Hui Shi, Li Chen. The Improvement on SIMD Vectorization Algorithm for Non-multimedia Applications, Journal of Chinese Computer Systems, Vol. 10, pp.1927-1935, 2009
13.   Lei Liu, Li Chen, etc. Global loop tiling for distributed memory systems, 14th International Euro-Par Conference on Parallel and Distributed Computing, August 26–29, 2008. Las Palmas, Spain. LNCS 5168, pp.382-391.
14.   Lei Liu, Dingfei Zhang, Hengjie Li, Li Chen. Automatic Implementation of Multi-partitioning Using Global Tiling,In Proceedings of the 14th IEEE International Conference on Parallel and Distributed Systems (ICPADS'08),Melbourne, Victoria, AUSTRALIA, 8th-10th December, 2008. pp.673~680.
15.   Yiran Wang, Li Chen and Zhao-Qing Zhang. Global Partial Replicate Computation Partitioning. Proceedings of the 2004 International Conference on Parallel Processing, Montreal, Quebec, Canada, August 15-18,2004, pp. 108-116.
16. Li Chen, Zhao-Qing Zhang, Xiao-Bing Feng. Redundant Computation Partition on Distributed-Memory Systems. Fifth International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP'02). China, 2002. pp. 252-260.

  • 多核编程


A. 论文
[1]     卢兴敬,刘雷,贾海鹏,冯晓兵,武成岗.ParaC:面向GPU平台的图像处理领域的编程框架.软件学报,2017,28(7)
[2]     Jing Li, Yuan Wu, Xiaobing Feng, Chengyong Wu. Two-Level task scheduling for irregular applications on GPU platform. Int’l Journal of Parallel Programming, 2017. [doi: 10.1007/s10766-015-0387-0]
[3]     Xingjing Lu, Long Chen, Zhiyuan Li. Performance Evaluation and Enhancement of Process-Based Parallel Loop Execution. International Journal of Parallel Programming. February 2017, Volume 45, Issue 1, pp 185–198
[4]     Li Jing, Liu Lei, Wu Yuan, Liu Xiang-Hua, Gao Yi, Feng xiaobing. Pragma directed shared memory centric optimizations on GPUs. Journal of Computer Science and Technology, 2016,31:235. [doi: 10.1007/s11390-016-1624-8]
[5]     Chuanle Ke, Lei Liu, Chao Zhang, Tongxin Bai, Bryan Jacobs, and Chen Ding. 2011. Safe parallel programming using dynamic dependence hints. In Proceedings of the 2011 ACM international conference on Object oriented programming systems languages and applications (OOPSLA '11). ACM, New York, NY, USA, 243-258. DOI: https://doi.org/10.1145/2048066.2048087
[6]     Lei Liu, Dingfei Zhang, Hengjie Li, and Li Chen. 2008. Automatic Implementation of Multi-partitioning Using Global Tiling. In Proceedings of the 2008 14th IEEE International Conference on Parallel and Distributed Systems (ICPADS '08). IEEE Computer Society, Washington, DC, USA, 673-680. DOI=http://dx.doi.org/10.1109/ICPADS.2008.91
[7]     Lei Liu, Li Chen, Chengyong Wu, and Xiao-Bing Feng. 2008. Global Tiling for Communication Minimal Parallelization on Distributed Memory Systems. In Proceedings of the 14th international Euro-Par conference on Parallel Processing (Euro-Par '08), Emilio Luque, Tomàs Margalef, and Domingo Benítez (Eds.). Springer-Verlag, Berlin, Heidelberg, 382-391.

B. 专利
[1]     赵鹏 刘雷 曹玮 一种任务调度方法及装置. 申请号:201610188139.2
[2]     赵鹏 刘雷 曹玮 访问哈希表的装置和方法. 申请号:201510179425.8
[3]     卢兴敬 一种面向图像处理的高性能GPU代码自动生成方法.申请号:201610490507.9
[4]     卢兴敬,邢明杰,刘雷.一种优化模式自动生成方法及优化装置.申请号:201510863780.7
[5]     吕方 李晶 吴承勇 刘雷 吴远 刘向华. 一种任务调度方法与系统. 申请号201310329615.4


  • 编译优化


A. 论文
[1] Chenxi Wang,Ting Cao,John Zigman,Fang Lv,Yunquan Zhang, Xiaobing Feng. Efficient Management for Hybrid Memory in Managed Language Runtime, Network and Parallel Computing. 2016.
[2] Fang Lv, Lei Liu, Hui-min Cui,Lei Wang, Ying Liu, Xiao-bing Feng, Pen-Chung Yew. WiseThrottling: a new asynchronous task scheduler for mitigating I/O bottleneck in large-scale datacenter servers. April 2015.
[3] Fang Lv, Hui-Min Cui, Lei Wang, Lei Liu Cheng-Gang Wu, Xiao-Bing Feng, Pen-Chung Yew. Dynamic I/O-Aware Scheduling for Batch-Mode Applications on Chip Multiprocessor Systems of Cluster Platforms. Journal of Computer Science and Technology. Vol. 29 (1): 21-37. 2014.
[4] 吕方,崔慧敏,霍玮,冯晓兵. 面向并发性能下降的调度策略的综述.计算机研究与发展.2014, 51(1):17-30.
[5] 刘颖, 吕方, 王蕾, 陈莉, 崔慧敏, 冯晓兵. 异构并行编程模型研究与进展[J].软件学报,2014, 25(7)
[6] 刘颖,黄磊,吕方,崔慧敏,王蕾,冯晓兵.异构架构下基于放松重用距离的多平台数据布局优化. 软件学报,2016,27(8):2168−2184.
[7] Kun Ling , Tao Wang, Ying Liu, Lei Huang, Shiwen Hu, Mang Zhang, Hongjian Zhao, Tingyu Lu, Ruiqi Lian. Loongcc - A Compiler based on Open64 for MIPS64 Compatible Loongson 3 Architecture. OPEN64 Workshop at PLDI 2012.
[8] Fang Lu, Lei Wang, Xiaobing Feng, Zhiyuan Li, Zhaoqing Zhang. Exploiting idle register classes for fast spill destination. In: Proceedings of the 22nd annual international conference on Supercomputing (ICS’08).
[9] 黄磊, 冯晓兵, 吕方. 一种寄存器压力敏感的指令投机调度技术[J]. 计算机研究与发展, 2009, 46(3):485-491.
[10] 张超, 吕方, 王蕾,等. 一种基于反馈信息的地址寄存器提升方法[J]. 计算机研究与发展, 2009, 46(4):698-704.
 
B. 专利
1. 吕方,张军超,连瑞琦,吴承勇. 一种对编译器中的错误进行自动定位的方法.授权号ZL200510090093.2. 授权时间  2008-12-10
2. 吕方,冯晓兵,杨扬,张超,王蕾. 一种使用空闲寄存器类作为快速溢出空间的优化方法.  授权号ZL200810055607.4. 授权时间  2012-09-05
3. 周舒畅;黄磊;吕方. 一种使用陷阱指令优化条件转移代码性能的方法及其系统.    授权号ZL201010292061.1. 授权时间2012-10-10
4. 吕方 李晶 吴承勇 刘雷 吴远 刘向华. 一种任务调度方法与系统. 申请号201310329615.4
5.吕方,卢廷玉,郭磊等. 一种面向异OS的汇编器及汇编移植方法. 申请号201510003749.6
6. 吕方,黄磊,胡丹琪,冯晓兵. 一种非对称的数据三模冗余方法及系统. 申请号201510055815.4
7. 吕方, 王晨曦,黄磊,冯晓兵, 崔慧敏,王蕾. 一种面向可变粒度内存系统的二进制文件重写方法. 申请号 201510082216.1
8. 刘颖.一种全局数据的地址分配方法及相关装置.申请号:201210558814.8 授权时间2016-8-24
9. 刘颖.异构系统程序启动方法和装置.申请号:201310239648.X
10. 刘颖.OpenCL程序编译方法和编译器. 申请号:201310404125.6
11. 刘颖,黄磊. 轮廓技术编译的优化方法及装置. 申请号:201310539297.4
12. 刘颖,崔慧敏. 数据处理方法和中心节点. 申请号:201410331030.0
13. 刘颖,崔慧敏,冯晓兵. OpenCL运行时系统框架的设计方法及装置.申请号:201410065503.7
14. 卢廷玉,刘颖. 一种数据存储方法和装置. 申请号:201310464285.X
 
C. 软件著作权登记
大型程序运行错误分析软件Triage v1.0

  • 系统资源优化


1. Memos: A Full Hierarchy Hybrid Memory Management Framework. Lei Liu *, Hao Yang, Yong Li, Mengyao Xie, Lian Li, Chenggang Wu. The 34th International Conf. on Computer Design (ICCD):2016
2. Rethinking Memory Management in Modern Operating System: Horizontal, Vertical or Random? Lei Liu *, Yong Li, Chen Ding, Hao Yang, Chengyong Wu. IEEE Transactions on Computers (TC):2016. SIG: Trans. Version of the ISCA-2014 paper. This article was a featured article candidate in IEEE TC.
3. Going Vertical in Memory Management. Lei Liu *, et al. ACM SIGARCH Computer Architecture News:October, 2014.
4. Going Vertical in Memory Management: Handling Multiplicity by Multi-policy. Lei Liu *, Yong Li, Zehan Cui, Chengyong Wu, et al. The 41st ACM/IEEE International Symposium on Computer Architecture (ISCA):2014 (acceptance rate: 17.8%). SIG: The 9th ISCA paper in mainland China history.
5. BPM/BPM+: Software-based Dynamic Memory Partitioning Mechanisms for Mitigating DRAM Bank-/Channel-level Interferences in Multicore Systems. Lei Liu *, Zehan Cui, Yong Li, Chengyong Wu, et al. ACM Trans. on Architecture and Code Optimization (TACO):2014
6. A Software Memory Partition Approach for Eliminating Bank-level Interference in Multicore Systems. Lei Liu *, Zehan Cui, Mingjie Xing, Chengyong Wu, et al. The 21st ACM/IEEE International Conference on Parallel Architectures and Compilation Techniques (PACT):2012 (acceptance rate: 18.8%)

软件分析和安全方向


  • 软件分析技术


(1) Lian Li ,Yi Lu,Jingling Xue,Dynamic Symbolic Execution for Polymorphism,CC 2017.
(2) Feng Li, Wei Huo, Congming Chen, Lujie Zhong, Xiaobing Feng, Zhiyuan Li. Effective Fault Localization Based on Minimum Debugging Frontier Set. In CGO, 2013.
(3) Feng Li, Zhiyuan Li, Wei Huo, Xiaobing Feng, Locating Software Faults Based on Minimum Debugging Frontier Set. TSE.

附件下载: