Huimin Cui 崔慧敏
PhD, Professor
State Key Laboratory of Computer Architecture
Institute of Computing Technology, CAS
Beijing, 100190
Office: Room 0610, ICT
Phone: 6260-0610
Email: cuihm@ict.ac.cn
B.S. Tsinghua University;
M.S. Tsinghua University;
Ph.D. Institute of Computing Technology, CAS.
Research Interests
My research interests are programming languages, compiler construction, and compiler optimization.
My current projects include parallel programming design and implementation for AI chips, compiler optimizations for heterogeneous architectures, programming environments for emerging applications, and agile compiler construction.
My PhD thesis is on compiler construction and optimization, especially a new compilation methodology that integrates domain experts' optimization experience into compilers.
The purpose of this research is to enable the efficient development of domain-specific compilers, thus closing the performance gap between domain experts' hand-tuned code and compiler-generated code.
This is my PhD thesis (in Chinese).
Recent Publications
2019
Chenxi Wang, Huimin Cui, Ting Cao, John Zigman, Haris Volos, Onur Mutlu, Fang Lv, Xiaobing Feng, Guoqing Harry Xu: Panthera: Holistic Memory Management for Big Data Processing over Hybrid Memories. PLDI, 2019 pdf.
Ying Liu, Lei Huang, Mingchuan Wu, Huimin Cui, Fang Lv, Xiaobing Feng, Jingling Xue: PPOpenCL: a performance-portable OpenCL compiler with host and kernel thread code fusion. CC, 2019 pdf.
2018
Jiacheng Zhao, Yisong Chang, Denghui Li, Chunwei Xia, Huimin Cui, Ke Zhang and Xiaobing Feng: On Retargeting the AI Programming Framework to New Hardwares. NPC, 2018 pdf.
Jiange Zhang, Qian Wang, Qing Yi, and Huimin Cui: Automating the Exchangeability of Shared Data Abstractions. LCPC, 2018 pdf.
Danqi Hu, Fang Lv, Chenxi Wang, Huimin Cui, Lei Wang, Ying Liu, Xiaobing Feng: NVM Streaker - a fast and reconfigurable performance simulator for non-volatile memory-based memory architecture. Journal of Supercomputing, 2018 pdf.
Jiacheng Zhao, Huimin Cui, Yalin Zhang, Jingling Xue, Xiaobing Feng: Revisiting Loop Tiling for Datacenters - Live and Let Live. ICS 2018 pdf.
Lei Wang, Liangji Zhuang, Junhang Chen, Huimin Cui, Fang Lv, Ying Liu, Xiaobing Feng: LazyGraph: lazy data coherency for replicas in distributed graph-parallel computation. PPoPP 2018 pdf.
2016
Lei Wang, Fan Yang, Liangji Zhuang, Huimin Cui, Fang Lv, Xiaobing Feng: Articulation Points Guided Redundancy Elimination for Betweenness Centrality. PPoPP 2016 pdf.
Jiacheng Zhao, Huimin Cui, Jingling Xue, Xiaobing Feng: Predicting Cross-Core Performance Interference on Multicore Processors with Regression Analysis. IEEE Transactions on Parallel and Distributed Systems 2016 pdf.
2015
Wenting He, Huimin Cui, Binbin Lu, Jiacheng Zhao, Shengmei Li, Gong Ruan, Jingling Xue, Xiaobing Feng, Wensen Yang and Youliang Yan: Hadoop+: Modeling and Evaluating the Heterogeneity for MapReduce Applications in Heterogeneous Clusters. ICS 2015 pdf.
2014
Qing Yi, Qian Wang and Huimin Cui: Specializing Compiler Optimizations Through Programmable Composition For Dense Matrix Computations. MICRO 2014 pdf.
Huimin Cui, Gong Ruan, Jingling Xue, Rui Xie, Lei Wang and Xiaobing Feng: A Collaborative Divide-and-Conquer K-Means Clustering Algorithm for Processing Large Data. CF 2014 pdf.
Fang Lu, Huimin Cui, Lei Wang, Lei Liu, Cheng-Gang Wu, Xiao-Bing Feng, and Pen-Chung Yew: Dynamic I/O-Aware Scheduling for Batch-Mode Applications on Chip Multiprocessor Systems of Cluster Platforms. JCST 2014 pdf.
2013
Jiacheng Zhao, Huimin Cui, Jingling Xue, Xiaobing Feng, Youliang Yan and Wensen Yang: An Empirical Model for Predicting Cross-Core Performance Interference on Multicore Processors. PACT 2013 pdf.
Huimin Cui, Qing Yi, Jingling Xue, Xiaobing Feng: Layout-oblivious compiler optimization for matrix computations. HiPEAC'2013 pdf.
Huimin Cui, Qing Yi, Jingling Xue, Xiaobing Feng: Layout-oblivious compiler optimization for matrix computations. TACO 2013: 9(4): 35 (2013) pdf.
2012
Yang Yang, Huimin Cui, Xiaobing Feng, Jingling Xue: A Hybrid Circular Queue Method for Iterative Stencil Computations on GPUs. JCST 2012: 27(1): 57-74 (2012) pdf.
Huimin Cui, Jingling Xue, Lei Wang, Yang Yang, Xiaobing Feng, Dongrui Fan: Extendable pattern-oriented optimization directives. TACO 2012: 9(3): 14 (2012) pdf.
Huimin Cui, Qing Yi, Jingling Xue, Xiaobing Feng: Layout-oblivious optimization for matrix computations. PACT 2012 (Poster): 429-430 pdf.
Huimin Cui, Qing Yi, Jingling Xue, Lei Wang, Yang Yang, Xiaobing Feng: A Highly Parallel Reuse Distance Analysis Algorithm on GPUs. IPDPS 2012: 1080-1092 pdf.
2011
Huimin Cui, Jingling Xue, Lei Wang, Yang Yang, Xiaobing Feng, Dongrui Fan: Extendable pattern-oriented optimization directives. CGO 2011: 107-118 pdf.
Huimin Cui, Lei Wang, Jingling Xue, Yang Yang, Xiaobing Feng: Automatic Library Generation for BLAS3 on GPUs. IPDPS 2011: 255-265 pdf.
2010
Huimin Cui, Lei Wang, Dong-Rui Fan, Xiaobing Feng: Landing Stencil Code on Godson-T. J. Comput. Sci. Technol. 25(4): 886-894 (2010) pdf.
Lei Wang, Huimin Cui, Yuelu Duan, Fang Lu, Xiaobing Feng, Pen-Chung Yew: An adaptive task creation strategy for work-stealing scheduling. CGO 2010: 266-277.
2007
Xuehai Qian, He Huang, Zhenzhong Duan, Junchao Zhang, Nan Yuan, Yongbin Zhou, Hao Zhang, Huimin Cui, Dongrui Fan: Optimized Register Renaming Scheme for Stack-Based x86 Operations. ARCS 2007: 43-56.