Home

Recent Selected Papers (Huimin Cui)


PLDI'19
Panthera: Holistic Memory Management for Big Data Processing over Hybrid Memories
Chengxi Wang, Huimin Cui, Ting Cao, John Zigman, Haris Volos, Onur Mutlu, Fang Lv, Xiaobing Feng, Guoqing Harry Xu
PLDI'2019

CC'19
PPOpenCL: a performance-portable OpenCL compiler with host and kernel thread code fusion
Ying Liu, Lei Huang, Mingchuan Wu, Huimin Cui, Fang Lv, Xiaobing Feng, Jingling Xue
CC'2019

NPC'18
On Retargeting the AI Programming Framework to New Hardwares. NPC, 2018
Jiacheng Zhao, Yisong Chang, Denghui Li, Chunwei Xia, Huimin Cui, Ke Zhang and Xiaobing Feng
NPC'2018

LCPC'18
Automating the Exchangeability of Shared Data Abstractions
Jiange Zhang, Qian Wang, Qing Yi, and Huimin Cui
LCPC'2018

JSC'18
NVM Streaker - a fast and reconfigurable performance simulator for non-volatile memory-based memory architecture
Danqi Hu, Fang Lv, Chenxi Wang, Huimin Cui, Lei Wang, Ying Liu, Xiaobing Feng
Journal of Supercomputing'2018

ICS'18
Revisiting Loop Tiling for Datacenters - Live and Let Live
Jiacheng Zhao, Huimin Cui, Yalin Zhang, Jingling Xue, Xiaobing Feng
ICS'2018

PPoPP'18
Lazygraph lazy data coherency for replicas in distributed graph-parallel computation
Lei Wang, Liangji Zhuang, Junhang Chen, Huimin Cui, Fang Lv, Ying Liu, Xiaobing Feng
PPoPP'2018

PPoPP'16
Articulation Points Guided Redundancy Elimination for Betweenness Centrality.
Lei Wang, Fan Yang, Liangji Zhuang, Huimin Cui, Fang Lv, Xiaobing Feng
PPoPP'2016

TPDS'15
Predicting Cross-Core Performance Interference on Multicore Processors with Regression Analysis.
Jiacheng Zhao, Huimin Cui, Jingling Xue, Xiaobing Feng
TPDS'2016

ICS'15
Hadoop+: Modeling and Evaluating the Heterogeneity for MapReduce Applications in Heterogeneous Clusters.
Wenting He, Huimin Cui, Binbin Lu, Jiacheng Zhao, Shengmei Li, Gong Ruan, Jingling Xue, Xiaobing Feng, Wensen Yang and Youliang Yan
ICS'2015

MICRO'14
Specializing Compiler Optimizations Through Programmable Composition For Dense Matrix Computations.
Qing Yi, Qian Wang and Huimin Cui
MICRO'2014

CF'14
A Collaborative Divide-and-Conquer K-Means Clustering Algorithm for Processing Large Data.
Huimin Cui, Gong Ruan, Jingling Xue, Rui Xie, Lei Wang and Xiaobing Feng
CF'2014

JCST'14
Dynamic I/O-Aware Scheduling for Batch-Mode Applications on Chip Multiprocessor Systems of Cluster Platforms.
Fang Lu, Huimin Cui, Lei Wang, Lei Liu, Cheng-Gang Wu, Xiao-Bing Feng, and Pen-Chung Yew
JCST'2014

PACT'13
An Empirical Model for Predicting Cross-Core Performance Interference on Multicore Processors.
Jiacheng Zhao, Huimin Cui, Jingling Xue, Xiaobing Feng, Youliang Yan and Wensen Yang
22nd International Conference on Parallel Architectures and Compilation Techniques (PACT), Edinburgh, 2013

HiPEAC'13
Layout-oblivious compiler optimization for matrix computations.
Huimin Cui, Qing Yi, Jingling Xue, Xiaobing Feng
The 8th International Conference on High-Performance Embedded Architectures and Compilers (HiPEAC'13), Berlin, Germany, 2013

TACO'13
Layout-oblivious compiler optimization for matrix computations.
Huimin Cui, Qing Yi, Jingling Xue, Xiaobing Feng
ACM Transactions on Architecture and Code Optimization (TACO)
9(4): 35 (2013)

JCST'12
A Hybrid Circular Queue Method for Iterative Stencil Computations on GPUs.
Yang Yang, Huimin Cui, Xiaobing Feng, Jingling Xue
Journal of Computer Science and Technology (JCST)
27(1): 57-74 (2012)

TACO'12
Extendable pattern-oriented optimization directives.
Huimin Cui, Jingling Xue, Lei Wang, Yang Yang, Xiaobing Feng, Dongrui Fan
ACM Transactions on Architecture and Code Optimization (TACO)
9(3): 14 (2012)

PACT'12
Layout-oblivious optimization for matrix computations.
Huimin Cui, Qing Yi, Jingling Xue, Xiaobing Feng
21st International Conference on Parallel Architectures and Compilation Techniques
429-430 (Poster)

IPDPS'12
A Highly Parallel Reuse Distance Analysis Algorithm on GPUs.
Huimin Cui, Qing Yi, Jingling Xue, Lei Wang, Yang Yang, Xiaobing Feng
26th International Parallel and Distributed Processing Symposium
1080-1092

CGO'11
Extendable pattern-oriented optimization directives
Huimin Cui, Jingling Xue, Lei Wang, Yang Yang, Xiaobing Feng, Dongrui Fan
9th Annual IEEE/ACM International Symposium on Code Generation and Optimization
107-118

IPDPS'11
Automatic Library Generation for BLAS3 on GPUs
Huimin Cui, Lei Wang, Jingling Xue, Yang Yang, Xiaobing Feng
25th International Parallel and Distributed Processing Symposium
255-265

JCST'10
Landing Stencil Code on Godson-T.
Huimin Cui, Lei Wang, Dong-Rui Fan, Xiaobing Feng
Journal of Computer Science and Technology (JCST)
25(4): 886-894 (2010)

CGO'10
An adaptive task creation strategy for work-stealing scheduling.
Lei Wang, Huimin Cui, Yuelu Duan, Fang Lu, Xiaobing Feng, Pen-Chung Yew
8th Annual IEEE/ACM International Symposium on Code Generation and Optimization
(2010)

ARCS'07
Optimized Register Renaming Scheme for Stack-Based x86 Operations.
Xuehai Qian, He Huang, Zhenzhong Duan, Junchao Zhang, Nan Yuan, Yongbin Zhou, Hao Zhang, Huimin Cui, Dongrui Fan
20th International Conference on Architecture of Computing Systems
43-56