期刊文章:[1] Jiacheng Zhao, Huimin Cui, Jingling Xue, Xiaobing Feng: Predicting Cross-Core Performance Interference on Multicore Processors with Regression Analysis. IEEE Trans. Parallel Distrib. Syst. 27(5): 1443-1456 (2016)[2] Danqi Hu, Fang Lv, Chenxi Wang, Huimin Cui, Lei Wang, Ying Liu, Xiaobing Feng: NVM Streaker: a fast and reconfigurable performance simulator for non-volatile memory-based memory architecture. The Journal of Supercomputing 74(8): 3875-3903 (2018)[3] Fang Lv, Lei Liu, Huimin Cui, Lei Wang, Ying Liu, Xiaobing Feng, Pen-Chung Yew:WiseThrottling: a new asynchronous task scheduler for mitigating I/O bottleneck in large-scale datacenter servers. The Journal of Supercomputing 71(8): 3054-3093 (2015)[4] Huimin Cui, Qing Yi, Jingling Xue, Xiaobing Feng: Layout-oblivious compiler optimization for matrix computations. TACO 9(4): 35:1-35:20 (2013)[5] Huimin Cui, Jingling Xue, Lei Wang, Yang Yang, Xiaobing Feng, Dongrui Fan: Extendable pattern-oriented optimization directives. TACO 9(3): 14:1-14:37 (2012)[6] Fang Lu, Huimin Cui, Lei Wang, Lei Liu, Chenggang Wu, Xiaobing Feng, Pen-Chung Yew: Dynamic I/O-Aware Scheduling for Batch-Mode Applications on Chip Multiprocessor Systems of Cluster Platforms. J. Comput. Sci. Technol. 29(1): 21-37 (2014)[7] Yang Yang, Huimin Cui, Xiaobing Feng, Jingling Xue: A Hybrid Circular Queue Method for Iterative Stencil Computations on GPUs. J. Comput. Sci. Technol. 27(1): 57-74 (2012)[8] Huimin Cui, Lei Wang, Dong-Rui Fan, Xiaobing Feng: Landing Stencil Code on Godson-T. J. Comput. Sci. Technol. 25(4): 886-894 (2010)会议文章:[1] Ying Liu, Lei Huang, Mingchuan Wu, Huimin Cui, Fang Lv, Xiaobing Feng, Jingling Xue: PPOpenCL: a performance-portable OpenCL compiler with host and kernel thread code fusion. CC 2019: 2-16[2] Chenxi Wang, Huimin Cui, Ting Cao, John Zigman, Haris Volos, Onur Mutlu, Fang Lv, Xiaobing Feng, Guoqing Harry Xu: Panthera: holistic memory management for big data processing over hybrid memories. PLDI 2019: 347-362[3] Jiacheng Zhao, Huimin Cui, Yalin Zhang, Jingling Xue, Xiaobing Feng: Revisiting Loop Tiling for Datacenters: Live and Let Live. ICS 2018: 328-340[4] Chunwei Xia, Jiacheng Zhao, Huimin Cui, Xiaobing Feng: Characterizing DNN Models for Edge-Cloud Computing. IISWC 2018: 82-83[5] Jiange Zhang, Qian Wang, Qing Yi, Huimin Cui: Automating the Exchangeability of Shared Data Abstractions. LCPC 2018: 185-192[6] Jiacheng Zhao, Yisong Chang, Denghui Li, Chunwei Xia, Huimin Cui, Ke Zhang, Xiaobing Feng:On Retargeting the AI Programming Framework to New Hardwares. NPC 2018: 39-51[7] Lei Wang, Liangji Zhuang, Junhang Chen, Huimin Cui, Fang Lv, Ying Liu, Xiaobing Feng: Lazygraph: lazy data coherency for replicas in distributed graph-parallel computation. PPOPP 2018: 276-289[8] Jiacheng Zhao, Huimin Cui, Jingling Xue, Xiaobing Feng: Predicting Cross-Core Performance Interference on Multicore Processors with Regression Analysis. IEEE Trans. Parallel Distrib. Syst. 27(5): 1443-1456 (2016)[9] Lei Wang, Fan Yang, Liangji Zhuang, Huimin Cui, Fang Lv, Xiaobing Feng: Articulation points guided redundancy elimination for betweenness centrality. PPOPP 2016: 7:1-7:13[10] Wenting He, Huimin Cui, Binbin Lu, Jiacheng Zhao, Shengmei Li, Gong Ruan, Jingling Xue, Xiaobing Feng, Wensen Yang, Youliang Yan: Hadoop+: Modeling and Evaluating the Heterogeneity for MapReduce Applications in Heterogeneous Clusters. ICS 2015: 143-153[11] Huimin Cui, Gong Ruan, Jingling Xue, Rui Xie, Lei Wang, Xiaobing Feng: A collaborative divide-and-conquer K-means clustering algorithm for processing large data. Conf. Computing Frontiers 2014: 20:1-20:10[12] Qing Yi, Qian Wang, Huimin Cui: Specializing Compiler Optimizations through Programmable Composition for Dense Matrix Computations. MICRO 2014: 596-608[13] Jiacheng Zhao, Xiaobing Feng, Huimin Cui, Youliang Yan, Jingling Xue, Wensen Yang:An empirical model for predicting cross-core performance interference on multicore processors. PACT 2013: 201-212[14] Huimin Cui, Qing Yi, Jingling Xue, Lei Wang, Yang Yang, Xiaobing Feng: A Highly Parallel Reuse Distance Analysis Algorithm on GPUs. IPDPS 2012: 1080-1092[15] Huimin Cui, Jingling Xue, Lei Wang, Yang Yang, Xiaobing Feng, Dongrui Fan: Extendable pattern-oriented optimization directives. CGO 2011: 107-118[16] Huimin Cui, Lei Wang, Jingling Xue, Yang Yang, Xiaobing Feng: Automatic Library Generation for BLAS3 on GPUs. IPDPS 2011: 255-265[17] Lei Wang, Huimin Cui, Yuelu Duan, Fang Lu, Xiaobing Feng, Pen-Chung Yew: An adaptive task creation strategy for work-stealing scheduling. CGO 2010: 266-277 |