Untitled
PhD in Computer Engineering
Department of Electrical and Computer Engineering
U.S.A
Email: cli17 at ncsu dot edu
"Quality is not expensive, it's priceless"
Chao Li joined Qualcomm Silicon Valley as a Senior System Engineer. He received the Ph.D. in Computer Engineering from North Carolina State University in 2016, where he worked with Prof. Huiyang Zhou. His research area lies in computer architecture and programming. As performance is not taken for granted in Dark Silicon Era, he focuses on - 'Engineering on Hardware and Software Together, Deliver High Performance!'
Research Interests:
Parallel Computing Architecture and Programming: Many-core Architecture/Accelerator, Memory System Optimization, Performance Analysis and Optimization, GPGPU.
Highly Parallel Data-Intensive Algorithms: Machine Learning (e.g. Deep Learning/DNNs), Graph Algorithms (e.g. Matrix-centered Computation).
Professional Experience:
Research Assistant, NEC Laboratories America. Princeton, 2015 Jan ~ August.
PhD Research Intern, Pacific Northwest National Lab. Richland, 2014 May~August.
Research Assistant, IBM R&D Labs. Beijing, 2012 Feb~May.
Selected Publications:
[SC] Chao Li, Yi Yang, Min Feng, Chakradhar Srimat and Huiyang Zhou. Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs. in the International Conference on High Performance Computing, Networking, Storage, and Analysis (SC'16), 2016. (Best student paper finalist).
[ICS] Chao Li, Shuaiwen Leon Song, Hongwen Dai, Albert Sidelnik, Siva Kumar Sastry Hari and Huiyang Zhou. Locality-Driven Dynamic GPU Cache Bypassing. The 29th ACM International Conference on Supercomputing.
Newport Beach, CA, 2015.
[CGO] Chao Li, Yi Yang, Zhen Lin and Huiyang Zhou. Automatic Data Placement into GPU On-chip Memory Resources. The 13th ACM/IEEE International Symposium on Code Generation and Optimization, Bay Area, CA, 2015.
[PPoPP] Shengen Yan, Chao Li, Yunquan Zhang and Huiyang Zhou. yaSpMV:Yet Another SpMV Framework on GPUs. The 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Orlando,
FL, 2014.
[IPDPSW] Chao Li, Yunquan Zhang, Changwen Zheng
and Xiaohui Hu. Implementing High-Performance Intensity Model with BlurEffect on GPUs for Large-scale Image Simulation. The 26th IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, Shanghai, 2012.