Bin Ren

Bin Ren joined the Computer Science Department of William & Mary in Fall 2016. He received his Ph.D from the Department of Computer Science and Engineering, Ohio State University in 2014 under the supervision of Prof. Gagan Agrawal. He was a post-doctoral research associate in High Performance Computing group of Pacific Northwest National Laboratory with Dr. Sriram Krishnamoorthy from 2014 to 2016.

    Research interests:
  • Parallel Computing & High-Performance Computing
  • Compiler Techniques
  • Real-Time Machine Learning
  • Machine Learning Systems

Honors and Awards

  • NSF CAREER Award, 2021
  • Best Paper Award, SC 2020
  • Best Student Paper Nomination, SC 2020
  • Jeffress Trust Award, 2020
  • ISLPED Design Contest First Place, 2020
  • Student Cluster Reproducibility Challenge Paper, SC 2019
  • Best Paper Award, CGO 2013
  • SIGPLAN Research Highlights, 2013
  • Recent or Selected Publications (full list, my Ph.D. advisees are highlighted)

    [MICRO'2022] Wei Niu, Jiexiong Guan, Xipeng Shen, Yanzhi Wang, Gagan Agrawal, Bin Ren, GCD^2: A Globally Optimizing Compiler for Mapping DNNs to Mobile DSPs, 55th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2022.

    [ECCV'2022] Yushu Wu, Yifan Gong, Pu Zhao, Yanyu Li, Zheng Zhan, Wei Niu, Hao Tang, Minghai Qin, Bin Ren, and Yanzhi Wang, Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution, in Proc. of European Conference on Computer Vision (ECCV), 2022.

    [ECCV'2022] Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Wei Niu, Mengshu Sun, Xuan Shen, Geng Yuan, Bin Ren, Hao Tang, Minghai Qin, and Yanzhi Wang, SPViT: Enabling Faster Vision Transformers via Soft Token Pruning, in Proc. of European Conference on Computer Vision (ECCV), 2022.

    [IPDPS'2022] Qihan Wang, Bin Ren, Jie Chen, Robert G. Edwards, MICCO: An Enhanced Multi-GPU Scheduling Framework for Many-Body Correlation Functions, The 36th IEEE International Parallel & Distributed Processing Symposium (IPDPS), June 2022. (Acceptance rate of the 1st round: 9.7% = 46/474).

    [TACO'2022] Qihan Wang, Zhen Peng, Bin Ren, Jie Chen, Robert G. Edwards, MemHC: An Optimized GPU Memory Management Framework for Accelerating Many-body Correlation, ACM Transactions on Architecture and Code Optimization (ACM TACO), June 2022. (Original Work)

    [TECS'2022] Geng Yuan, Mengshu Sun, Wei Niu, Zhengang Li, Yuxuan Cai, Yanyu Li, Jun Liu, Weiwen Jiang, Xue Lin, Bin Ren, Xulong Tang, Yanzhi Wang, Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework, ACM Transactions on Embedded Computing Systems (TECS).

    [TPDS'2022] Mert Hidayetoglu, Tekin Bicer, Simon Garcia de Gonzalo, Bin Ren, Doga Gursoy, Rajkumar Kettimuthu, Ian T. Foster, Wen-Mei W. Hwu, MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging, IEEE Transactions on Parallel and Distributed Systems. 33(9): 2014-2031 (2022).

    [TODAES'2021] Yifan Gong*, Geng Yuan*, Zheng Zhan, Wei Niu, Zhengang Li, Pu Zhao, Yuxuan Cai, Sijia Liu, Bin Ren, Xue Lin, Xulong Tang, and Yanzhi Wang, Automatic mapping of the best-suited DNN pruning schemes for real-time mobile acceleration, ACM Transactions on Design Automation of Embedded Systems.

    [NeurIPS'2021 Spotlight: Top 3%] Geng Yuan, Xiaolong Ma, Wei Niu, Zhengang Li, Zhenglun Kong, Ning Liu, Yifan Gong, Zheng Zhan, Chaoyang He, Qing Jin, Siyue Wang, Minghai Qin, Bin Ren, Yanzhi Wang, Sijia Liu, Xue Lin, MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge, Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS), December 2021. (Acceptance rate: 2344/9122 = 26%, Spotlight 3%)

    [ICCV'2021] Zheng Zhan, Yifan Gong, Pu Zhao, Geng Yuan, Wei Niu, Yushu Wu, Tianyun Zhang, Malith Jayaweera, David Kaeli, Bin Ren, Xue Lin, Yanzhi Wang, Achieving On-Mobile Real-Time Super-Resolution With Neural Architecture and Pruning Search, International Conference on Computer Vision (ICCV), October 2021. (Acceptance rate: 25.9%)

    [PLDI'2021] Wei Niu, Jiexiong Guan, Yanzhi Wang, Gagan Agrawal, and Bin Ren, DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion, 42nd ACM SIGPLAN Conference on Programming Language Design and Implementation, 2021. (Acceptance rate: 87/320 = 27%)

    [TPAMI'2021] Wei Niu*, Zhenggang Li*, Xiaolong Ma, Peiyan Dong, Gang Zhou, Xuehai Qian, Xue Lin, Yanzhi Wang, and Bin Ren, GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices based on Fine-Grained Structured Weight Sparsity, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021 (impact factor: 17.86).

    [ESEC/FSE'2021] Jialiang Tan*, Yu Chen*, Zhenming Liu, Bin Ren, Shuaiwen Leon Song, Xipeng Shen, and Xu Liu (*equal contribution), Toward Efficient Interactions between Python and Native Libraries, 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, August 2021. (Acceptance rate: 97/396 = 24.5%)

    [ICS'2021] Chengming Zhang, Geng Yuan, Wei Niu, Jiannan Tian, Sian Jin, Donglin Zhuang, Zhe Jiang, Yanzhi Wang, Bin Ren, Shuaiwen Leon Song, and Dingwen Tao, ClickTrain: Efficient and Accurate End-to-End Deep LearningTraining via Fine-Grained Architecture-Preserving Pruning, 35th ACM International Conference on Supercomputing, June 2021. (Acceptance Rate: 38/157 = 24%)

    [CVPR'2021 Oral Paper: top 5%] Zhengang Li, Geng Yuan, Wei Niu, Yanyu Li, Pu Zhao, Yuxuan Cai, Xuan Shen, Zheng Zhan, Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Yanzhi Wang, Bin Ren, and Xue Lin, NPAS: A compiler-aware framework of unified network pruning and architecture search for beyond real-time mobile acceleration. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021. Acceptance rate: 21.2% (1593/7500)

    [CACM'2021] Hui Guan, Shaoshan Liu, Xiaolong Ma, Wei Niu, Bin Ren, Xipeng Shen, Yanzhi Wang, Pu Zhao (authors in alphabet order), CoCoPIE: Making Mobile AI Sweet as PIE - Compression-Compilation Co-Design Goes a Long Way, Communications of the ACM (CACM), 2021.

    [DAC'2021] Pu Zhao, Geng Yuan, Yuxuan Cai, Wei Niu, Qi Liu, Wujie Wen, Bin Ren, Yanzhi Wang, and Xue Lin, Neural Pruning Search for Real-Time Object Detection of Autonomous Vehicles, 57th Annual Design Automation Conference (DAC), 2021.

    [AAAI'2021] Wei Niu*, Mengshu Sun*, Zhengang Li*, Jou-An Chen, Jiexiong Guan, Xipeng Shen, Yanzhi Wang, Xue Lin, and Bin Ren (*equal contribution), Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices, Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI), 2021 (Acceptance Rate: 20.9%).

    [AAAI'2021] Yuxuan Cai, Hongjia Li, Geng Yuan, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, and Yanzhi Wang, YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design, Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI), 2021 (Acceptance Rate: 20.9%).

    [SC'2020] Mert Hidayetoglu, Tekin Bicer, Simon Garcia de Gonzalo, Bin Ren, Vincent De Andrade, Doga Gursoy, Rajkumar Kettimuthu, Ian Foster, and Wen-mei Hwu, Petascale XCT: 3D Image Reconstruction with Hierarchical Communications on Multi-GPU Nodes, The 2020 ACM International Conference for High Performance Computing, Networking, Storage and Analysis (SC), November, 2020 (Best Paper Award and Best Student Paper Finalist). (Acceptance Rate: TBA)

    [ASE'2020] Hongyu Liu*, Ruiqin Tian*, Bin Ren, and Tongping Liu (*equal contribution), Prober: Practically Defending Overflows with Page Protection, The 35th IEEE/ACM International Conference on Automated Software Engineering (ASE), September, 2020. (Acceptance Rate: 93/414 = 22.5%).

    [ECCV'2020] Xiaolong Ma*, Wei Niu*, Tianyun Zhang, Sijia Liu, Sheng Lin, Hongjia Li, Xiang Chen, Jian Tang, Kaisheng Ma, Bin Ren, and Yanzhi Wang (*equal contribution), An Image Enhancing Pattern-based Sparsity for Real-Time Inference on Mobile Devices, The 16th European Conference on Computer Vision (ECCV), August, 2020. (Acceptance Rate: TBA)

    [ICML'2020] Yu Chen, Zhenming Liu, Bin Ren, and Xin Jin, On Efficient Constructions of Checkpoints, The 37th International Conference on Machine Learning, July, 2020. (Acceptance Rate: 1,088/4,990 = 21.8%).

    [IJCAI'2020 (demo)] Wei Niu*, Pu Zhao*, Zheng Zhan, Xue Lin, Yanzhi Wang, and Bin Ren (*equal contribution), Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization, in Proc. of IJCAI, 2020. (Acceptance Rate: 15.8%). Demo links: Youtube, Bilibili

    [ICS'2020] Ruoming Jin*, Zhen Peng*, Wendell Wu, Feodor Dragan, Gagan Agrawal, and Bin Ren (*equal contribution), Parallelizing Pruned Landmark Labeling: Dealing with Dependencies in Graph Algorithms, The 34th ACM International Conference on Supercomputing, June, 2020. (Acceptance rate: 40/132 = 30%).

    [ICS'2020] Tyler Coy, Shuibing He, Bin Ren, and Xuechen Zhang, Compiler Aided Checkpointing using Crash-Consistent Data Structures in NVMM Systems, The 34th ACM International Conference on Supercomputing, June, 2020. (Acceptance rate: 40/132 = 30%).

    [DAC'2020] Peiyan Dong, Siyue Wang, Wei Niu, Chengming Zhang, Sheng Lin, Zhengang Li, Yifan Gong, Bin Ren, Xue Lin, and Dingwen Tao, RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition, The 57th Annual Design Automation Conference, July, 2020. (Acceptance rate: TBA).

    [ASPLOS'2020] Wei Niu, Xiaolong Ma, Sheng Lin, Shihao Wang, Xuehai Qian, Xue Lin, Yanzhi Wang, and Bin Ren, PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning, The 25th International Conference on Architectural Support for Programming Languages and Operating Systems, March, 2020 (Acceptance Rate: 86/476 = 18%).

    [CGO'2020] Yu Chen, Ivy B. Peng, Zhen Peng, Xu Liu, and Bin Ren, ATMem: Adaptive Data Placement in Graph Applications on Heterogeneous Memories, The 2020 International Symposium on Code Generation and Optimization, February, 2020 (Acceptance Rate: 26/95 = 27%).

    [AAAI'2020] Xiaolong Ma, Fu-Ming Guo, Wei Niu, Xue Lin, Jian Tang, Kaisheng Ma, Bin Ren, and Yanzhi Wang, PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Device, The 34th AAAI Conference on Artificial Intelligence, February, 2020 (Acceptance Rate: 1,591/7,737 = 20.6%).

    [TPDS'2020] Soklong Lim, Tyler Coy, Zaixin Lu, Bin Ren, and Xuechen Zhang, NVGRAPH: Enforcing Crash Consistency of Evolving Network Analytics in NVMM Systems, The IEEE Transactions on Parallel and Distributed Systems (TPDS), January, 2020.

    [TOPC'2019] Bin Ren, Shruthi Balakrishna, Youngjoon Jo, Sriram Krishnamoorthy, Kunal Agrawal, and Milind Kulkarni, Extracting SIMD Parallelism from Recursive Task-Parallel Programs, The ACM Transaction on Parallel Computing (TOPC), September, 2019.

    [SC'2019] Mert Hidayetoglu, Tekin Bicer, Simon Garcia de Gonzalo, Bin Ren, Doga Gursoy, Rajkumar Kettimuthu, Ian T. Foster, and Wen-Mei W. Hwu, MemXCT: Memory-Centric X-Ray CT Reconstruction with Massive Parallelization, The International Conference for High Performance Computing, Networking, Storage, and Analysis, November, 2019 (SC20 Student Cluster Reproducibility Challenge Paper). (Acceptance Rate: 87/344 = 25%)

    [PACT'2019] Soklong Lim, Zaixin Lu, Bin Ren, and Xuechen Zhang, Enforcing Crash Consistency of Evolving Network Analytics in Non-Volatile Main Memory Systems, The 28th International Conference on Parallel Architecture and Compilation Techniques, September, 2019. (Acceptance Rate: 26/126 = 21%)

    [CGO'2019] Ruiqin Tian*, Junqiao Qiu*, Zhijia Zhao, Xu Liu, and Bin Ren (* co-primary), Transforming Query Sequences for High-Throughput B+ Tree Processing on Many-core Processors, The 2019 International Symposium on Code Generation and Optimization (CGO), February, 2019. (Acceptance Rate: 21/69 = 31%)

    [PACT'2018] Zhen Peng, Alexander Powell, Bo Wu, Tekin Bicer, and Bin Ren, GraphPhi: Efficient Parallel Graph Processing on Emerging Throughput-oriented Architectures, The 27th International Conference on Parallel Architectures and Compilation Techniques (PACT), November, 2018. (Acceptance Rate: 36/126 = 29%)

    [PPoPP'2017] Bin Ren, Sriram Krishnamoorthy, Kunal Agrawal, and Milind Kulkarni, Exploiting Vector and Multicore Parallelism for Recursive Data- and Task-Parallel Programs, The 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), February, 2017. (Acceptance Rate: 29/132 = 22%)

    [TACO'2017] Mehmet Can Kurt, Sriram Krishnamoorthy, Gagan Agrawal, and Bin Ren, User-Assisted Store Recycling for Dynamic Task Graph Schedulers, The ACM Transactions on Architecture and Code Optimization (TACO), January, 2017. (Original work, invited to HiPEAC'17)

    [PACT'2016] Junqiao Qiu, Zhijia Zhao, and Bin Ren, MicroSpec: Fine-Grained Speculative Parallelization for FSM Computations, The 25th International Conference on Parallel Architecture and Compilation Techniques (PACT), September, 2016. (Acceptance Rate: 31/119 = 26%) [pdf]

    [ICPP'2016] Jeff Daily, Ananth Kalyanaraman, Sriram Krishnamoorthy, and Bin Ren, On the Impact of Widening Vector Registers on Sequence Alignment, The 45th annual International Conference on Parallel Processing (ICPP), August, 2016. (Acceptance Rate: 53/251 = 21%)

    [PLDI'2015] Bin Ren, Youngjoon Jo, Sriram Krishnamoorthy, Kunal Agrawal, and Milind Kulkarni, Efficient Execution of Recursive Programs on Commodity Vector Hardware, The 36th annual ACM SIGPLAN conference on Programming Language Design and Implementation (PLDI), June, 2015. (Acceptance Rate: 58/303 = 19%) [pdf]

    [IPDPS'2015] Linchuan Chen, Xin Huo, Bin Ren, Surabhi Jain, and Gagan Agrawal, Efficient and Simplified Parallel Graph Processing over CPU and MIC, The 29th IEEE International Parallel & Distributed Processing Symposium (IPDPS), May, 2015. (Acceptance Rate: 108/496 = 21%) [pdf]

    [ICS'2014] Xin Huo, Bin Ren, and Gagan Agrawal, A Programming System for Xeon Phis with Runtime SIMD Parallelization, The 28th International Conference on Supercomputing (ICS), June, 2014. (Acceptance Rate: 34/162 = 21%) [pdf]

    [TACO'2014] Bin Ren, Todd Mytkowicz, and Gagan Agrawal, A Portable Optimization Engine for Accelerating Irregular Data-Traversal Applications on SIMD Architectures, The ACM Transactions on Architecture and Code Optimization (TACO), 2014. [pdf]

    [CGO'2013] Bin Ren, Gagan Agrawal, James R. Larus, Todd Mytkowicz, Tomi Poutanen, and Wolfram Schulte, SIMD Parallelization of Applications that Traverse Irregular Data Structures, The 2013 International Symposium on Code Generation and Optimization (CGO), February, 2013 (CACM Research Highlight Nomination, SIGPLAN Research Highlight, Best Paper Award). (Acceptance Rate: 33/117 = 28%) [pdf]

    [PACT'2011] Bin Ren, and Gagan Agrawal, Compiling Dynamic Data Structure in Python to Enable the Use of Multi-core and Many-core Libraries, The 20th International Conference on Parallel Architectures and Compilation Techniques (PACT), October, 2011 (Acceptance Rate: 36/221 = 16%).[pdf]

    Students

  • Ruiqin Tian (Ph.D. student, Fall 2015, Graduated in Spring 2021)
  • Zhen Peng (Ph.D. student, Fall 2016)
  • Qihan Wang (Ph.D. student, Fall 2017)
  • Yu Chen (Ph.D. student, Fall 2018, co-advised w/ Zhenming Liu, Andreas Stathopoulos)
  • Wei Niu (Ph.D. student, Fall 2018)
  • Jiexiong Guan (Ph.D. student, Fall 2020)
  • Alexander Powell (Master, Graduated in Spring 17)
  • Eunyoung Cho (Master, Graduated in Spring 17)
  • Xiaoying Zhai (Master, Graduated in Spring 19)
  • Shuxin Zou (Master, Graduated in Spring 19)
  • Professional Services

  • Organization Committee: ICS'21 (Publication Chair), IISWC'20 (Registration Chair), ICS'18 (Submission Chair)
  • Organization Committee (Workshop): HIPS'21 (Workshop co-chair)
  • Program Committee (Conference): PPoPP'22, ICS'21, ICDCS'21, IPDPS'22/21, ICCD'20, LCTES'21/20, PACT'19/18
  • Program Committee (Conference, Continued): HPDC'19, ICPP'20/15, HiPC'19/18/17/15
  • Program Committee (Workshop): GPGPU'20/19, WOLFHPC'17/16, IPDRM'19/17, HIPS'20/19/16/15
  • Journal Reviewer: TPDS, TACO, CSUR, SmartGrid, IJPP, JPDC, TKDE
  • Conference Reviewer: ICCV'21, CVPR'21, AAAI'21, IPDPS'16/15, ICPP'16, PPoPP'15, PACT'15, CCGrid'15
  • Teaching

  • SP2022, CS680 Compiler and Parallel Computing
  • AU2021, CS304 Computer Organization
  • SP2021, CS304 Computer Organization
  • AU2020, CS680 Compiler and Parallel Computing
  • SP2020, CS680 Compiler and Parallel Computing
  • AU2019, CS304 Computer Organization
  • SP2019, CS680 Compiler and Parallel Computing
  • AU2018, CS304 Computer Organization
  • SP2018, CS780 Compiler Optimization for High Performance Computing
  • AU2017, CS304 Computer Organization
  • SP2017, CS304 Computer Organization
  • AU2016, CS680 Compiler Optimization for High Performance Computing
  • AU2013, CSE2421 System I: Introduction to Low Level Programming and Computer Organization
  • SP2013, CSE2421 System I: Introduction to Low Level Programming and Computer Organization
  • SU2010, CSE459.23 Programming in Java, CSE459.22 Programming in C++
  • SP2010, CSE459.22 Programming in C++
  • WI2010, CSE459.23 Programming in Java