Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization

Abstract

We demonstrate that we are able to achieve real-time inference on mobile devices with model pruning and compiler optimization for various DNN applications including super resolution, style transfer and coloring.

Date
Jan 15, 2021 1:00 PM
Location
online meeting
Pu Zhao
Pu Zhao
Research Assistant Professor