We demonstrate that we are able to achieve real-time inference on mobile devices with model pruning and compiler optimization for various DNN applications including super resolution, style transfer and coloring.