Publications

(2025). Sparse Learning for State Space Models on Mobile. In ICLR 2025.

PDF Project

(2025). RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation. In ICASSP 2025.

PDF Project DOI

(2025). QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge. In CVPR 2025.

PDF Code Poster

(2025). Toward Adaptive Large Language Models Structured Pruning via Hybrid-grained Weight Importance Assessment. In AAAI 2025.

PDF Project DOI

(2025). Numerical Pruning for Efficient Autoregressive Models. In AAAI 2025.

PDF Project DOI

(2025). LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers. In AAAI 2025.

PDF Code Project DOI

(2025). Q-TempFusion: Quantization-Aware Temporal Multi-Sensor Fusion on Bird's-Eye View Representation. In WACV 2025.

PDF Project

(2024). Search for Efficient Large Language Models. In NeurIPS 2024.

PDF Code Project

(2024). Fast and Memory-Efficient Video Diffusion Using Streamlined Inference. In NeurIPS 2024.

PDF Code Project

(2024). Exploring Token Pruning in Vision State Space Models. In NeurIPS 2024.

PDF Code Project Poster

(2024). Rethinking Token Reduction for State Space Models. In EMNLP 2024.

PDF Code Project DOI

(2024). Pruning Foundation Models for High Accuracy without Retraining. In EMNLP 2024 Findings.

PDF Project DOI

(2024). TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform. In TCAD.

PDF Project DOI

(2024). HotaQ: Hardware Oriented Token Adaptive Quantization for Large Language Models. In TCAD.

PDF Project DOI

(2024). InstructGIE: Towards Generalizable Image Editing. In ECCV 2024.

PDF Code Project Poster DOI

(2024). DiffClass: Diffusion-Based Class Incremental Learning. In ECCV 2024.

PDF Code Project DOI

(2024). Neural architecture search for adversarial robustness via learnable pruning. In Front. HPC.

PDF Project DOI

(2024). FasterVD: On Acceleration of Video Diffusion Models. In IJCAI Demo 2024.

PDF Project DOI

(2024). Lotus: learning-based online thermal and latency variation management for two-stage detectors on edge devices. In DAC 2024.

PDF Code Project DOI

(2023). The Autonomous Vehicle Assistant (AVA): Emerging technology design supporting blind and visually impaired travelers in autonomous transportation. In Int. J. Hum. Comp. Stud..

PDF Project DOI

(2023). Pruning Parameterization with Bi-level Optimization for Efficient Semantic Segmentation on the Edge. In CVPR 2023.

PDF Project DOI

(2023). Condense: A Framework for Device and Frequency Adaptive Neural Network Models on the Edge. In DAC 2023.

PDF Project DOI

(2023). Towards Real-Time Segmentation on the Edge. In AAAI 2023.

PDF Project DOI

(2022). Advancing Model Pruning via Bi-level Optimization. In NeurIPS 2022.

PDF Code Project

(2022). All-in-One: A Highly Representative DNN Pruning Framework for Edge Devices with Dynamic Power Management. In ICCAD 2022.

PDF Project DOI

(2022). Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution. In ECCV 2022.

PDF Code Project DOI

(2022). Pruning-as-Search: Effcient Neural Architecture Search via Channel Pruning and Structural Reparameterization. In IJCAI 2022.

PDF Code Project DOI

(2022). Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations. In IJCAI 2022.

PDF Project DOI

(2022). BLCR: Towards Real-time DNN Execution with Block-based Reweighted Pruning. In ISQED 2022.

PDF Project DOI

(2021). Neural Pruning Search for Real-Time Object Detection of Autonomous Vehicles. In DAC 2021.

PDF Project DOI

(2021). Intrinsic Examples: Robust Fingerprinting of Deep Neural Networks. In BMVC 2021.

PDF

(2021). Achieving On-Mobile Real-Time Super-Resolution With Neural Architecture and Pruning Search. In ICCV 2021.

PDF Project DOI

(2021). Power management in hybrid electric vehicles using deep recurrent reinforcement learning. In Electrical Engineering.

PDF Project DOI

(2021). Characteristic Examples: High-Robustness, Low-Transferability Fingerprinting of Neural Networks. In IJCAI 2021.

PDF Project DOI

(2021). NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration. In CVPR 2021.

PDF Project DOI

(2021). CoCoPIE: Enabling Real-Time AI on Off-the-Shelf Mobile Devices via Compression-Compilation Co-Design. In CACM.

PDF Project Poster DOI

(2021). Brief Industry Paper: Towards Real-Time 3D Object Detection for Autonomous Vehicles with Pruning Search. In RTAS 2021.

PDF Project DOI

(2021). Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization. In IJCAI-PRICAI 2020.

PDF Project DOI

(2020). Exploring GPU acceleration of Deep Neural Networks using Block Circulant Matrices. In Parallel Computing.

PDF Project DOI

(2020). 3D CNN Acceleration on FPGA using Hardware-Aware Pruning. In DAC 2020.

PDF Project DOI

(2020). Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness. In ICLR 2020.

PDF Code Project

(2020). Towards query-efficient black-box adversary with zeroth-order natural gradient descent. In AAAI 2020.

PDF Project DOI

(2020). Towards certificated model robustness against weight perturbations. In AAAI 2020.

PDF Project DOI

(2019). On the Design of Black-box Adversarial Examples by Leveraging Gradient-free Optimization and Operator Splitting Method. In ICCV 2019.

PDF Code Poster DOI

(2019). Fault Sneaking Attack: a Stealthy Framework for Misleading Deep Neural Networks. In DAC 2019.

PDF Project DOI

(2019). HSIM-DNN: Hardware Simulator for Computation-, Storage- and Power-Efficient Deep Neural Networks. In GLSVLSI 2019.

PDF Project DOI

(2019). Structured Adversarial Attack: Towards General Implementation and Better Interpretability. In ICLR 2019.

PDF Code Project

(2019). Admm attack: an enhanced adversarial attack for deep neural networks with undetectable distortions. In ASP-DAC 2019.

PDF Project DOI

(2018). Reinforced Adversarial Attacks on Deep Neural Networks Using ADMM. In GlobalSIP 2018.

PDF Project DOI

(2018). Defending DNN Adversarial Attacks with Pruning and Logits Augmentation. In GlobalSIP 2018.

PDF Project DOI

(2018). Defensive Dropout for Hardening Deep Neural Networks under Adversarial Attacks. In ICCAD 2018.

PDF Project DOI

(2018). An ADMM-Based Universal Framework for Adversarial Attacks on Deep Neural Networks. In ACM MM 2018.

PDF Project DOI

(2018). A Deep Reinforcement Learning Framework for Optimizing Fuel Economy of Hybrid Electric Vehicles. In ASP-DAC 2018.

PDF Project DOI

(2015). Robust Beamforming Design for Sum Secrecy Rate Optimization in MU-MISO Networks. In T TFS.

PDF Project DOI