Publications

Research Papers

When Cloud TEEs Encounter Availability: A Lightweight Framework for Verifiable CPU Availability
Shangjie Pan, Haochuan Lei, Yinghao Yang, Dongrong Zhang, Dong Du, Hang Lu* (路航) and Xiaowei Li
63nd ACM/EDAC/IEEE Design Automation Conference (DAC, CCF A类), 2026.

Hades: Harnessing Architecture Design Automation for Application-specific FHE Accelerators
Silin Liu, Yinghao Yang, Fuping Li, Hang Lu* (路航) and Xiaowei Li
63nd ACM/EDAC/IEEE Design Automation Conference (DAC, CCF A类), 2026.

Hypnos: A Hardware-Software Co-Design Framework for Memory-Efficient Homomorphic Processing
Haoxuan Wang, Yinghao Yang, Shangjie Pan, Hang Lu* (路航), Xiaowei Li
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD, CCF A类), 2026.

PIRacle: A Fast and Scalable Private Information Retrieval System for Key-Value Stores
Zehao Chen, Zhaoyan Shen, Yi Wang, Hang Lu (路航), Lei Ju
IEEE Transactions on Computers (TC, CCF A类), 2026.

SoK: Analysis of Accelerator TEE Designs
Chenxu Wang, Junjie Huang, Yujun Liang, Xuanyao Peng, Yuqun Zhang, Fengwei Zhang, Jiannong Cao, Hang Lu (路航), Rui Hou, Shoumeng Yan, Tao Wei, Zhengyu He
Network and Distributed System Security Symposium (NDSS, CCF A类), 2026.

Conflux: A High-Performance Keyword Private Retrieval System for Dynamic Datasets
Zehao Chen, Zhaoyan Shen, Qian Wei, Hang Lu (路航), Lei Ju
IEEE 32nd International Symposium on High-Performance Computer Architecture (HPCA, CCF A类), 2026.

FlexMem: High-Parallel Near-Memory Architecture for Flexible Dataflow in Fully Homomorphic Encryption
Shangyi Shi, Husheng Han, Jianan Mu, Xinyao Zheng, Ling Liang, Hang Lu (路航), Zidong Du, Xiaowei Li, Xing Hu
The 31st Asia and South Pacific Design Automation Conference (ASPDAC, CCF C类), 2026.

AceHomo: Accelerating Privacy Preserving Inference through Dynamic Level Adjustment
Hongyan Li, Jinkai Zhang, Hang Lu* (路航), Xiaowei Li
The 43rd IEEE International Conference on Computer Design (ICCD, CCF B类), 2025.

SecNPU: Securing LLM inference on NPU
Xuanyao Peng, Yinghao Yang, Shangjie Pan, Junjie Huang, Yujun Liang, Hang Lu* (路航), Fengwei Zhang and Xiaowei Li
The 43rd IEEE International Conference on Computer Design (ICCD, CCF B类), 2025.

Athena: Accelerating Quantized Convolutional Neural Networks under Fully Homomorphic Encryption
Yinghao Yang, Xicheng Xu, Liang Chang, Hang Lu* (路航), Xiaowei Li
IEEE/ACM 58th International Symposium on Microarchitecture (MICRO, CCF A类)，2025.

SmartPIR: A Private Information Retrieval System using Computational Storage Devices
Zehao Chen, Honghui You, Qian Wei, Hang Lu (路航), Zhaoyan Shen, Lei Ju
IEEE/ACM 58th International Symposium on Microarchitecture (MICRO, CCF A类)，2025.

Uranus: Ultra-efficient Acceleration Architecture for the Privacy Inference of Graph Neural Networks
Xicheng Xu, Yinghao Yang, Fuyao Liu, Hang Lu* (路航) and Xiaowei Li
The 2025 International Conference on Computer-Aided Design (ICCAD, CCF B类), 2025.

RTPU: Unifying Non-Private and Private Inference with Reconfigurable Architecture
Fuping Li, Ying Wang, Yinghao Yang, Jingxuan Li, Yibo Du, Huawei Li, Yinhe Han, Hang Lu* (路航) and Xiaowei Li
The 2025 International Conference on Computer-Aided Design (ICCAD, CCF B类), 2025.

SNO: Securing Network Function Offloading on FPGA-based SmartNICs in Untrusted Clouds
Yunkun Liao, Jingya Wu, Wenyan Lu, Hang Lu* (路航), Xiaowei Li and Guihai Yan
The 2025 International Conference on Computer-Aided Design (ICCAD, CCF B类), 2025.

LayerTEE: Decoupled Memory Protection for Scalable Multi-Layer Communication on RISC-V
Shangjie Pan, Yinghao Yang, Xuanyao Peng, Xiquan Zhao, Dong Du, Hang Lu* (路航), Yubin Xia, Xiaowei Li
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD, CCF A类), 2025.

FicGCN: Unveiling the Homomorphic Encryption Efficiency from Irregular Graph Convolutional Networks
Zhaoxuan Kan, Jianan Mu, Husheng Han, Shangyi Shi, Tenghui Hua, Hang Lu (路航), Xiaowei Li, Xing Hu
42nd International Conference on Machine Learning (ICML, CCF A类), 2025.

Hypnos: Memory Efficient Homomorphic Processing Unit
Haoxuan Wang, Yinghao Yang, Hang Lu* (路航), Xiaowei Li
62nd ACM/EDAC/IEEE Design Automation Conference (DAC, CCF A类), 2025.

Ares: High Performance Near-Storage Accelerator for FHE-based Private Set Intersection
Haoxuan Wang, Yinghao Yang, Jinkai Zhang, Hang Lu* (路航), Xiaowei Li
62nd ACM/EDAC/IEEE Design Automation Conference (DAC, CCF A类), 2025.

Trident: the Acceleration Architecture for High-Performance Private Set Intersection
Jinkai Zhang, Yinghao Yang, Zhe Zhou, Zhicheng Hu, Xin Zhao, Liang Chang, Hang Lu* (路航), Xiaowei Li
IEEE Transactions on Computers (TC, CCF A类), 2025.

Hydra: Scale-out FHE Accelerator Architecture for Secure Deep Learning on FPGA
Yinghao Yang, Xicheng Xu, Hang Lu* (路航), Xiaowei Li
IEEE 31st International Symposium on High-Performance Computer Architecture (HPCA, CCF A类), 2025.

Dep-TEE: Decoupled Memory Protection for Secure and Scalable Inter-enclave Communication on RISC-V
Shangjie Pan, Xuanyao Peng, Zeyuan Man, Xiquan Zhao, Dongrong Zhang, Bicheng Yang, Dong Du, Hang Lu* (路航), Yubin Xia, Xiaowei Li
The 30th Asia and South Pacific Design Automation Conference (ASPDAC, CCF C类), 2025.

General Purpose Deep Learning Accelerator Based On Bit Interleaving
Liang Chang, Hang Lu (路航), Chenglong Li, Xin Zhao, Zhicheng Hu, Jun Zhou, Xiaowei Li
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD, CCF A类), 2024.

Mortar-FP8: Morphing the Existing FP32 Infrastructure for High Performance Deep Learning Acceleration
Hongyan Li, Hang Lu* (路航), Xiaowei Li
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD, CCF A类), 2024.

Poseidon-NDP: Practical Fully Homomorphic Encryption Accelerator Based on Near Data Processing Architecture
Yinghao Yang, Hang Lu* (路航), Xiaowei Li
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD, CCF A类), 2023.

Poseidon: Practical Homomorphic Encryption Accelerator
Yinghao Yang, Huaizhi Zhang, Shengyu Fan, Hang Lu* (路航), Mingzhe Zhang, Xiaowei Li
IEEE 29th International Symposium on High-Performance Computer Architecture (HPCA, CCF A类), 2023.

BitXpro: Regularity-aware Hardware Runtime Pruning for Deep Neural Networks
Hongyan Li, Hang Lu* (路航), Haoxuan Wang, Shengji Deng, Xiaowei Li
IEEE Transactions on Very Large Scale Integration Systems (TVLSI, CCF B类)，2023.

Mortar: Morphing the Bit Level Sparsity for General Purpose Deep Learning Acceleration
Yunhung Gao, Hongyan Li, Kevin Zhang, Xueru Yu, Hang Lu* (路航)
ACM 28th Asia and South Pacific Design Automation Conference (ASPDAC, CCF C类), 2023.

Distilling Bit-level Sparsity Parallelism for General Purpose Deep Learning Acceleration
Hang Lu (路航), Liang Chang, Chenglong Li, Zixuan Zhu, Shengjian Lu, Yanhuan Liu, Mingzhe Zhang
IEEE/ACM 54th International Symposium on Microarchitecture (MICRO, CCF A类)，2021.

Streamline Ring ORAM Accesses through Spatial and Temporal Optimization
Dingyuan Cao, Mingzhe Zhang, Hang Lu (路航), Xiaochun Ye, Dongrui Fan, Yuezhi Che, Rujia Wang
IEEE 27th International Symposium on High-Performance Computer Architecture (HPCA, CCF A类), 2021.

BitX: Empower Versatile Inference for Hardware Runtime Pruning
Hongyan Li, Hang Lu* (路航), Jiawen Huang, Wenxu Wang, Mingzhe Zhang, Wei Chen, Liang Chang, Xiaowei Li
IEEE/ACM 50th International Conference on Parallel Processing (ICPP, CCF B类)，2021.

Chaotic Weights: A Novel Approach to Protect Intellectual Property of Deep Neural Networks
Ning Lin, Xiaoming Chen, Hang Lu (路航), Xiaowei Li
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD, CCF A类), 2021.

Architecting Effectual Computation for Machine Learning Accelerators
Hang Lu* (路航), Mingzhe Zhang, Yinhe Han, Qi Wang, Huawei Li, Xiaowei Li
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD, CCF A类), 2020.

ShuttleNoC: Power-Adaptable Communication Infrastructure for Many-core Processors
Hang Lu* (路航), Yisong Chang, Ning Lin, Xin Wei, Guihai Yan, Xiaowei Li
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD, CCF A类), 2019.

HeadStart: Enforce Optimal Inceptions in Pruning Deep Convolutional Neural Networks for Efficient Inference on GPGPUs
Ning Lin, Hang Lu* (路航), Xin Wei, Xiaowei Li
IEEE/ACM 56th International Design Automation Conference (DAC, CCF A类), 2019.

When Deep Learning Meets the Edge: Auto-Masking Deep Neural Networks for Efficient Machine Learning on Edge Devices
Ning Lin, Hang Lu* (路航), Xing Hu, Jingliang Gao, Xiaowei Li
IEEE 37th International Conference on Computer Design (ICCD, CCF B类), 2019.

VNet: A Versatile Deep Neural Network Model for Efficient Semantic Segmentation
Ning Lin, Hang Lu* (路航), Xiaowei Li,
IEEE 37th International Conference on Computer Design (ICCD, CCF B类), 2019.

Tetris: Re-architecting Convolutional Neural Network Computation for Machine Learning Accelerators
Hang Lu* (路航), Xin Wei, Ning Lin, Guihai Yan, Xiaowei Li
IEEE/ACM 37th International Conference on Computer-Aided Design (ICCAD, CCF B类), 2018.

Redeeming Chip-level Power Efficiency by Collaborative Management of the Computation and Communication
Ning Lin, Hang Lu* (路航), Xin Wei, Xiaowei Li
ACM 24th Asia and South Pacific Design Automation Conference (ASPDAC, CCF C类), 2018.

PowerTrader: Enforcing Autonomous Power Management for Large-scale Many-core Processors
Hang Lu* (路航), Guihai Yan, Yinhe Han, Xiaowei Li
IEEE Transactions on Multi-scale Computing Systems (TMSCS), 2017.

RISO: Enforce Non-interfered Performance with Relaxed Network-on-Chip Isolation in Manycore Cloud Processors
Hang Lu* (路航), Binzhang Fu, Ying Wang, Yinhe Han, Guihai Yan, Xiaowei Li
IEEE Transactions on Very Large Scale Integration Systems (TVLSI, CCF B类), 2015.

ShuttleNoC: Boosting On-chip Communication Efficiency Through Localized Power Adaptation
Hang Lu* (路航), Guihai Yan, Yinhe Han, Ying Wang, Xiaowei Li
IEEE 20th Asia and South Pacific Design Automation Conference (ASPDAC, CCF C类，“最佳论文奖”提名), 2015.

RISO: Relaxed Networks-on-Chip Isolation for Cloud Processors
Hang Lu* (路航), Guihai Yan, Yinhe Han, Binzhang Fu, Xiaowei Li
IEEE/ACM 50th International Design Automation Conference (DAC, CCF A类), 2013.

Books

《多核处理器设计优化——低功耗、高可靠、易测试》，科学出版社，2021.
李晓维、路航、李华伟、王颖、鄢贵海著

《Customizable Computing，可定制计算》，机械工业出版社，2018.
鄢贵海、叶靖、王颖、路航、卢文岩、李家军、吴靖雅译，Yu-Ting Chen, Jason Cong, Michael Gill, Glenn Reinman, Bingjun Xiao 著

Hang Lu（路航）

Publications

Research Papers

Books