Publications
- [CCF-A] Fast on-device LLM inference with NPUs
Daliang Xu, H Zhang, L Yang, R Liu, G Huang, M Xu, X Liu
ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)
Year: 2025
- [CCF-A] EdgeLLM: Fast On-device LLM Inference with Speculative Decoding
Daliang Xu, W Yin, H Zhang, X Jin, Y Zhang, S Wei, M Xu, X Liu
IEEE Transactions on Mobile Computing (TMC)
Year: 2024
- [CCF-A] Towards energy-efficient federated learning via int8-based training on mobile DSPs
J Yuan, S Wang, H Li, Daliang Xu, Y Li, M Xu, X Liu
Proceedings of the ACM Web Conference (TheWebConf)
Year: 2024
- [CCF-A] SoCFlow: Efficient and scalable DNN training on SoC-clustered edge servers
Daliang Xu, M Xu, C Lou, L Zhang, G Huang, X Jin, X Liu
ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)
Year: 2024
- [CCF-A] Efficient, Scalable, and Sustainable DNN Training on SoC-Clustered Edge Servers
M Xu, Daliang Xu, C Lou, L Zhang, G Huang, X Jin, X Liu
IEEE Transactions on Mobile Computing (TMC)
Year: 2024
- [CCF-B] PieBridge: Fast and Parameter-Efficient On-Device Training via Proxy Networks
W Yin, Daliang Xu, G Huang, Y Zhang, S Wei, M Xu, X Liu
ACM Conference on Embedded Networked Sensor Systems (SenSys)
Year: 2024
- 🏆 [CCF-B Distinguished Paper Award] Niagara: Scheduling DNN Inference Services on Heterogeneous Edge Processors
Daliang Xu, Q Li, M Xu, K Huang, G Huang, S Wang, X Jin, Y Ma, X Liu
International Conference on Service-Oriented Computing (ICSOC)
Year: 2023
- [CCF-A] Satellite Computing: From Space to Your Screen
Q Li, Daliang Xu
International Conference on Service-Oriented Computing (ICSOC)
Year: 2023
- [CCF-A] Mandheling: Mixed-precision on-device DNN training with DSP offloading
Daliang Xu, M Xu, Q Wang, S Wang, Y Ma, K Huang, G Huang, X Jin, X Liu
ACM International Conference on Mobile Computing and Networking (MobiCom)
Year: 2022