Daliang Xu (徐大亮)
I am an incoming Assistant Professor (Associate Researcher) at Beijing University of Posts and Telecommunications (BUPT), and I will soon work with Prof. Shangguang Wang and Prof. Mengwei Xu. I received my Ph.D. from Peking University (PKU) in June 2025, where I was fortunate to be advised by Prof. Gang Huang, Prof. Xuanzhe Liu, and Prof. Mengwei Xu. My research interests are in mobile computing and system software.
I always look for highly self-motivated undergraduates and graduates. If you are interested in my research, please feel free to send your CV to contact me at bupt_on_device_lab@163.com.
Notice: Please refer to the tips before sending your email.
Research
- Efficient on-device multimodal LLMs - Our research primarily optimizes on-device multimodal LLM inference from the perspective of heterogeneous hardware.
- Heterogeneous computing systems (e.g., NPU) for on-device multimodal LLMs. - Mobile devices typically contain a variety of heterogeneous computing resources (such as CPU, GPU, NPU, etc.). However, current on-device multimodal LLM systems fail to fully utilize them. To address this, we are designing new system software stack optimized for heterogeneous computing resources to maximize their utilization.
Our current research topics cover NPU-optimized on-device multimodal LLM engines, NPU compiler design, and other related areas.
Papers: Mandheling [MobiCom22], Niagara [ICSOC23 Distinguished Award], SoCFlow [ASPLOS24], EdgeLLM [TMC24], LLM.NPU() [ASPLOS25] - NPU-friendly multimodal LLM algorithms.
- Intelligent autonomous systems: multimodal perception, autonomous control, and self-evolution in UAVs or satellites.
- Heterogeneous computing systems (e.g., NPU) for on-device multimodal LLMs.
- New hardware for intelligent satellites or smartphones. - Next-generation intelligent satellites featuring high reliability, fault tolerance, and support for multimodal LLM inference.
- On-device accelerators for multimodal LLM. - Our research focuses on multimodal LLM quantized inference efficiency and minimizing accelerator energy consumption, current, and area.
- High-Reliability SoCs for Satellites. - Focus on characteristics such as high reliability and fault tolerance.
- On-device accelerators for multimodal LLM.
On-going projects
MLLM-NPU - a fast and lightweight NPU-optimized multimodal LLM inference engine for mobile devices.
Satellite hardware. - A lightweight satellite SoC, focusing on space-grade reliability and enhancing multimodal LLM acceleration capabilities.
Selected Publications (* = equal contributions)
Daliang Xu, Hao Zhang, Liming Yang, Ruiqi Liu, Gang Huang, Mengwei Xu, Xuanzhe Li
Daliang Xu*, Mengwei Xu*, Chiheng Lou, Li Zhang, Gang Huang, Xin Jin, Xuanzhe Liu
Daliang Xu, Wangsong Yin, Hao Zhang, Xin Jin, Ying Zhang, Shiyun Wei, Mengwei Xu, Xuanzhe Liu
Daliang Xu, Qing Li, Mengwei Xu, Kang Huang, Gang Huang, Shangguang Wang, Xin Jin, Ma Yun, Xuanzhe Liu
Daliang Xu*, Mengwei Xu*, Qipeng Wang, Shangguang Wang, Kang Huang, Gang Huang, Xin Jin, Xuanzhe Liu