Ziming Liu
Publications
Ziming Liu, Shaoyu Wang, Shenggan Cheng, Zhongkai Zhao, Kai Wang, Xuanlei Zhao, James Demmel, Yang You (2024). WallFacer: Harnessing Multi-dimensional Ring Parallelism for Efficient Long Sequence Model Training. arXiv preprint.
Xuanlei Zhao, Bin Jia, Haotian Zhou, Ziming Liu, Shenggan Cheng, Yang You (2024). HeteGen: Efficient Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices. In MLSys 2024, Proceedings of Machine Learning and Systems.
Xuanlei Zhao, Shenggan Cheng, Guangyang Lu, Jiarui Fang, Haotian Zhou, Bin Jia, Ziming Liu, Yang You (2024). AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference. In ICLR 2024, International Conference on Learning Representations.
Xuanlei Zhao, Shenggan Cheng, Zangwei Zheng, Zheming Yang, Ziming Liu, Yang You (2024). DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers. arXiv preprint.
Ziming Liu, Shenggan Cheng, Haotian Zhou, Yang You (2023). Hanayo: Harnessing Wave-like Pipeline Parallelism for Enhanced Large Model Training Efficiency. In SC '23, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis.
Shenggan Cheng, Ziming Liu, Jiangsu Du, Yang You (2023). ATP: Adaptive Tensor Parallelism for Foundation Models. arXiv preprint.
Jiangsu Du, Ziming Liu, Jiarui Fang, Shenggui Li, Yongbin Li, Yutong Lu, Yang You (2022). EnergonAI: An Inference System for 10-100 Billion Parameter Transformer Models. arXiv preprint.