Efficient Sequence Parallelism System for Transformer model training.
Jun 30, 2024
Efficient Heterogeneous Parallel Inference System for LLM on resource-constrained devices.
May 13, 2024
Accepted by SC '23. Efficient Pipeline Parallelism System for LLM.
Nov 11, 2023