"Hybrid parallelization" combines multiple parallelism techniques, such as data, model, and pipeline parallelism, to distribute the compute and memory load of LLMs more effectively. The paper emphasizes overlapping communication with computation to optimize performance.
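As a rough illustration of how the three forms of parallelism can be combined, the sketch below maps a flat worker rank onto a three-dimensional (data, pipeline, tensor) grid. It is not taken from the paper. The names `ParallelCoords`, `rank_to_coords`, and `tensor_group` are hypothetical. The sketch also assumes that "model parallelism" means intra-layer (tensor) parallelism and that tensor-parallel ranks are numbered contiguously.

```python
from typing import List, NamedTuple


class ParallelCoords(NamedTuple):
    """Position of one worker in a hypothetical 3D parallel grid."""
    data: int      # index of the data-parallel replica
    pipeline: int  # index of the pipeline stage
    tensor: int    # index of the tensor (intra-layer) shard


def rank_to_coords(rank: int, dp: int, pp: int, tp: int) -> ParallelCoords:
    """Map a flat rank to grid coordinates.

    Assumed layout: the tensor index varies fastest, then the pipeline
    index, then the data index. Other orderings are equally valid.
    """
    if not 0 <= rank < dp * pp * tp:
        raise ValueError(f"rank {rank} outside grid of size {dp * pp * tp}")
    tensor = rank % tp
    pipeline = (rank // tp) % pp
    data = rank // (tp * pp)
    return ParallelCoords(data, pipeline, tensor)


def tensor_group(rank: int, tp: int) -> List[int]:
    """Ranks that share this rank's data and pipeline coordinates."""
    base = rank - rank % tp
    return list(range(base, base + tp))


if __name__ == "__main__":
    # 8 workers: 2 data replicas x 2 pipeline stages x 2 tensor shards.
    for r in range(8):
        print(r, rank_to_coords(r, dp=2, pp=2, tp=2), tensor_group(r, tp=2))
```

Keeping tensor-parallel ranks adjacent is a common design choice because tensor shards exchange data most often, so adjacent ranks can share the fastest interconnect. Whether the paper uses this layout is not stated here.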