Use allreduce_coalesced for factor allreduce #35

Open

Labels

enhancementhelp wantedpytorch-1.11

opened

on Mar 10, 2022

pytorch/pytorch#62140

"grouped comm on a set of unflattened tensors can be more performant than flattening+a single flat nccl call."

Metadata

Assignees

No one assigned

Labels

enhancementhelp wantedpytorch-1.11

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests