If the GPUs need to exchange data with each other often, as in ZeRO-DP, then the speed of the inter-GPU connection becomes a major factor in overall training speed.
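To get a rough feel for why link speed matters, here is a minimal back-of-the-envelope sketch. It is not ZeRO-DP's actual communication schedule: it uses a plain ring all-reduce as a stand-in, in which each GPU sends about `2 * (N - 1) / N` times the payload size, and it ignores latency and any overlap with computation. The function name, the payload size, and both bandwidth figures are hypothetical values chosen only for illustration.

```python
def ring_allreduce_seconds(payload_bytes, num_gpus, bandwidth_bytes_per_s):
    """Estimate the bandwidth-bound time of one ring all-reduce.

    Assumes every GPU sends 2 * (N - 1) / N * payload_bytes over a link
    with the given bandwidth. Latency and compute overlap are ignored.
    """
    if num_gpus < 2:
        return 0.0  # a single GPU has nothing to synchronize
    traffic_per_gpu = 2 * (num_gpus - 1) / num_gpus * payload_bytes
    return traffic_per_gpu / bandwidth_bytes_per_s


# Hypothetical workload: 1 billion parameters stored in 2 bytes each.
payload = 1_000_000_000 * 2

# Hypothetical link speeds, in bytes per second.
slow_link = ring_allreduce_seconds(payload, num_gpus=4, bandwidth_bytes_per_s=10e9)
fast_link = ring_allreduce_seconds(payload, num_gpus=4, bandwidth_bytes_per_s=100e9)

print(f"slow link: {slow_link:.3f} s per sync")
print(f"fast link: {fast_link:.3f} s per sync")
```

In this simplified model the time per synchronization scales inversely with bandwidth, so the more often the GPUs have to sync, the more of each training step is spent waiting on the link.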