You could also edit the Transformers modeling code to replace torch.utils.checkpoint with the DeepSpeed activation checkpointing API.
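As a rough illustration of that swap, the sketch below picks a checkpoint function at runtime and routes a block's forward call through it. The helper names `resolve_checkpoint_fn` and `run_block` are hypothetical and not part of Transformers or DeepSpeed. It assumes `deepspeed.checkpointing.checkpoint` can be called with the same positional `(function, *args)` form as `torch.utils.checkpoint.checkpoint`. In a real setup DeepSpeed's checkpointing usually also has to be configured (for example via `deepspeed.checkpointing.configure` or the `activation_checkpointing` section of the DeepSpeed config), which this sketch leaves out.

```python
import importlib


def resolve_checkpoint_fn(prefer_deepspeed=True):
    """Return (checkpoint_fn, module_name), or (None, None) if nothing is importable.

    Hypothetical helper: tries DeepSpeed's activation checkpointing first,
    then falls back to torch.utils.checkpoint.
    """
    candidates = []
    if prefer_deepspeed:
        candidates.append("deepspeed.checkpointing")
    candidates.append("torch.utils.checkpoint")

    for module_name in candidates:
        try:
            module = importlib.import_module(module_name)
        except ImportError:
            continue  # library not installed, try the next candidate
        return module.checkpoint, module_name
    return None, None


def run_block(block, hidden_states, checkpoint_fn=None):
    """Call `block` directly, or through `checkpoint_fn` when one is given.

    Only positional arguments are passed, so the same call works for both
    the PyTorch and the DeepSpeed checkpoint functions.
    """
    if checkpoint_fn is None:
        return block(hidden_states)
    return checkpoint_fn(block, hidden_states)
```

Inside the modeling code, each direct `torch.utils.checkpoint.checkpoint(layer, hidden_states)` call would then become `run_block(layer, hidden_states, checkpoint_fn)`, with `checkpoint_fn` resolved once at model setup.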