5fa1a76
1
2
3
4
5
6
7
yaml { "train_micro_batch_size_per_gpu": "auto", "train_batch_size": "auto" } Gradient accumulation Gradient accumulation can be auto-configured or explicitly set.