-name '*json')
The DeepSpeed configuration is passed as a path to a JSON file if you're training from the command line interface, or as a nested dict object if you're using the [Trainer] in a notebook setting.
```py
TrainingArguments(..., deepspeed="path/to/deepspeed_config.json")
```

```py
ds_config_dict = dict(scheduler=scheduler_params, optimizer=optimizer_params)
args = TrainingArguments(..., deepspeed=ds_config_dict)
trainer = Trainer(model, args, ...)
```
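The snippets above leave `scheduler_params` and `optimizer_params` undefined. As a minimal sketch, the example below fills them with illustrative values and shows that the same nested dict can be handed over directly or written to a JSON file for command line use. The concrete optimizer and scheduler settings, the `write_ds_config` helper, and the temporary directory are assumptions made for illustration and do not come from the original text.

```python
import json
import tempfile
from pathlib import Path

# Illustrative values only (assumption): "auto" asks the Trainer integration
# to fill the value in from the matching TrainingArguments field.
optimizer_params = {"type": "AdamW", "params": {"lr": "auto", "weight_decay": "auto"}}
scheduler_params = {"type": "WarmupLR", "params": {"warmup_num_steps": "auto"}}

# Notebook form: a nested dict that can be passed as deepspeed=ds_config_dict.
ds_config_dict = dict(scheduler=scheduler_params, optimizer=optimizer_params)


def write_ds_config(config, directory):
    """Hypothetical helper: save the dict as JSON and return the file path."""
    path = Path(directory) / "deepspeed_config.json"
    path.write_text(json.dumps(config, indent=2))
    return path


# Command line form: the same settings stored in a JSON file, whose path
# can be passed as deepspeed="<path>".
config_path = write_ds_config(ds_config_dict, tempfile.mkdtemp())

# Reading the file back yields the same nested dict.
loaded = json.loads(config_path.read_text())
```

Keeping a single dict as the source and generating the JSON file from it is one way to avoid the two forms drifting apart.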
DeepSpeed and Trainer parameters
There are three types of configuration parameters:
Some of the configuration parameters are shared by [Trainer] and DeepSpeed, and it can be difficult to identify errors when there are conflicting definitions.