Contains chemlactica-125m, chemlactica-1.3b, and chemma-2b, as well as training and validation data in JSONL format.
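
JSONL stores one JSON object per line, so the training and validation data can be read line by line without loading a whole file into memory. The following is a minimal sketch under stated assumptions: the file name `train.jsonl` and the `text` field are hypothetical placeholders, since the actual file names and record schema are not described here.

```python
from __future__ import annotations

import json
import tempfile
from pathlib import Path
from typing import Any, Iterator


def read_jsonl(path: str | Path) -> Iterator[dict[str, Any]]:
    """Yield one parsed JSON object per non-empty line of a JSONL file."""
    with open(path, "r", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines
                yield json.loads(line)


def demo() -> list[dict[str, Any]]:
    """Write a tiny stand-in file and read it back."""
    # Hypothetical records: the real field names are not documented here.
    sample = [{"text": "example one"}, {"text": "example two"}]
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "train.jsonl"  # hypothetical file name
        path.write_text(
            "\n".join(json.dumps(record) for record in sample) + "\n",
            encoding="utf-8",
        )
        return list(read_jsonl(path))


if __name__ == "__main__":
    for record in demo():
        print(record)
```

To use this on the real data, replace the temporary file with the path to the actual training or validation file and inspect the keys of the first record to discover the schema.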