MoE-Qwen-4x1.8B-pretrain-18000-ckpt / generation_config.json

Commit History