Introducing miniclaus 1.5B, a tiny but powerful model. Trained with MagPie and based on Qwen2.5 1.5B model, it performs very well on many tasks scoring top on his category, with impressive results: * MATH Hard 9.81 * MMLU-Pro 29.37 * GPQA 29.19 * MUSR 42.85 * BBH 42.04
We released today a newest version of Cybertron: V4 based on Qwen2.5 7B and trained on MagPie. Scoring #1 LLM on 7B & 8B class.
The model hasn't go thru DPO, so the weights are in good shape to welcome further training sessions and optimizations. Enjoy it in the hub as usual: fblgit/cybertron-v4-qw7B-MGS