How to Run
#3
by
mrfakename
- opened
Hi,
Do you know if there are any inference scripts for this?
good luck !
Thanks! So it’s not instruct tuned?
Thanks! So it’s not instruct tuned?
No, it's a base model.
Sad. I heard somewhere that MoE models are hard to finetune, is that true?
They released a fine tuned model last time, I'm sure they'll drop a instruct model soon, it's a hype drop a battle of two different generations of new young hungry team up to date and in touch with the younger generation versus the biggest and oldest in the industry, as far as it being hard to fine tune I think it just depends on your area of focus and who your asking.
Inference code: https://github.com/open-compass/MixtralKit
Evaluation results will be updated soon