Is this based on the instruction-tuned Gemma or the Gemma base model?

#1
by jsgreenawalt - opened

I'm getting strange results (nonsense output) when I try to add some fine-tunes based on this model to a merge with gemma-2-9b-it variants. I just wanted to confirm: is this based on the non-it version of the original Gemma weights?

Thanks!

Intervitens AI Innovations Inc. org

These are the same weights as the base model: not -it, and not fine-tuned either. The only modifications are replacing two of the reserved tokens in the tokenizer and changing the EOS token in the config.
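For anyone who wants to check those two differences before merging, here is a minimal sketch of how the comparison could be done. Everything in it is an assumption for illustration: the helper names `diff_vocab` and `diff_config` are hypothetical, and the toy vocab entries, token names, and ids are placeholders, not the actual tokens or ids changed in this model.

```python
def diff_vocab(base_vocab, other_vocab):
    """Return {token_id: (base_token, other_token)} for ids whose token string differs."""
    # Invert token -> id mappings so the comparison is keyed by id.
    base_by_id = {i: t for t, i in base_vocab.items()}
    other_by_id = {i: t for t, i in other_vocab.items()}
    return {
        i: (base_by_id.get(i), other_by_id.get(i))
        for i in sorted(set(base_by_id) | set(other_by_id))
        if base_by_id.get(i) != other_by_id.get(i)
    }


def diff_config(base_cfg, other_cfg,
                keys=("eos_token_id", "bos_token_id", "pad_token_id")):
    """Return {key: (base_value, other_value)} for special-token ids that differ."""
    return {
        k: (base_cfg.get(k), other_cfg.get(k))
        for k in keys
        if base_cfg.get(k) != other_cfg.get(k)
    }


# Toy stand-ins (hypothetical values, not the real Gemma vocab or config).
base_vocab = {"<pad>": 0, "<eos>": 1, "<bos>": 2,
              "<reserved_a>": 3, "<reserved_b>": 4, "hello": 5}
other_vocab = {"<pad>": 0, "<eos>": 1, "<bos>": 2,
               "<new_token_a>": 3, "<new_token_b>": 4, "hello": 5}
base_cfg = {"eos_token_id": 1, "bos_token_id": 2}
other_cfg = {"eos_token_id": 4, "bos_token_id": 2}

vocab_changes = diff_vocab(base_vocab, other_vocab)
config_changes = diff_config(base_cfg, other_cfg)
print(vocab_changes)
print(config_changes)
```

With real files, the two vocab dicts would probably come from the `model.vocab` entry of each repo's `tokenizer.json` and the config dicts from each `config.json`, both loaded with `json.load`; that layout is an assumption about the files, so check it against what is actually in the repos.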

Thanks for the info

jsgreenawalt changed discussion status to closed
