Is this based on instruction tuned gemma or gemma base model?
#1
by
jsgreenawalt
- opened
I'm getting strange results (nonsense output) when trying to add some fine-tunes based on this model to a merge with gemma-2-9b-it variants, just wanted to confirm this is based on the non-it version of the original Gemma weights?
Thanks!
This is the same weights as the base model, not -it, not finetuned either, the only modification is replacing two of the reserved tokens in the tokenizer, and changing the eos token in the config.
Thanks for the info
jsgreenawalt
changed discussion status to
closed