|
--- |
|
license: other |
|
license_name: yi-34b |
|
license_link: https://huggingface.co/01-ai/Yi-34B/blob/main/LICENSE |
|
--- |
|
|
|
<img src=https://huggingface.co/lodrick-the-lafted/Kaiju-A-57B/resolve/main/kaiju.png> |
|
|
|
## Kaiju-A-57B |
|
|
|
I made this model as an experiment for /r/LocalLlama, who've all wanted a Yi graft like Goliath. |
|
I took the goliath-120B template and used the same proportions to blend Tess-M-v1.3 and Tess-M-v1.2. The mergekit yaml is in the repo. |
|
I chose these two as there are still precious few Yi-200K tunes and merging models with different ideas of positional encoding did not work well. |
|
Thanks to Meta for Llama which kickstarted open weight models, thanks to Yi for the base model, thanks migtissera and the others who have fine-tuned Yi. Special shoutout to chargoddard for mergekit and the original frankenllama. |
|
|
|
|
|
# Prompt Format: |
|
|
|
``` |
|
SYSTEM: <ANY SYSTEM CONTEXT> |
|
USER: |
|
ASSISTANT: |
|
``` |
|
|
|
|
|
|