--- base_model: - Sao10K/L3-Solana-8B-v1 - Masterjp123/Llama-3-SnowyRP-8B-V1 - kuotient/Meta-Llama-3-8B-Instruct - cgato/L3-TheSpice-8b-v0.1.3 library_name: transformers tags: - mergekit - merge --- # merged This is the unquantized first version of SnowyRP's Llama 3 model. Quality of this model is unknown, since it is a Llama 3 model that I have not had the chance to test much. Also Once the EXL2 Quant is done I'll test this model and if it's quality is upto my standards, I'll create an GGUF Quant. [EXL2](https://huggingface.co/Masterjp123/Llama-3-SnowyRP-8B-V1-EXL2-4.5bit) ## Merge Details For this model I used the most uncensored and smartest finetunes of LLama 3, since LLama 3 has the largest ammount of training tokens I know of, so I expect it to have more than enough of a knowledge base. This model is under both the LLama 3 licince and a few other ones that only affect conmerical uses, But I am not a lawer so use in commerical settings at your own risk. ## Model Use: Here is an modified Alpaca prompt I use with this model: ``` Below is an instruction that describes a task. Write a response that appropriately completes the request. ### Instruction: Take the role of {{char}} in a play that leaves a lasting impression on {{user}}. Write {{char}}'s next reply. Never skip or gloss over {{char}}’s actions. Progress the scene at a naturally slow pace. Avoid repetition, don't loop. Develop the plot slowly, always stay in character. Describe all actions in full, elaborate, explicit, graphic, and vivid detail. Mention all relevant sensory perceptions. ``` ### Merge Method This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [kuotient/Meta-Llama-3-8B-Instruct](https://huggingface.co/kuotient/Meta-Llama-3-8B-Instruct) as a base. ### Models Merged The following models were included in the merge: * [Sao10K/L3-Solana-8B-v1](https://huggingface.co/Sao10K/L3-Solana-8B-v1) * [Masterjp123/Llama-3-SnowyRP-8B-V1](https://huggingface.co/Masterjp123/Llama-3-SnowyRP-8B-V1) * [cgato/L3-TheSpice-8b-v0.1.3](https://huggingface.co/cgato/L3-TheSpice-8b-v0.1.3) ### Configuration The following YAML configuration was used to produce this model: ```yaml base_model: kuotient/Meta-Llama-3-8B-Instruct dtype: float16 merge_method: ties parameters: int8_mask: 1.0 normalize: 1.0 slices: - sources: - layer_range: [0, 32] model: Masterjp123/Llama-3-SnowyRP-8B-V1 parameters: density: [1.0, 0.7, 0.1] weight: 1.0 - layer_range: [0, 32] model: cgato/L3-TheSpice-8b-v0.1.3 parameters: density: 0.5 weight: [0.0, 0.3, 0.7, 1.0] - layer_range: [0, 32] model: Sao10K/L3-Solana-8B-v1 parameters: density: 0.33 weight: - filter: mlp value: 0.5 - value: 0.0 - layer_range: [0, 32] model: kuotient/Meta-Llama-3-8B-Instruct ```