merged
This is the unquantized Seccond version of SnowyRP using LLama3, added some models will add a EXL2 Quant.
Merge Details
For this model I used the most uncensored and smartest finetunes of LLama 3, since LLama 3 has the largest ammount of training tokens I know of, so I expect it to have more than enough of a knowledge base.
This model is under both the LLama 3 licince and a few other ones that only affect conmerical uses, But I am not a lawer so use in commerical settings at your own risk.
Model Use:
Here is an modified Alpaca prompt I use with this model:
Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction: Take the role of {{char}} in a play that leaves a lasting impression on {{user}}. Write {{char}}'s next reply. Never skip or gloss over {{char}}’s actions. Progress the scene at a naturally slow pace. Avoid repetition, don't loop. Develop the plot slowly, always stay in character. Describe all actions in full, elaborate, explicit, graphic, and vivid detail. Mention all relevant sensory perceptions.
Merge Method
This model was merged using the TIES merge method using kuotient/Meta-Llama-3-8B-Instruct as a base.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
base_model: kuotient/Meta-Llama-3-8B-Instruct
dtype: float16
merge_method: ties
parameters:
int8_mask: 1.0
normalize: 1.0
slices:
- sources:
- layer_range: [0, 32]
model: Masterjp123/Llama-3-SnowyRP-8B-V1
parameters:
density: [1.0, 0.7, 0.1]
weight: 1.0
- layer_range: [0, 32]
model: cgato/L3-TheSpice-8b-v0.1.3
parameters:
density: 0.5
weight: [0.0, 0.3, 0.7, 1.0]
- layer_range: [0, 32]
model: Sao10K/L3-Solana-8B-v1
parameters:
density: 0.33
weight:
- filter: mlp
value: 0.5
- value: 0.0
- layer_range: [0, 32]
model: kuotient/Meta-Llama-3-8B-Instruct
- Downloads last month
- 21
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.