Ericu950
/

Epigr_2_Llama-3.1-8B-Instruct_text

Text Generation

Ancient Greek (to 1453)

textual criticism

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Ericu950 commited on Oct 13, 2024

Commit

60cc9dd

·

verified ·

1 Parent(s): 1dfe82b

Upload README2.md with huggingface_hub

Files changed (1) hide show

README2.md +39 -0

README2.md ADDED Viewed

	@@ -0,0 +1,39 @@

+---
+base_model: []
+library_name: transformers
+tags:
+- mergekit
+- merge
+---
+# merged_1
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+## Merge Details
+### Merge Method
+This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using /mimer/NOBACKUP/groups/naiss2024-22-361/Eric_Pap/Llama-3.1-8B-Instruct as a base.
+### Models Merged
+The following models were included in the merge:
+* /mimer/NOBACKUP/groups/naiss2024-22-361/Eric_Pap/LlamaTuned2_ep_3
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+models:
+  - model: /mimer/NOBACKUP/groups/naiss2024-22-361/Eric_Pap/Llama-3.1-8B-Instruct
+  - model: /mimer/NOBACKUP/groups/naiss2024-22-361/Eric_Pap/LlamaTuned2_ep_3
+    parameters:
+      density: 0.6  # Fixed density, slightly more sparse than the original
+      weight: 1  # Fixed weight to keep the fine-tuned model's influence high
+merge_method: ties
+base_model: /mimer/NOBACKUP/groups/naiss2024-22-361/Eric_Pap/Llama-3.1-8B-Instruct
+parameters:
+  normalize: true
+dtype: bfloat16
+```