rombodawg committed on
Commit 1319fc9
1 Parent(s): 31006cf

Update README.md

Files changed (1): README.md +11 -29
README.md CHANGED
@@ -1,39 +1,21 @@
  ---
- base_model: []
  library_name: transformers
- tags:
- - mergekit
- - merge
-
  ---
- # Replete-LLM-72b
-
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
- ## Merge Details
- ### Merge Method
-
- This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using ./qwen-72b as a base.
-
- ### Models Merged
-
- The following models were included in the merge:
- * ./qwen-72b-instruct
-
- ### Configuration
-
- The following YAML configuration was used to produce this model:
-
- ```yaml
- models:
-   - model: ./qwen-72b-instruct
-     parameters:
-       weight: 1
- merge_method: ties
- base_model: ./qwen-72b
- parameters:
-   normalize: true
-   int8_mask: true
- dtype: bfloat16
-
- ```
 
  ---
  library_name: transformers
+ base_model:
+ - Qwen/Qwen2.5-72B-Instruct
+ license: apache-2.0
  ---
+ # Replete-LLM-V2.5-Qwen-72b

+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/ihnWXDEgV-ZKN_B036U1J.png)

+ Replete-LLM-V2.5-Qwen-72b is a continuously finetuned version of Qwen2.5-72B. I noticed recently that the Qwen team did not learn from my continuous finetuning methods, despite their great benefits and lack of downsides. So I took it upon myself to merge the instruct model with the base model using the *TIES* merge method, as sketched in the configuration below.
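
+ For reference, here is a sketch of a mergekit configuration for this merge, mirroring the config shown in the previous version of this README; ./qwen-72b and ./qwen-72b-instruct are assumed local checkpoints of the Qwen2.5-72B base and instruct models:

+ ```yaml
+ # TIES merge: fold the instruct finetune back onto the base model
+ models:
+   - model: ./qwen-72b-instruct  # assumed local checkpoint of the instruct model
+     parameters:
+       weight: 1                 # apply the instruct task vector at full weight
+ merge_method: ties
+ base_model: ./qwen-72b          # assumed local checkpoint of the base model
+ parameters:
+   normalize: true
+   int8_mask: true               # int8 masks reduce memory use during the merge
+ dtype: bfloat16
+ ```

+ With mergekit installed, `mergekit-yaml config.yaml ./merged-model` should reproduce a merge of this shape.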

+ This version of the model shows higher performance than the original instruct and base models.
 
+ Quants: (Coming soon)

+ GGUF:

+ EXL2:

+ Benchmarks: (Coming soon)