QuantFactory Banner

QuantFactory/Qwen2.5-Coder-7B-Chat-Instruct-TIES-v1.2-GGUF

This is quantized version of BenevolenceMessiah/Qwen2.5-Coder-7B-Chat-Instruct-TIES-v1.2 created using llama.cpp

Original Model Card

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the TIES merge method using Qwen/Qwen2.5-Coder-7B as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

# Qwen2.5-Coder-7B-Chat-Instruct-TIES-v1.2

models:
  - model: Qwen/Qwen2.5-Coder-7B-Instruct # Reflecting Update 11/9/2024
    parameters:
      density: 1.0
      weight: 1.0
merge_method: ties
base_model: Qwen/Qwen2.5-Coder-7B # Reflecting Update 11/9/2024
parameters:
  normalize: true
  int8_mask: false
dtype: bfloat16
tokenizer_source: union
Downloads last month
696
GGUF
Model size
7.61B params
Architecture
qwen2

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for QuantFactory/Qwen2.5-Coder-7B-Chat-Instruct-TIES-v1.2-GGUF