Edit model card

SHRDFU-7b Δ

  • Developed by: maldv
  • License: cc-by-nc-4.0
  • Finetuned from model: ammarali32/multi_verse_model
  • Methodology: Peft to train; extending intelligence and problem solving w/ crabcanon

As I work on understanding how to layer information in to the model, this model used no conditioning and even with low LR's, had quite a sharp graph. It definitely inherited the style of the source.

I had been experimenting in this series with wrapping each paragraph or turn with <s></s> bos/eos tokens. This may be semi-compatible with instruct, but is incompatible with alpaca and chatml. Good to know.

Downloads last month
9
Safetensors
Model size
7.24B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for maldv/SHRDFU-7b-delta

Finetuned
(8)
this model
Quantizations
1 model

Dataset used to train maldv/SHRDFU-7b-delta