Stheno GPTQ
Collection
GPTQ quants of Sao10K's (https://huggingface.co/Sao10K) Stheno 13B model
•
6 items
•
Updated
4-bit GPTQ quants of the writer version of Sao10K's fantastic SthenoWriter model (Stheno model collection link)
The main branch contains 4-bit groupsize of 128 and no act_order.
The other branches contain groupsizes of 128, 64, and 32 all with act_order.
A Stheno-1.8 Variant focused on writing.
Stheno-1.8 + Storywriter, mixed with Holodeck + Spring Dragon qLoRA. End Result is mixed with One More Experimental Literature-based LoRA.
Re-Reviewed... it's not bad, honestly.
Support me here :)
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 48.35 |
ARC (25-shot) | 62.29 |
HellaSwag (10-shot) | 83.28 |
MMLU (5-shot) | 56.14 |
TruthfulQA (0-shot) | 44.72 |
Winogrande (5-shot) | 74.35 |
GSM8K (5-shot) | 11.22 |
DROP (3-shot) | 6.48 |