Qwen1.5-4B-Chat模型在 Long-Instruction-with-Paraphrasing数据集上微调,提升了 long-context 能力

Eval on LongBench

long-context 能力得到提升

model score average score
Qwen1.5-4B-Chat 'dureader': 33.61
'hotpotqa': 96.5,'lsht': 41.0
'multifieldqa_en': 100.0
'multifieldqa_zh': 55.4
'passage_retrieval_en': 13.0
'passage_retrieval_zh': 16.5
'qmsum': 22.69
'trec': 73.0
'vcsum': 15.65
46.73
Qwen1.5-4b-chat-paraph {'dureader': 31.54
'hotpotqa': 99.0
'lsht': 39.5
'multifieldqa_en': 100.0
'multifieldqa_zh': 48.41
'passage_retrieval_en': 74.5
'passage_retrieval_zh': 62.5
'qmsum': 23.9
'trec': 74.5
'vcsum': 15.79}
56.96
Downloads last month
6
Safetensors
Model size
3.95B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train yuyijiong/Qwen1.5-4b-chat-paraph

Collection including yuyijiong/Qwen1.5-4b-chat-paraph