Edit model card

IBI-CAAI/MELT-Mistral-3x7B-Instruct-v0.1 AWQ

Model Summary

The MELT-Mistral-3x7B-Instruct-v0.1 Large Language Model (LLM) is a pretrained generative text model pre-trained and fine-tuned on using publically avalable medical data.

MELT-Mistral-3x7B-Instruct-v0.1 demonstrated a average 19.7% improvement over Mistral-3x7B-Instruct-v0.1 (MoE of 3 X Mistral-7B-Instruct-v0.1) across 3 USMLE, Indian AIIMS, and NEET medical examination benchmarks.

This is MoE model, thanks to Charles Goddard for code/tools.

The Medical Education Language Transformer (MELT) models have been trained on a wide-range of text, chat, Q/A, and instruction data in the medical domain.

While the model was evaluated using publically avalable USMLE, Indian AIIMS, and NEET medical examination example questions, its use it intented to be more broadly applicable.

Downloads last month
1
Safetensors
Model size
2.7B params
Tensor type
I32
·
FP16
·
Inference Examples
Inference API (serverless) has been turned off for this model.

Model tree for solidrust/MELT-Mistral-3x7B-Instruct-v0.1-AWQ

Quantized
(1)
this model

Collection including solidrust/MELT-Mistral-3x7B-Instruct-v0.1-AWQ