MedM-VL-CT-3B-en
Introduction
A medical LVLM, trained on English data, accepts text and a single 3D CT volume as input, and text-based results as output, enabling tasks such as report generation and medical VQA.
Here are the evaluation results on M3D-Bench:
Method | Report Generation | Medical VQA | |||||||
BLEU | ROUGE | METEOR | BERT-Score | Accuracy | BLEU | ROUGE | METEOR | BERT-Score | |
RadFM | 12.23 | 16.49 | 11.57 | 87.93 | 19.79 | 16.39 | 26.13 | 21.33 | 88.72 |
M3D-LaMed | 15.15 | 19.55 | 14.38 | 88.46 | 75.78 | 49.38 | 52.39 | 33.58 | 91.53 |
MedM-VL-CT-3B-en | 49.81 | 52.45 | 49.27 | 90.38 | 80.12 | 56.56 | 59.96 | 39.75 | 92.85 |
Quickstart
Please refer to MedM-VL.
- Downloads last month
- 4