Model Description

SkinSAM is on the 12-layer ViT-b model, the mask decoder module of SAM is fine-tuned on a combined dataset of ISIC and PH2 skin lesion images and masks. SkinSAM was trained on an Nvidia Tesla A100 40GB GPU.

Some of the notable results taken:
ISIC Dataset:

IOU 78.25%
Pixel Accuracy 92.18%
F1 Score 87.47%

PH2 Dataset:

IOU 86.68%
Pixel Accuracy 93.33%
F1 Score 93.95%

Downloads last month: 33

Safetensors

Model size

93.7M params

Tensor type

F32

Inference API

Mask Generation

Inference API (serverless) does not yet support transformers models for this pipeline type.

ahishamm
/

skinsam

Model Description

Dataset used to train ahishamm/skinsam

Space using ahishamm/skinsam 1