What libraries can I use for Image Classification?

The keras, timm, transformers, and transformers.js libraries are compatible with Image Classification.

What models can I use for Image Classification?

The google/vit-base-patch16-224, facebook/deit-base-distilled-patch16-224, and facebook/convnext-large-224 models can be used for Image Classification.

What datasets can I use for Image Classification?

The cifar100 and fashion_mnist datasets can be used for Image Classification.

What metrics can I use for Image Classification?

The accuracy, recall, precision, and f1 metrics can be used for Image Classification.

Image Classification

Image classification is the task of assigning a label or class to an entire image. Each image is expected to belong to exactly one class. Image classification models take an image as input and return a prediction about which class the image belongs to.

Inputs

Image Classification Model

Output

Egyptian cat: 0.514
Tabby cat: 0.193
Tiger cat: 0.068
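The scores in the output above are per-label probabilities. As a rough illustration, assuming the model emits one raw score (logit) per class and that a softmax turns those logits into probabilities, the ranking step could be sketched as follows. The logits and the id2label mapping below are made-up values for illustration, not the output of any real model, and the helper names are hypothetical.

```python
import math


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_labels(logits, id2label, k=3):
    """Return the k most probable labels, highest score first."""
    probs = softmax(logits)
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return [{"label": id2label[i], "score": probs[i]} for i in ranked[:k]]


# Hypothetical label mapping and logits, chosen only for illustration.
id2label = {0: "Egyptian cat", 1: "tabby cat", 2: "tiger cat", 3: "lynx"}
logits = [2.0, 1.0, 0.0, -1.0]

preds = top_k_labels(logits, id2label, k=3)
for p in preds:
    print(f"{p['label']}: {p['score']:.3f}")
```

Because every image is assumed to have exactly one class, the probabilities compete with each other: raising one label's score lowers the others.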

About Image Classification

Use Cases

Image classification models are a good fit when we only need to know what an image shows, not where individual objects are located or what shape they have.

Keyword Classification

Image classification models are widely used in stock photography to assign each image a keyword.

Image Search

Models trained for image classification can improve the user experience by organizing and categorizing photo galleries, on a phone or in the cloud, by multiple keywords or tags.

Inference

With the transformers library, you can use the image-classification pipeline to run inference with image classification models. You can initialize the pipeline with a model id from the Hub. If you do not provide a model id, it will initialize with google/vit-base-patch16-224 by default. When calling the pipeline, you only need to pass a local path, an HTTP link, or an image loaded with PIL. You can also provide a top_k parameter, which determines how many results the pipeline returns.

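from transformers import pipeline

# No model id is given, so the pipeline falls back to its default model.
clf = pipeline("image-classification")
# The argument can be a local path, an HTTP link, or a PIL image.
clf("path_to_a_cat_image")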

[{'label': 'tabby cat', 'score': 0.731},
...
]
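To make the top_k parameter concrete, here is a minimal sketch. The classify and summarize helpers are hypothetical names introduced for illustration, and the image path is a placeholder. The sample predictions reuse the scores from the example output near the top of this page, so the snippet runs without downloading a model.

```python
def classify(image, model_id="google/vit-base-patch16-224", top_k=3):
    """Run the image-classification pipeline and keep the top_k labels.

    `image` can be a local path, an HTTP link, or a PIL image.
    """
    # Imported here so that summarize() below works even when
    # transformers is not installed.
    from transformers import pipeline

    clf = pipeline("image-classification", model=model_id)
    return clf(image, top_k=top_k)


def summarize(predictions):
    """Turn pipeline-style results into readable 'label (score%)' strings."""
    return [f"{p['label']} ({p['score']:.1%})" for p in predictions]


# Sample predictions in the pipeline's output format (a list of dicts
# with 'label' and 'score' keys), copied from the example output above.
sample = [
    {"label": "Egyptian cat", "score": 0.514},
    {"label": "Tabby cat", "score": 0.193},
    {"label": "Tiger cat", "score": 0.068},
]

for line in summarize(sample):
    print(line)

# With transformers installed and a real image available:
# print(summarize(classify("path_to_a_cat_image", top_k=3)))
```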

You can use huggingface.js to classify images with models hosted on the Hugging Face Hub.

import { HfInference } from "@huggingface/inference";

// HF_TOKEN is a placeholder for your Hugging Face access token.
const inference = new HfInference(HF_TOKEN);
const result = await inference.imageClassification({
    data: await (await fetch("https://picsum.photos/300/300")).blob(),
    model: "microsoft/resnet-50",
});
console.log(result);

Useful Resources

Creating your own image classifier in just a few minutes

With HuggingPics, you can fine-tune Vision Transformers for anything using images found on the web. This project downloads images of classes defined by you, trains a model, and pushes it to the Hub. You even get to try out the model directly with a working widget in the browser, ready to be shared with all your friends!

Models for Image Classification

google/vit-base-patch16-224

Image Classification • Updated Sep 5, 2023 • 3.75M • 686

Note: A strong image classification model.

facebook/deit-base-distilled-patch16-224

Image Classification • Updated Jul 13, 2022 • 27.3k • 23

Note: A robust image classification model.

facebook/convnext-large-224

Image Classification • Updated Jun 13, 2023 • 5.77k • 25

Note: A strong image classification model.

Datasets for Image Classification

No example dataset is defined for this task.

Note: Contribute by proposing a dataset for this task!

Spaces using Image Classification

nielsr/perceiver-image-classification

Note: An application that classifies what a given image is about.

Metrics for Image Classification

accuracy: Accuracy is the proportion of correct predictions among the total number of cases processed. It can be computed with:

Accuracy = (TP + TN) / (TP + TN + FP + FN)

where TP is the number of true positives, TN the true negatives, FP the false positives, and FN the false negatives.

recall: Recall is the fraction of the positive examples that were correctly labeled by the model as positive. It can be computed with:

Recall = TP / (TP + FN)

where TP is the number of true positives and FN the false negatives.

precision: Precision is the fraction of correctly labeled positive examples out of all of the examples that were labeled as positive. It can be computed with:

Precision = TP / (TP + FP)

where TP is the number of true positives (the examples correctly labeled as positive) and FP the false positives (the examples incorrectly labeled as positive).

f1: The F1 score is the harmonic mean of the precision and recall. It can be computed with:

F1 = 2 * (precision * recall) / (precision + recall)
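The four formulas above are stated in terms of one positive class. A minimal sketch of how they could be computed for a classifier follows, assuming a one-vs-rest view in which a single chosen label counts as positive and every other label as negative. The function names and the toy labels are illustrative assumptions, and the sketch assumes non-empty inputs.

```python
def confusion_counts(y_true, y_pred, positive):
    """Count TP, FP, FN, TN, treating `positive` as the positive class."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def classification_metrics(y_true, y_pred, positive):
    """Compute accuracy, precision, recall, and F1 from the counts."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    # Fall back to 0.0 when a denominator is zero (a common convention).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * (precision * recall) / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Toy labels, invented for illustration.
y_true = ["cat", "cat", "dog", "dog", "cat", "dog"]
y_pred = ["cat", "dog", "dog", "cat", "cat", "dog"]

scores = classification_metrics(y_true, y_pred, positive="cat")
print(scores)
```

For a task with many classes, one common approach is to repeat this calculation with each label as the positive class and average the results.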