What libraries can I use for Token Classification?

The adapter-transformers, flair, spacy, span-marker, stanza, transformers, and transformers.js libraries are compatible with Token Classification.

What models can I use for Token Classification?

The dslim/bert-base-NER, FacebookAI/xlm-roberta-large-finetuned-conll03-english, blaze999/Medical-NER, and flair/ner-english models can be used for Token Classification.

What datasets can I use for Token Classification?

The eriktks/conll2003and unimelb-nlp/wikiann datasets can be used for Token Classification.

What metrics can I use for Token Classification?

The accuracy, recall, precision, and f1 metrics can be used for Token Classification.

Tasks

Token Classification

Token classification is a natural language understanding task in which a label is assigned to some tokens in a text. Some popular token classification subtasks are Named Entity Recognition (NER) and Part-of-Speech (PoS) tagging. NER models could be trained to identify specific entities in a text, such as dates, individuals and places; and PoS tagging would identify, for example, which words in a text are verbs, nouns, and punctuation marks.

Inputs

Input

My name is Omar and I live in Zürich.

Token Classification Model

Output

My name is OmarPERSON and I live in ZürichGPE.

About Token Classification

Use Cases

Information Extraction from Invoices

You can extract entities of interest from invoices automatically using Named Entity Recognition (NER) models. Invoices can be read with Optical Character Recognition models and the output can be used to do inference with NER models. In this way, important information such as date, company name, and other named entities can be extracted.

Task Variants

Named Entity Recognition (NER)

NER is the task of recognizing named entities in a text. These entities can be the names of people, locations, or organizations. The task is formulated as labeling each token with a class for each named entity and a class named "0" for tokens that do not contain any entities. The input for this task is text and the output is the annotated text with named entities.

Inference

You can use the 🤗 Transformers library ner pipeline to infer with NER models.

from transformers import pipeline

classifier = pipeline("ner")
classifier("Hello I'm Omar and I live in Zürich.")

Part-of-Speech (PoS) Tagging

In PoS tagging, the model recognizes parts of speech, such as nouns, pronouns, adjectives, or verbs, in a given text. The task is formulated as labeling each word with a part of the speech.

Inference

You can use the 🤗 Transformers library token-classification pipeline with a POS tagging model of your choice. The model will return a json with PoS tags for each token.

from transformers import pipeline

classifier = pipeline("token-classification", model = "vblagoje/bert-english-uncased-finetuned-pos")
classifier("Hello I'm Omar and I live in Zürich.")

This is not limited to transformers! You can also use other libraries such as Stanza, spaCy, and Flair to do inference! Here is an example using a canonical spaCy model.

!pip install https://huggingface.co/spacy/en_core_web_sm/resolve/main/en_core_web_sm-any-py3-none-any.whl

import en_core_web_sm

nlp = en_core_web_sm.load()
doc = nlp("I'm Omar and I live in Zürich.")
for token in doc:
    print(token.text, token.pos_, token.dep_, token.ent_type_)

## I PRON nsubj
## 'm AUX ROOT
## Omar PROPN attr PERSON
### ...

Useful Resources

Would you like to learn more about token classification? Great! Here you can find some curated resources that you may find helpful!

Notebooks

Scripts for training

Documentation

Token classification task guide

Deploy on Inference Endpoints

Compatible libraries

Token Classification demo

using FacebookAI/xlm-roberta-large-finetuned-conll03-english

Models for Token Classification

Browse Models (20,477)

dslim/bert-base-NER

Token Classification • Updated Oct 8 • 1.94M • 526

Note A robust performance model to identify people, locations, organizations and names of miscellaneous entities.

FacebookAI/xlm-roberta-large-finetuned-conll03-english

Token Classification • Updated Feb 19 • 3.7M • 149

Note A strong model to identify people, locations, organizations and names in multiple languages.

blaze999/Medical-NER

Token Classification • Updated Apr 8 • 198k • 166

Note A token classification model specialized on medical entity recognition.

flair/ner-english

Token Classification • Updated Jul 21 • 344k • 31

Note Flair models are typically the state of the art in named entity recognition tasks.

Datasets for Token Classification

Browse Datasets (1,058)

eriktks/conll2003

Updated Jan 18 • 26.3k • 124

Note A widely used dataset useful to benchmark named entity recognition models.

unimelb-nlp/wikiann

Viewer • Updated Feb 22 • 2M • 71.9k • 102

Note A multilingual dataset of Wikipedia articles annotated for named entity recognition in over 150 different languages.

Spaces using Token Classification

👀

spacy/gradio_pipeline_visualizer

Note An application that can recognizes entities, extracts noun chunks and recognizes various linguistic features of each token.

Metrics for Token Classification

accuracy: Accuracy is the proportion of correct predictions among the total number of cases processed. It can be computed with: Accuracy = (TP + TN) / (TP + TN + FP + FN) Where: TP: True positive TN: True negative FP: False positive FN: False negative

recall: Recall is the fraction of the positive examples that were correctly labeled by the model as positive. It can be computed with the equation: Recall = TP / (TP + FN) Where TP is the true positives and FN is the false negatives.

precision: Precision is the fraction of correctly labeled positive examples out of all of the examples that were labeled as positive. It is computed via the equation: Precision = TP / (TP + FP) where TP is the True positives (i.e. the examples correctly labeled as positive) and FP is the False positive examples (i.e. the examples incorrectly labeled as positive).

f1: The F1 score is the harmonic mean of the precision and recall. It can be computed with the equation: F1 = 2 * (precision * recall) / (precision + recall)