This model is for efficiency purposes for better accuracy refer to en_legal_ner_trf

Paper details

Named Entity Recognition in Indian court judgments

Indian Legal Named Entity Recognition(NER): Identifying relevant named entities in an Indian legal judgement using legal NER trained on spacy.

Scores

Type	Score
F1-Score	74.87
`Precision`	72.98
`Recall`	76.85

Feature	Description
Name	`en_legal_ner_sm`
Version	`3.2.0`
spaCy	`>=3.2.2,<3.3.0`
Default Pipeline	`token2vec`, `ner`
Components	`token2vec`, `ner`
Vectors	0 keys, 0 unique vectors (0 dimensions)
Sources	InLegalNER Train Data GitHub
License	`MIT`
Author	Aman Tiwari

Load Pretrained Model

Install the model using pip

pip install https://huggingface.co/opennyaiorg/en_legal_ner_sm/resolve/main/en_legal_ner_sm-any-py3-none-any.whl

Using pretrained NER model

# Using spacy.load().
import spacy
nlp = spacy.load("en_legal_ner_sm")
text = "Section 319 Cr.P.C. contemplates a situation where the evidence adduced by the prosecution for Respondent No.3-G. Sambiah on 20th June 1984"
doc = nlp(text)

# Print indentified entites
for ent in doc.ents:
     print(ent,ent.label_)

##OUTPUT     
#Section 319 PROVISION
#Cr.P.C. STATUTE
#G. Sambiah RESPONDENT
#20th June 1984 DATE

Label Scheme

View label scheme (14 labels for 1 components)

ENTITY	BELONGS TO
`LAWYER`	PREAMBLE
`COURT`	PREAMBLE, JUDGEMENT
`JUDGE`	PREAMBLE, JUDGEMENT
`PETITIONER`	PREAMBLE, JUDGEMENT
`RESPONDENT`	PREAMBLE, JUDGEMENT
`CASE_NUMBER`	JUDGEMENT
`GPE`	JUDGEMENT
`DATE`	JUDGEMENT
`ORG`	JUDGEMENT
`STATUTE`	JUDGEMENT
`WITNESS`	JUDGEMENT
`PRECEDENT`	JUDGEMENT
`PROVISION`	JUDGEMENT
`OTHER_PERSON`	JUDGEMENT

Author - Publication

@inproceedings{kalamkar-etal-2022-named,
    title = "Named Entity Recognition in {I}ndian court judgments",
    author = "Kalamkar, Prathamesh  and
      Agarwal, Astha  and
      Tiwari, Aman  and
      Gupta, Smita  and
      Karn, Saurabh  and
      Raghavan, Vivek",
    booktitle = "Proceedings of the Natural Legal Language Processing Workshop 2022",
    month = dec,
    year = "2022",
    address = "Abu Dhabi, United Arab Emirates (Hybrid)",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.nllp-1.15",
    doi = "10.18653/v1/2022.nllp-1.15",
    pages = "184--193",
    abstract = "Identification of named entities from legal texts is an essential building block for developing other legal Artificial Intelligence applications. Named Entities in legal texts are slightly different and more fine-grained than commonly used named entities like Person, Organization, Location etc. In this paper, we introduce a new corpus of 46545 annotated legal named entities mapped to 14 legal entity types. The Baseline model for extracting legal named entities from judgment text is also developed.",
}

opennyaiorg
/

en_legal_ner_sm