---
library_name: transformers
license: apache-2.0
datasets:
- ciol-research/UKIL-DB-EN
language:
- en
metrics:
- perplexity
base_model:
- openai-community/gpt2
pipeline_tag: text-generation
---
# Exploring Possibilities of AI-Powered Legal Assistance in Bangladesh through Large Language Modeling
- Authors: Azmine Toushik Wasi, Wahid Faisal, Mst Rafia Islam, Mahathir M Bappy
- arXiv: https://arxiv.org/abs/2410.17210
- The `GPT2-UKIL-EN` model is freely available on Hugging Face as `ciol-research/GPT2-UKILv1` (DOI: [**10.57967/hf/3233**](https://doi.org/10.57967/hf/3233)).
- The `UKIL-DB-EN` dataset is freely available on Hugging Face as `ciol-research/UKIL-DB-EN` (DOI: [**10.57967/hf/3235**](https://doi.org/10.57967/hf/3235)).
---
**Abstract:** Bangladesh's legal system struggles with major challenges like delays, complexity, high costs, and millions of unresolved cases, which deter many from pursuing legal action due to lack of knowledge or financial constraints. This research seeks to develop a specialized Large Language Model (LLM) to assist in the Bangladeshi legal system.
We created `UKIL-DB-EN`, an English corpus of Bangladeshi legal documents, by collecting and scraping data on various legal acts. We fine-tuned the `GPT-2` model on this dataset to develop `GPT2-UKIL-EN`, an LLM focused on providing legal assistance in English.
The model was rigorously evaluated using semantic assessments, including case studies supported by expert opinions. The evaluation provided promising results, demonstrating the potential for the model to assist in legal matters within Bangladesh.
Our work represents the first structured effort toward building an AI-based legal assistant for Bangladesh. While the results are encouraging, further refinements are necessary to improve the model's accuracy, credibility, and safety. This is a significant step toward creating a legal AI capable of serving the needs of a population of 180 million.
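#### **Usage sketch**:
A minimal sketch of how the fine-tuned checkpoint might be queried, assuming the repository `ciol-research/GPT2-UKILv1` loads through the standard `transformers` text-generation pipeline, as the card's `library_name` and `pipeline_tag` metadata suggest. The helper names `load_generator` and `complete`, the greedy decoding setting, and the 64-token limit are illustrative assumptions, not settings reported in the paper. Calling `complete(load_generator(), prompt)` with an English legal prompt of your choice would download the weights on first use and return only the generated continuation.

```python
MODEL_ID = "ciol-research/GPT2-UKILv1"  # repository id stated in this model card


def load_generator(model_id: str = MODEL_ID):
    """Build a text-generation pipeline (downloads the weights on first use)."""
    # Imported lazily so that `complete` below has no hard dependency on transformers.
    from transformers import pipeline

    return pipeline("text-generation", model=model_id)


def complete(generator, prompt: str, max_new_tokens: int = 64) -> str:
    """Return only the newly generated continuation for `prompt`."""
    outputs = generator(
        prompt,
        max_new_tokens=max_new_tokens,  # assumed limit, chosen for illustration
        do_sample=False,                # greedy decoding, for repeatable output
        return_full_text=False,         # drop the prompt from the returned text
    )
    # The pipeline returns a list with one dict per generated sequence.
    return outputs[0]["generated_text"].strip()
```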
---
#### **Cite as**:
```
@misc{wasi2024exploringpossibilitiesaipoweredlegal,
title={Exploring Possibilities of AI-Powered Legal Assistance in Bangladesh through Large Language Modeling},
author={Azmine Toushik Wasi and Wahid Faisal and Mst Rafia Islam and Mahathir Mohammad Bappy},
year={2024},
eprint={2410.17210},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2410.17210},
}
```