YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

GPT2-IMDB

What is it?

A GPT2 (gpt2) language model fine-tuned on the IMDB dataset.

Training setting

The GPT2 language model was fine-tuned for 1 epoch on the IMDB dataset. All comments were joined into a single text file separated by the EOS token:

import pandas as pd
df = pd.read_csv("imdb-dataset.csv")
imdb_str = " <|endoftext|> ".join(df['review'].tolist())

with open ('imdb.txt', 'w') as f:
    f.write(imdb_str)

To train the model the run_language_modeling.py script in the transformer library was used:

python run_language_modeling.py 
    --train_data_file imdb.txt 
    --output_dir gpt2-imdb 
    --model_type gpt2 
    --model_name_or_path gpt2
Downloads last month
14,440
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for lvwerra/gpt2-imdb

Adapters
24 models
Finetunes
20 models

Space using lvwerra/gpt2-imdb 1