metadata

language: en
license: apache-2.0

Women's Clothing Reviews Sentiment Analysis with DistilBERT

Overview

This Hugging Face repository contains a fine-tuned DistilBERT model for sentiment analysis of women's clothing reviews. The model is designed to classify reviews into positive, negative, or neutral sentiment categories, providing valuable insights into customer opinions.

Model Details

Model Architecture: Fine-tuned DistilBERT
Sentiment Categories: Neutral [0], Negative [1], Positive [2]
Input Format: Text-based clothing reviews
Output Format: Sentiment category labels

Fine-tuning procedure

This model was fine-tuned using a relatively small dataset containing 23487 rows broken down into train/eval/test dataset. Nevertheless, the fine-tuned model was able to performs slightly better than the base-distilbert-model on the test dataset.

Training result

It achieved the following results on the evaluation set:

Validation Loss: 1.1677

Comparison between the base distilbert model VS fine-tuned distilbert

Model	Accuracy	Precision	Recall	F1 Score
DistilBERT base model	0.79	0.77	0.79	0.77
DistilBERT fine-tuned	0.85	0.86	0.85	0.85

Installation

To use this model, you'll need to install the Hugging Face Transformers library and any additional dependencies.

pip install transformers
pip install torch

Usage

You can easily load the pre-trained model for sentiment analysis using Hugging Face's DistilBertForSequenceClassification and DistilBertTokenizerFast.

from transformers import DistilBertForSequenceClassification, DistilBertTokenizerFast
import torch

model_name = "ongaunjie/distilbert-cloths-sentiment" 
tokenizer = DistilBertTokenizerFast.from_pretrained(model_name)
model = DistilBertForSequenceClassification.from_pretrained(model_name)

review = "This dress is amazing, I love it!"
inputs = tokenizer.encode(review, return_tensors="pt")
with torch.no_grad():
    outputs = model(inputs)
predicted_class = int(torch.argmax(outputs.logits))