achedguerra
commited on
Commit
•
1740cb3
1
Parent(s):
3599ae3
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,98 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
base_model:
|
4 |
+
- facebook/detr-resnet-50
|
5 |
+
pipeline_tag: image-classification
|
6 |
+
library_name: adapter-transformers
|
7 |
+
---
|
8 |
+
# Sign Language Detection Model
|
9 |
+
|
10 |
+
## Model Description
|
11 |
+
This model, `achedguerra/resnet-50-signal_language`, is a fine-tuned version of [microsoft/resnet-50](https://huggingface.co/microsoft/resnet-50) for real-time sign language detection. It has been trained on a dataset of sign language images to provide accurate and efficient detection of sign language gestures.
|
12 |
+
|
13 |
+
## Key Features
|
14 |
+
- Based on the powerful ResNet-50 architecture
|
15 |
+
- Fine-tuned specifically for sign language detection
|
16 |
+
- Capable of real-time processing
|
17 |
+
- Promotes accessibility and inclusion in technology
|
18 |
+
|
19 |
+
## Use Cases
|
20 |
+
- Real-time sign language interpretation
|
21 |
+
- Assistive technology for the deaf and hard of hearing
|
22 |
+
- Educational tools for learning sign language
|
23 |
+
- Enhancing communication in diverse environments
|
24 |
+
|
25 |
+
## How to Use the Model
|
26 |
+
|
27 |
+
### Installation
|
28 |
+
First, ensure you have the Transformers library installed:
|
29 |
+
|
30 |
+
```bash
|
31 |
+
pip install transformers
|
32 |
+
```
|
33 |
+
|
34 |
+
### Loading the Model
|
35 |
+
You can load the model using the Transformers library:
|
36 |
+
|
37 |
+
```python
|
38 |
+
from transformers import AutoFeatureExtractor, AutoModelForImageClassification
|
39 |
+
import torch
|
40 |
+
from PIL import Image
|
41 |
+
|
42 |
+
model_name = "achedguerra/resnet-50-signal_language"
|
43 |
+
|
44 |
+
# Load the model and feature extractor
|
45 |
+
feature_extractor = AutoFeatureExtractor.from_pretrained(model_name)
|
46 |
+
model = AutoModelForImageClassification.from_pretrained(model_name)
|
47 |
+
```
|
48 |
+
|
49 |
+
### Inference
|
50 |
+
To use the model for inference:
|
51 |
+
|
52 |
+
```python
|
53 |
+
# Load and preprocess the image
|
54 |
+
image_path = "path/to/your/image.jpg"
|
55 |
+
image = Image.open(image_path)
|
56 |
+
inputs = feature_extractor(images=image, return_tensors="pt")
|
57 |
+
|
58 |
+
# Perform inference
|
59 |
+
with torch.no_grad():
|
60 |
+
outputs = model(**inputs)
|
61 |
+
|
62 |
+
# Get the predicted class
|
63 |
+
predicted_class_idx = outputs.logits.argmax(-1).item()
|
64 |
+
predicted_class = model.config.id2label[predicted_class_idx]
|
65 |
+
|
66 |
+
print(f"Predicted sign: {predicted_class}")
|
67 |
+
```
|
68 |
+
|
69 |
+
## Training Details
|
70 |
+
- Base model: microsoft/resnet-50
|
71 |
+
- Training data: Custom dataset of sign language images
|
72 |
+
- Fine-tuning process: The model was fine-tuned using transfer learning techniques to adapt it for sign language detection
|
73 |
+
|
74 |
+
## Performance
|
75 |
+
[Include any relevant performance metrics, such as accuracy, precision, recall, or F1 score]
|
76 |
+
|
77 |
+
## Limitations
|
78 |
+
- The model's performance may vary depending on the quality and lighting of input images
|
79 |
+
- It is trained on a specific set of sign language gestures and may not recognize all possible signs
|
80 |
+
|
81 |
+
## Ethical Considerations
|
82 |
+
- This model should be used to assist and enhance communication, not to replace human interpreters
|
83 |
+
- Care should be taken to ensure the model performs equally well across different skin tones and hand shapes
|
84 |
+
|
85 |
+
## Citation
|
86 |
+
If you use this model in your research or project, please cite it as follows:
|
87 |
+
|
88 |
+
```
|
89 |
+
@misc{SignLanguageDetectionModel,
|
90 |
+
author = Hugo Alejandro Guerra Peralta,
|
91 |
+
title = Sign Language Detection using Fine-tuned ResNet-50,
|
92 |
+
year = 2024,
|
93 |
+
howpublished = {\url{https://huggingface.co/achedguerra/resnet-50-signal_language}}
|
94 |
+
}
|
95 |
+
```
|
96 |
+
|
97 |
+
## Contact
|
98 |
+
For any questions or feedback, please open an issue on the model's Hugging Face repository at https://huggingface.co/achedguerra/resnet-50-signal_language or contact the author through the Hugging Face platform.
|