arunprsh
commited on
Commit
•
fc6ad7a
1
Parent(s):
a9b58fb
updating readme
Browse files
README.md
CHANGED
@@ -4,4 +4,11 @@ language:
|
|
4 |
- en
|
5 |
tags:
|
6 |
- medical
|
7 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
4 |
- en
|
5 |
tags:
|
6 |
- medical
|
7 |
+
---
|
8 |
+
Dataset: https://www.kaggle.com/datasets/timmayer/covid-news-articles-2020-2022
|
9 |
+
|
10 |
+
Comprehensive guide can be found here: https://medium.com/@shankar.arunp/easily-build-your-own-gpt-from-scratch-using-aws-51811b6355d3
|
11 |
+
|
12 |
+
The model is GPT2 further pre-trained on the news articles to incorporate COVID-19 related context to the model.
|
13 |
+
|
14 |
+
Similar article on how to further pre-train a BERT base model from scratch using the articles can be found here: https://medium.com/@shankar.arunp/training-bert-from-scratch-on-your-custom-domain-data-a-step-by-step-guide-with-amazon-25fcbee4316a
|