arunprsh commited on
Commit
fc6ad7a
1 Parent(s): a9b58fb

updating readme

Browse files
Files changed (1) hide show
  1. README.md +8 -1
README.md CHANGED
@@ -4,4 +4,11 @@ language:
4
  - en
5
  tags:
6
  - medical
7
- ---
 
 
 
 
 
 
 
 
4
  - en
5
  tags:
6
  - medical
7
+ ---
8
+ Dataset: https://www.kaggle.com/datasets/timmayer/covid-news-articles-2020-2022
9
+
10
+ Comprehensive guide can be found here: https://medium.com/@shankar.arunp/easily-build-your-own-gpt-from-scratch-using-aws-51811b6355d3
11
+
12
+ The model is GPT2 further pre-trained on the news articles to incorporate COVID-19 related context to the model.
13
+
14
+ Similar article on how to further pre-train a BERT base model from scratch using the articles can be found here: https://medium.com/@shankar.arunp/training-bert-from-scratch-on-your-custom-domain-data-a-step-by-step-guide-with-amazon-25fcbee4316a