GPT43M_30K / README.md
PK03's picture
Update README.md
0dd5f55 verified
|
raw
history blame
132 Bytes
metadata
license: mit

This is a encoder only Tranformer model with 43 Million parameters It was trained on around 4 Million tokens