This means that the model will have at least 512 tokens
for context when calculating the conditional likelihood of any one token (provided there are 512 preceding tokens
available to condition on).
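
The windowing arithmetic behind this guarantee can be sketched as follows. This is a minimal illustration, not a definitive implementation: it assumes a maximum window of 1024 tokens and a stride of 512, neither of which is stated in the passage above, and `sliding_windows` is a hypothetical helper rather than a library function. Each window scores only the tokens that no earlier window has scored, and the remaining tokens in the window serve as context.

```python
def sliding_windows(seq_len, max_length=1024, stride=512):
    """Return (begin, end, n_scored, min_context) for each window.

    max_length and stride are assumed values chosen for illustration.
    n_scored is the number of tokens this window newly scores;
    min_context is the number of in-window tokens that precede the
    first newly scored token.
    """
    windows = []
    prev_end = 0  # end of the previous window: everything before it is already scored
    for begin in range(0, seq_len, stride):
        end = min(begin + max_length, seq_len)
        n_scored = end - prev_end
        min_context = prev_end - begin
        windows.append((begin, end, n_scored, min_context))
        prev_end = end
        if end == seq_len:
            break  # the last window reached the end of the sequence
    return windows


for begin, end, n_scored, min_context in sliding_windows(2048):
    print(f"window [{begin}, {end}): scores {n_scored} tokens, "
          f"context >= {min_context}")
```

Under these assumptions, every window after the first gives each scored token at least 512 tokens of context. The first window is the exception, since its earliest tokens have no preceding tokens to condition on.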