Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
raw
history blame contribute delete
138 Bytes
This is true even if you're training the model further - you will probably get the best
performance if you keep the chat tokens constant.