Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
raw
history blame contribute delete
177 Bytes
Each model configuration has different attributes; for instance, all NLP models have the hidden_size, num_attention_heads, num_hidden_layers and vocab_size attributes in common.