ibm-granite
/

granite-3.0-8b-base

Text Generation

Model card Files Files and versions Community

amezasor commited on 19 days ago

Commit

59d18c1

•

1 Parent(s): 5e9c728

typo fix

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -253,8 +253,8 @@ output = tokenizer.batch_decode(output)
 print(output)
 ```
-**Model Architeture:**
-Granite-3.0-8B-Base is based on a decoder-only dense transformer architecture. Core components of this architecture are: GQA and RoPE, MLP with SwiGLU, RMSNorm, and shared input/output embbeddings.
 | Model                     | 2B Dense | 8B Dense     | 1B MoE | 3B MoE |
 | :--------                 | :--------| :--------    | :------| :------|

 print(output)
 ```
+**Model Architecture:**
+Granite-3.0-8B-Base is based on a decoder-only dense transformer architecture. Core components of this architecture are: GQA and RoPE, MLP with SwiGLU, RMSNorm, and shared input/output embeddings.
 | Model                     | 2B Dense | 8B Dense     | 1B MoE | 3B MoE |
 | :--------                 | :--------| :--------    | :------| :------|