amezasor commited on
Commit
59d18c1
1 Parent(s): 5e9c728
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -253,8 +253,8 @@ output = tokenizer.batch_decode(output)
253
  print(output)
254
  ```
255
 
256
- **Model Architeture:**
257
- Granite-3.0-8B-Base is based on a decoder-only dense transformer architecture. Core components of this architecture are: GQA and RoPE, MLP with SwiGLU, RMSNorm, and shared input/output embbeddings.
258
 
259
  | Model | 2B Dense | 8B Dense | 1B MoE | 3B MoE |
260
  | :-------- | :--------| :-------- | :------| :------|
 
253
  print(output)
254
  ```
255
 
256
+ **Model Architecture:**
257
+ Granite-3.0-8B-Base is based on a decoder-only dense transformer architecture. Core components of this architecture are: GQA and RoPE, MLP with SwiGLU, RMSNorm, and shared input/output embeddings.
258
 
259
  | Model | 2B Dense | 8B Dense | 1B MoE | 3B MoE |
260
  | :-------- | :--------| :-------- | :------| :------|