typo fix
README.md
CHANGED
@@ -327,7 +327,7 @@ Granite-3.0-2B-Instruct is based on a decoder-only dense transformer architectur
 | Initialization std | **0.1** | 0.1 | 0.1 | 0.1 |
 | Sequence Length | **4096** | 4096 | 4096 | 4096 |
 | Position Embedding | **RoPE** | RoPE | RoPE | RoPE |
-| #
+| # Parameters | **2.5B** | 8.1B | 1.3B | 3.3B |
 | # Active Parameters | **2.5B** | 8.1B | 400M | 800M |
 | # Training tokens | **12T** | 12T | 10T | 10T |
 