Update README.md
Browse files
README.md
CHANGED
@@ -19,7 +19,7 @@ RedPajama-Base-INCITE-6.9B-v1, is a large transformer-based language model devel
|
|
19 |
|
20 |
## GPU Inference
|
21 |
|
22 |
-
This requires a GPU with
|
23 |
```python
|
24 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
25 |
# init
|
@@ -35,7 +35,7 @@ print(output_str)
|
|
35 |
|
36 |
## GPU Inference in Int8
|
37 |
|
38 |
-
This requires a GPU with
|
39 |
|
40 |
```python
|
41 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
|
|
19 |
|
20 |
## GPU Inference
|
21 |
|
22 |
+
This requires a GPU with 16GB memory.
|
23 |
```python
|
24 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
25 |
# init
|
|
|
35 |
|
36 |
## GPU Inference in Int8
|
37 |
|
38 |
+
This requires a GPU with 12GB memory.
|
39 |
|
40 |
```python
|
41 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|