Update README.md
Browse files
README.md
CHANGED
|
@@ -19,7 +19,7 @@ RedPajama-Base-INCITE-6.9B-v1, is a large transformer-based language model devel
|
|
| 19 |
|
| 20 |
## GPU Inference
|
| 21 |
|
| 22 |
-
This requires a GPU with
|
| 23 |
```python
|
| 24 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
| 25 |
# init
|
|
@@ -35,7 +35,7 @@ print(output_str)
|
|
| 35 |
|
| 36 |
## GPU Inference in Int8
|
| 37 |
|
| 38 |
-
This requires a GPU with
|
| 39 |
|
| 40 |
```python
|
| 41 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
|
|
|
| 19 |
|
| 20 |
## GPU Inference
|
| 21 |
|
| 22 |
+
This requires a GPU with 16GB memory.
|
| 23 |
```python
|
| 24 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
| 25 |
# init
|
|
|
|
| 35 |
|
| 36 |
## GPU Inference in Int8
|
| 37 |
|
| 38 |
+
This requires a GPU with 12GB memory.
|
| 39 |
|
| 40 |
```python
|
| 41 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|