Update README.md
README.md
CHANGED
@@ -947,9 +947,9 @@ Click here to try the online demo of [MiniCPM-o 2.6](https://minicpm-omni-webdem
 Inference using Huggingface transformers on NVIDIA GPUs. Please ensure that `transformers==4.44.2` is installed, as other versions may have compatibility issues. We are investigating this issue. Requirements tested on python 3.10:
 ```
 Pillow==10.1.0
-torch==2.
-torchaudio==2.
-torchvision==0.
+torch==2.3.1
+torchaudio==2.3.1
+torchvision==0.18.1
 transformers==4.44.2
 librosa==0.9.0
 soundfile==0.12.1
@@ -986,8 +986,13 @@ tokenizer = AutoTokenizer.from_pretrained('openbmb/MiniCPM-o-2_6', trust_remote_
 
 # In addition to vision-only mode, tts processor and vocos also needs to be initialized
 model.init_tts()
+```
+
+If you are using an older version of PyTorch, you might encounter this issue `"weight_norm_fwd_first_dim_kernel" not implemented for 'BFloat16'`, Please convert the TTS to float32 type.
+```python
 model.tts.float()
 ```
+
 ### Omni mode
 we provide two inference modes: chat and streaming
 
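
The float32 workaround added in this change could also be applied only when the error actually appears, rather than unconditionally. The following is a minimal sketch under stated assumptions, not part of the commit: the helper name `run_with_tts_float32_fallback` and the `run` callable are hypothetical, and it assumes the error surfaces as a `RuntimeError` whose message contains the kernel name quoted in the README. The only model call it relies on is `model.tts.float()`, which the README itself shows.

```python
from typing import Any, Callable

# Substring of the error message quoted in the README change.
BF16_WEIGHT_NORM_ERROR = "weight_norm_fwd_first_dim_kernel"


def run_with_tts_float32_fallback(model: Any, run: Callable[[], Any]) -> Any:
    """Call run(); on the bfloat16 weight_norm error, cast the TTS module
    to float32 and retry exactly once.

    `model` is assumed to expose a `tts` attribute with a `.float()` method,
    as in the README snippet. `run` is a hypothetical zero-argument callable
    wrapping whichever step triggers the error.
    """
    try:
        return run()
    except RuntimeError as err:
        # Re-raise anything that is not the specific error from the README.
        if BF16_WEIGHT_NORM_ERROR not in str(err):
            raise
        model.tts.float()  # convert the TTS to float32, per the README
        return run()
```

On a PyTorch version where the kernel is implemented for bfloat16, `run` succeeds on the first attempt and the TTS module keeps its original dtype. Unrelated errors propagate unchanged.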