Update README.md
Browse files
README.md
CHANGED
|
@@ -15,7 +15,7 @@ C-RADIOv2 models are available in multiple sizes:
|
|
| 15 |
* Base (90M parameters).
|
| 16 |
* Large (320M parameters).
|
| 17 |
* Huge (653M parameters).
|
| 18 |
-
* Gigantic (1.
|
| 19 |
|
| 20 |
C-RADIOv2 was trained for 1M steps (400k more steps than v1), using inverse frequency sampling for data balancing, and [PHI Standardization](https://arxiv.org/abs/2410.01680) for teacher distribution balancing.
|
| 21 |
|
|
@@ -134,7 +134,7 @@ The resulting tensor will have shape `(B,D,H,W)`, as is typically seen with comp
|
|
| 134 |
* https://huggingface.co/nvidia/C-RADIOv2-B
|
| 135 |
* https://huggingface.co/nvidia/C-RADIOv2-L
|
| 136 |
* https://huggingface.co/nvidia/C-RADIOv2-H
|
| 137 |
-
* https://huggingface.co/nvidia/C-RADIOv2-
|
| 138 |
|
| 139 |
# Training and Evaluation Datasets
|
| 140 |
|
|
|
|
| 15 |
* Base (90M parameters).
|
| 16 |
* Large (320M parameters).
|
| 17 |
* Huge (653M parameters).
|
| 18 |
+
* Gigantic (1.1B parameters).
|
| 19 |
|
| 20 |
C-RADIOv2 was trained for 1M steps (400k more steps than v1), using inverse frequency sampling for data balancing, and [PHI Standardization](https://arxiv.org/abs/2410.01680) for teacher distribution balancing.
|
| 21 |
|
|
|
|
| 134 |
* https://huggingface.co/nvidia/C-RADIOv2-B
|
| 135 |
* https://huggingface.co/nvidia/C-RADIOv2-L
|
| 136 |
* https://huggingface.co/nvidia/C-RADIOv2-H
|
| 137 |
+
* https://huggingface.co/nvidia/C-RADIOv2-g
|
| 138 |
|
| 139 |
# Training and Evaluation Datasets
|
| 140 |
|