File size: 371 Bytes
cd8472c
 
 
 
 
 
 
 
 
 
 
ac4a328
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
---
base_model:
- deepseek-ai/DeepSeek-R1-0528
language:
- en
library_name: transformers
license: mit
tags:
- deepseek
- transformers
- bf16
---

[deepseek-ai/DeepSeek-R1-0528](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528) model upcasted from fp8 to bf16 with [fp8_cast_bf16.py](https://huggingface.co/deepseek-ai/DeepSeek-V3/blob/main/inference/fp8_cast_bf16.py)