plaguss/Mistral-7B-v0.1-Math-Shepherd-PRM-0.2

Token Classification

Generated from Trainer

stepwise-reward-trainer

text-generation-inference


14.2 GB

1 contributor

History: 4 commits

Training in progress, step 500

df2266a verified 11 months ago

File                                Size             Last commit                       Updated
.gitattributes                      1.52 kB          initial commit                    11 months ago
config.json                         673 Bytes        Training in progress, step 500    11 months ago
model-00001-of-00003.safetensors    4.94 GB (xet)    Training in progress, step 500    11 months ago
model-00002-of-00003.safetensors    5 GB (xet)       Training in progress, step 500    11 months ago
model-00003-of-00003.safetensors    4.28 GB (xet)    Training in progress, step 500    11 months ago
model.safetensors.index.json        24 kB            Training in progress, step 500    11 months ago
special_tokens_map.json             437 Bytes        Training in progress, step 500    11 months ago
tokenizer.json                      3.51 MB          Training in progress, step 500    11 months ago
tokenizer_config.json               1.03 kB          Training in progress, step 500    11 months ago
training_args.bin                   6.78 kB (xet)    Training in progress, step 500    11 months ago

training_args.bin: Detected Pickle imports (14)
- "trl.trainer.stepwise_reward_config.StepwiseRewardConfig",
- "transformers.trainer_utils.IntervalStrategy",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.SaveStrategy",
- "transformers.training_args.OptimizerNames",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "accelerate.state.PartialState",
- "torch.device",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "torch.bfloat16",
- "transformers.trainer_utils.HubStrategy",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig"
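
The pickle imports reported for training_args.bin are the module-level names a pickle file would import when loaded. A list like this can be produced by reading the pickle opcodes without ever unpickling the file. The following is a minimal sketch using only the Python standard library. The helper name `list_pickle_imports` is hypothetical, the sample data is a small stand-in rather than the real training_args.bin, and this is not necessarily how the Hub's own scanner works. It also assumes the module and attribute strings are pushed directly before each `STACK_GLOBAL` opcode, so names fetched from the pickle memo would be missed.

```python
import pickle
import pickletools
from fractions import Fraction

# Opcodes that push a text string; with protocol 4+, the module and
# attribute names are pushed this way before a STACK_GLOBAL opcode.
_STRING_OPS = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}


def list_pickle_imports(data: bytes) -> list[str]:
    """Return the sorted 'module.name' globals referenced by a pickle stream.

    The stream is only disassembled, never loaded, so no code from the
    pickle is executed.
    """
    imports = []
    recent_strings = []
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name in _STRING_OPS:
            recent_strings.append(arg)
        elif opcode.name == "GLOBAL":
            # Older protocols: the argument is "module name" in one string.
            module, name = arg.split(" ", 1)
            imports.append(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL" and len(recent_strings) >= 2:
            # Heuristic: take the two most recently pushed strings.
            module, name = recent_strings[-2:]
            imports.append(f"{module}.{name}")
    return sorted(set(imports))


# Stand-in data: a pickled Fraction instead of a real training_args.bin.
sample = pickle.dumps(Fraction(1, 2), protocol=4)
print(list_pickle_imports(sample))
```

For a real file, the bytes could be read with `open(path, "rb").read()` and passed to the same helper. Because unpickling can run arbitrary code, inspecting the imports first is a reasonable precaution before loading a file such as training_args.bin from an untrusted source.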