Adjust number of reserved tokens to match the model
#15 · opened by dzhulgakov
>>> import transformers
>>> transformers.AutoTokenizer.from_pretrained("moonshotai/Kimi-K2-Instruct", trust_remote_code=True).vocab_size
163842
But the model itself has a vocab size of 163840.
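For comparison, the model side can be checked the same way (a quick sketch; it assumes the checkpoint's config exposes vocab_size, which standard configs do):

>>> transformers.AutoConfig.from_pretrained("moonshotai/Kimi-K2-Instruct", trust_remote_code=True).vocab_size
163840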
I think this +2 is not needed then. It doesn't affect tokenization, since those two extra IDs are regular reserved tokens.
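To verify, here's a sketch for inspecting what the two trailing IDs map to (assuming the tokenizer implements the standard convert_ids_to_tokens API, which PreTrainedTokenizer subclasses do):

>>> tok = transformers.AutoTokenizer.from_pretrained("moonshotai/Kimi-K2-Instruct", trust_remote_code=True)
>>> tok.convert_ids_to_tokens([163840, 163841])  # expected: two reserved placeholder tokens, not real vocabulary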