Fix num_experts to match saved weights

#2
by echarlaix HF Staff - opened
Optimum Intel Internal Testing org

The saved weights have 128 experts (matching num_local_experts: 128) but num_experts was set to 4, causing a size mismatch error when loading with transformers >= 5.6.

Fix: set num_experts: 128 to match the actual weight shapes. Both num_experts and num_local_experts are now 128, ensuring compatibility with transformers 5.6 and 5.10.

echarlaix changed pull request status to open
echarlaix changed pull request status to merged

Sign up or log in to comment

Free AI Image Generator No sign-up. Instant results. Open Now