mlx-community/medgemma-27b-text-it-8bit does not stop generating output

#1
by sjug - opened
MLX Community org

On the plus side the output of the 8bit quant seems plausibly correct.
The downside is that gets to <end_of_turn>thought and continues generating over and over with no end.

Once again I don't think that mlx-lm supports MedGemma yet.

MLX Community org

Any idea about this. Trying the 4bit version without much success.

MLX Community org

@horcle I think we'd need to open an issue with mlx-lm to get this fixed.

Sign up or log in to comment