I love GLM! But please consider a longer context length

by mahmood36

I'm merely an enthusiast; my words might mean nothing, and I can't even run the model locally.
But recently I tried the model through OpenRouter and, wow, it outperformed every other model I've used, including closed-source ones. It is very reminiscent of Gemini 2.5 in its early days (before it was later nerfed). The only thing missing is a longer context length; with that, it would be SOTA in every way. In my use case (help with academic study and synthesis of math/CS material), it was brilliant and outperformed every other model, including the recent Qwen3. But the most recent Qwen3 supports a context length of up to 1M tokens, and even offers that in very small models that I can run locally.

Thank you for your work! Please stay open source! You are our best hope!!

I would also like to mention that even though I have a 4090, I can't run the model locally at a reasonable quant. I tried Q4, but that alone requires 80+ GB of memory, excluding the KV cache for context.
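For anyone else wondering why a 24 GB card doesn't cut it, here is a rough back-of-the-envelope sketch of the arithmetic. All the model numbers below (parameter count, layer/head counts) are placeholders I picked for illustration, not the actual GLM config; the real values live in the model's config.json.

```python
# Rough VRAM estimate for a quantized LLM plus its KV cache.
# Every model-specific number here is a made-up example value --
# substitute the real figures from the model's config.json.

def weight_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GB."""
    return n_params * bits_per_weight / 8 / 1e9

def kv_cache_gb(n_layers: int, n_kv_heads: int, head_dim: int,
                context_len: int, bytes_per_elem: int = 2) -> float:
    """KV cache in GB: K and V tensors (factor of 2) per layer per token."""
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_elem / 1e9

# Hypothetical ~150B-parameter model at ~4.5 bits/weight (Q4_K_M-ish):
print(f"weights:       {weight_gb(150e9, 4.5):.0f} GB")            # ~84 GB
# Hypothetical config: 60 layers, 8 KV heads, head_dim 128, fp16 cache:
print(f"KV @ 128k ctx: {kv_cache_gb(60, 8, 128, 128_000):.0f} GB") # ~31 GB
```

So under these assumed numbers, the weights alone land in the 80+ GB range I saw, and a long context adds tens of GB of KV cache on top; this is also why longer context lengths and quantized KV caches matter so much for local use.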

This is not a complaint, merely a humble wish, so please don't feel criticized; even in its current state, I wouldn't have expected an open-source model to be this good in the first place.

Thank you again! I appreciate your work!
