GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models • Paper • 2508.06471 • Published 16 days ago • 158
GLM-4.5 • Collection • GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated 14 days ago • 219
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning • Paper • 2506.18841 • Published Jun 23 • 56
OpenSAE-LLaMA-3.1-8B • Collection • OpenSAE checkpoints for LLaMA 3.1 8B base model • 38 items • Updated Jan 29 • 5
VerIF • Collection • RL trained models and datasets for instruction-following • 7 items • Updated Jun 12 • 4
VerIF: Verification Engineering for Reinforcement Learning in Instruction Following • Paper • 2506.09942 • Published Jun 11 • 6
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis • Paper • 2506.04142 • Published Jun 4 • 27
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models • Paper • 2506.04180 • Published Jun 4 • 33
Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models • Paper • 2505.20152 • Published May 26 • 11
An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes • Paper • 2504.15270 • Published Apr 21 • 10
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation • Paper • 2412.21059 • Published Dec 30, 2024 • 18
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training • Paper • 2503.08525 • Published Mar 11 • 17
LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models • Paper • 2502.14834 • Published Feb 20 • 24
Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament • Paper • 2501.13007 • Published Jan 22 • 20
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks • Paper • 2412.15204 • Published Dec 19, 2024 • 38