Sparsified State-Space Models are Efficient Highway Networks Paper • 2505.20698 • Published May 27 • 2
Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs Paper • 2404.10308 • Published Apr 16, 2024
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation Paper • 2508.05399 • Published 17 days ago • 16
ReVISE: Learning to Refine at Test-Time via Intrinsic Self-Verification Paper • 2502.14565 • Published Feb 20