Adaptive Caching for Faster Video Generation with Diffusion Transformers Paper • 2411.02397 • Published Nov 4, 2024
RoRA-VLM: Robust Retrieval-Augmented Vision Language Models Paper • 2410.08876 • Published Oct 11, 2024
Efficient Streaming Language Models with Attention Sinks Paper • 2309.17453 • Published Sep 29, 2023