OmarAlterkait 's Collections
MaskBit: Embedding-free Image Generation via Bit Tokens
Paper
• 2409.16211
• Published
• 17
Goku: Flow Based Video Generative Foundation Models
Paper
• 2502.04896
• Published
• 106
Discrete Audio Tokens: More Than a Survey!
Paper
• 2506.10274
• Published
• 32
HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based
Diffusion Sampling
Paper
• 2506.20452
• Published
• 19
Radial Attention: O(nlog n) Sparse Attention with Energy Decay for
Long Video Generation
Paper
• 2506.19852
• Published
• 42
Franca: Nested Matryoshka Clustering for Scalable Visual Representation
Learning
Paper
• 2507.14137
• Published
• 35
InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis
Paper
• 2509.10441
• Published
• 31
Vision Transformers Don't Need Trained Registers
Paper
• 2506.08010
• Published
• 22
Ming-UniVision: Joint Image Understanding and Generation with a Unified
Continuous Tokenizer
Paper
• 2510.06590
• Published
• 77
Advancing End-to-End Pixel Space Generative Modeling via Self-supervised
Pre-training
Paper
• 2510.12586
• Published
• 113