Sihan XU's picture

Sihan XU

sihanxu

·

https://sihanxu.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

upvoted a paper about 1 month ago

WildRayZer: Self-supervised Large View Synthesis in Dynamic Environments

upvoted a paper about 1 month ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

View all activity

Organizations

upvoted a paper 1 day ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 2 days ago • 64

upvoted 2 papers about 1 month ago

WildRayZer: Self-supervised Large View Synthesis in Dynamic Environments

Paper • 2601.10716 • Published Jan 15 • 4

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Paper • 2601.16208 • Published Jan 22 • 53

upvoted 3 papers 3 months ago

Bidirectional Normalizing Flow: From Data to Noise and Back

Paper • 2512.10953 • Published Dec 11, 2025 • 7

Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation

Paper • 2512.16913 • Published Dec 18, 2025 • 34

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 87

upvoted a collection 3 months ago

NEPA

5 items • Updated Dec 19, 2025 • 11

upvoted a paper 5 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 166

upvoted 2 papers over 1 year ago

Multi-Object Hallucination in Vision-Language Models

Paper • 2407.06192 • Published Jul 8, 2024 • 12

DiffusionPDE: Generative PDE-Solving Under Partial Observation

Paper • 2406.17763 • Published Jun 25, 2024 • 24

upvoted a paper about 2 years ago

Inversion-Free Image Editing with Natural Language

Paper • 2312.04965 • Published Dec 7, 2023 • 2