ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents Paper • 2507.22827 • Published Jul 2025 • 94
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations Paper • 2506.18898 • Published Jun 23, 2025 • 33
Multimodal Long Video Modeling Based on Temporal Dynamic Context Paper • 2504.10443 • Published Apr 14, 2025 • 4
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention Paper • 2303.16199 • Published Mar 28, 2023 • 4
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model Paper • 2304.15010 • Published Apr 28, 2023 • 4
Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation Paper • 2502.16707 • Published Feb 23, 2025 • 13
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? Paper • 2412.02611 • Published Dec 3, 2024 • 24
Remember, Retrieve and Generate: Understanding Infinite Visual Concepts as Your Personalized Assistant Paper • 2410.13360 • Published Oct 17, 2024 • 9
OneLLM: One Framework to Align All Modalities with Language Paper • 2312.03700 • Published Dec 6, 2023 • 24
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models Paper • 2311.07575 • Published Nov 13, 2023 • 15