PersonaVLM: Long-Term Personalized Multimodal LLMs (CVPR 2026)
🎉 News: Our paper "PersonaVLM: Long-Term Personalized Multimodal LLMs" is accepted to CVPR 2026!
🌟 Introduction
PersonaVLM is an innovative personalized multimodal agent framework designed for long-term personalization. It transforms a general-purpose MLLM into a personalized assistant by integrating three key capabilities:
- Remembering: Proactively extracts and summarizes multimodal memories into a personalized database.
- Reasoning: Conducts multi-turn reasoning by retrieving relevant memories from a multi-type memory architecture (core, semantic, episodic, and procedural).
- Response Alignment: Infers the user's evolving personality using a Momentum-based Personality Evolving Mechanism (PEM) to ensure aligned outputs.
📊 Persona-MME Benchmark
We establish Persona-MME, a comprehensive benchmark comprising over 2,000 curated interaction cases across 14 fine-grained tasks to assess long-term MLLM personalization.
🔗 Official Resources
This project consists of several components. You can access the model weights, training data, benchmark, and code via the links below:
| Resource | Link |
|---|---|
| 🌐 Project Page | https://PersonaVLM.github.io |
| 💻 Official Code | GitHub: PersonaVLM |
| 🤗 Model Weights | HF: PersonaVLM (Qwen2.5-VL-7B) |
| 📊 Benchmark | HF: Persona-MME (2,000+ cases) |
| 📂 Training Data | HF: PersonaVLM-Dataset (80k+ samples) |
✒️ Citation
If you find our work helpful, please cite our paper:
@inproceedings{nie2026personavlm,
title={PersonaVLM: Long-Term Personalized Multimodal LLMs},
author={Nie, Chang and Fu, Chaoyou and Zhang, Yifan and Yang, Haihua and Shan, Caifeng},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2026},
url={http://arxiv.org/abs/2604.13074}
}
- Downloads last month
- 28
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for ClareNie/PersonaVLM
Datasets used to train ClareNie/PersonaVLM
Viewer • Updated • 4.54k • 684 • 11
ClareNie/PersonaVLM-Dataset
Viewer • Updated • 33.3k • 107 • 16