Shehan Munasinghe

shehan97
https://shehanmunasinghe.github.io/
  • shehan_u_e_m
  • shehanmunasinghe

AI & ML interests

Computer Vision, Multi-modal learning

Recent Activity

authored a paper about 1 month ago
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
upvoted a paper 6 months ago
Sekai: A Video Dataset towards World Exploration
upvoted a paper 7 months ago
CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark

Organizations

Mohamed Bin Zayed University of Artificial Intelligence

authored a paper about 1 month ago

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Paper • 2411.04923 • Published Nov 7, 2024 • 23
authored a paper about 2 years ago

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models

Paper • 2311.13435 • Published Nov 22, 2023 • 18