Mohammad Albarham's picture

Mohammad Albarham

pain

·

https://mohammad-albarham.github.io/

AI & ML interests

Computer Vision - NLP - Multi-modality

Recent Activity

liked a model 6 days ago

Qwen/Qwen3-VL-Embedding-8B

upvoted a paper 6 days ago

MVEB: Massive Video Embedding Benchmark

upvoted a paper 9 days ago

Xiaomi Auto World Model: A Joint World Model Integrating Reconstruction and Generation for Autonomous Driving

View all activity

Organizations

upvoted a paper 6 days ago

MVEB: Massive Video Embedding Benchmark

Paper • 2606.14958 • Published 12 days ago • 15

upvoted a paper 9 days ago

Xiaomi Auto World Model: A Joint World Model Integrating Reconstruction and Generation for Autonomous Driving

Paper • 2605.18137 • Published 28 days ago • 1

upvoted an article 25 days ago

Article

Compressing Time: A Comparative Study of Video VAEs in Diffusers

Bekhouche

•

27 days ago

• 2

upvoted a paper 28 days ago

jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images

Paper • 2412.08802 • Published Dec 11, 2024 • 7

upvoted a collection about 1 month ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 225

upvoted 2 papers 3 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 175

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 141

upvoted a paper 4 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20, 2025 • 166

upvoted an article 4 months ago

Article

SigLIP 2: A better multilingual vision language encoder

+1

ariG23498, merve, qubvel-hf

•

Feb 21, 2025

• 217

upvoted a collection 7 months ago

Real-time Vision Models

A collection of real-time detectors. • 21 items • Updated 7 days ago • 24

upvoted a collection 8 months ago

AraCLIP collection

3 items • Updated Nov 4, 2025 • 1

upvoted a paper 8 months ago

3D Gaussian Splatting for Real-Time Radiance Field Rendering

Paper • 2308.04079 • Published Aug 8, 2023 • 204

upvoted an article 8 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

+3

abidlabs, znation, nouamanetazi, sasha, qgallouedec

•

Jul 29, 2025

• 225

upvoted a paper 10 months ago

LidarCLIP or: How I Learned to Talk to Point Clouds

Paper • 2212.06858 • Published Dec 13, 2022 • 3

upvoted a collection over 1 year ago

💫StarVector Models

StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated Mar 20, 2025 • 99

upvoted 2 articles over 1 year ago

Article

Open R1: How to use OlympicCoder locally for coding

+3

burtenshaw, reach-vb, lewtun, edbeeching, yagilb

•

Mar 20, 2025

• 63

Article

PyTorchModelHubMixin: Bridging the Gap for Custom AI Models on Hugging Face

not-lain

•

Nov 11, 2024

• 20

upvoted an article almost 2 years ago

Article

XetHub is joining Hugging Face!

yuchenglow, julien-c

•

Aug 8, 2024

• 118

upvoted a paper about 2 years ago

YaART: Yet Another ART Rendering Technology

Paper • 2404.05666 • Published Apr 8, 2024 • 18

upvoted a paper over 2 years ago

CIDAR: Culturally Relevant Instruction Dataset For Arabic

Paper • 2402.03177 • Published Feb 5, 2024 • 8