Mohammad Albarham's picture

Mohammad Albarham

pain

·

https://mohammad-albarham.github.io/

AI & ML interests

Computer Vision - NLP - Multi-modality

Recent Activity

upvoted a paper 2 days ago

Xiaomi Auto World Model: A Joint World Model Integrating Reconstruction and Generation for Autonomous Driving

liked a Space 2 days ago

build-small-hackathon/small-talk

liked a model 6 days ago

jinaai/jina-clip-v2

View all activity

Organizations

upvoted a paper 2 days ago

Xiaomi Auto World Model: A Joint World Model Integrating Reconstruction and Generation for Autonomous Driving

Paper • 2605.18137 • Published 21 days ago • 1

liked a Space 2 days ago

Small Talk

An AI-to-AI robot podcast hosted by Reachy Minis

liked a model 6 days ago

jinaai/jina-clip-v2

Feature Extraction • 0.9B • Updated Apr 8 • 31.9k • 333

upvoted an article 18 days ago

Article

Compressing Time: A Comparative Study of Video VAEs in Diffusers

Bekhouche

•

20 days ago

• 2

upvoted a paper 20 days ago

jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images

Paper • 2412.08802 • Published Dec 11, 2024 • 7

upvoted a collection 30 days ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 222

liked a model about 2 months ago

moonshotai/Kimi-K2.6

Image-Text-to-Text • 1.1T • Updated 29 days ago • 1.97M • • 1.46k

upvoted a paper 3 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 175

liked a Space 3 months ago

Easytranscriber Demo

Interactive transcription with highlighting of spoken words

upvoted 2 papers 3 months ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 140

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20, 2025 • 166

upvoted an article 3 months ago

Article

SigLIP 2: A better multilingual vision language encoder

+1

ariG23498, merve, qubvel-hf

•

Feb 21, 2025

• 217

liked a model 4 months ago

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6, 2025 • 348k • • 629

liked a Space 6 months ago

Background Removal

Remove backgrounds from images instantly

upvoted a collection 7 months ago

Real-time Vision Models

A collection of real-time detectors. • 20 items • Updated Feb 18 • 24

reacted to their post with ❤️ 7 months ago

Post

2682

We have published an excellent paper for Arabic CLIP model.

Paper link:
https://aclanthology.org/2024.arabicnlp-1.9/

More information in this website:
https://arabic-clip.github.io/Arabic-CLIP/

All datasets, models, and demo are published to Huggingface:

Arabic-Clip

The codes are published to github:
https://github.com/Arabic-Clip/Arabic-CLIP

updated a collection 7 months ago

AraCLIP collection

3 items • Updated Nov 4, 2025 • 1

upvoted a collection 7 months ago

AraCLIP collection

3 items • Updated Nov 4, 2025 • 1

upvoted a paper 7 months ago

3D Gaussian Splatting for Real-Time Radiance Field Rendering

Paper • 2308.04079 • Published Aug 8, 2023 • 204

updated a model 8 months ago

pain/dinov3-smallplus-mask2former-v1.0-3000_samples-12-classes-enhanced-diff-lr

Updated Oct 23, 2025 • 1