Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • 免费去水印

  • Log In
  • Sign Up
Wenxuan Huang's picture
3 7 3

Wenxuan Huang

Osilly
gaotiexinqu's profile picture mir2mir's profile picture Stars321123's profile picture
·
  • Osilly

AI & ML interests

None yet

Organizations

VLM-Reasoning's profile picture Actial's profile picture

upvoted a paper 2 months ago

Actial: Activate Spatial Reasoning Ability of Multimodal Large Language Models

Paper • 2511.01618 • Published Nov 3, 2025 • 10
upvoted a collection 3 months ago

Dynamic-LLaVA

Collection
5 items • Updated Sep 18, 2025 • 2
upvoted a paper 3 months ago

Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models

Paper • 2510.01304 • Published Oct 1, 2025 • 10
upvoted a paper 4 months ago

Interleaving Reasoning for Better Text-to-Image Generation

Paper • 2509.06945 • Published Sep 8, 2025 • 14
upvoted a collection 5 months ago

Vision-R1

Collection
7 items • Updated Jul 17, 2025 • 2
upvoted 2 papers 9 months ago

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Paper • 2503.06749 • Published Mar 9, 2025 • 31

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10, 2025 • 46
Company
TOS Privacy About Careers
Website
Models Datasets 免费Z-image图片生成 免费去水印 Vibevoice

🎉 Free Image Generator Now Available!

Totally Free + Zero Barriers + No Login Required