Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models • arXiv:2507.12566 • Published Jul 16, 2025
AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning • arXiv:2507.12841 • Published Jul 17, 2025
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers • arXiv:2506.23918 • Published Jun 30, 2025
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning • arXiv:2507.01006 • Published Jul 1, 2025
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs • arXiv:2506.14429 • Published Jun 17, 2025
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents • arXiv:2506.11763 • Published Jun 13, 2025
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning • arXiv:2506.10521 • Published Jun 12, 2025
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention • arXiv:2506.13585 • Published Jun 16, 2025
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models • arXiv:2506.05176 • Published Jun 5, 2025
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics • arXiv:2506.04308 • Published Jun 4, 2025
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces • arXiv:2506.00123 • Published May 30, 2025
ZeroGUI: Automating Online GUI Learning at Zero Human Cost • arXiv:2505.23762 • Published May 29, 2025
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis • arXiv:2505.13227 • Published May 19, 2025