Ziyang Luo

Ziyang

https://chiyeunglaw.github.io/

AI & ML interests

Agents, LLMs, Multimodal ML

Recent Activity

authored a paper 4 days ago

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published 5 days ago • 29

upvoted a paper 4 days ago

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published 5 days ago • 29

commented on a paper 4 days ago

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published 5 days ago • 29

authored 4 papers about 2 months ago

upvoted a paper about 2 months ago

GTA1: GUI Test-time Scaling Agent

Paper • 2507.05791 • Published Jul 8 • 25

updated a dataset about 2 months ago

CodeResearch/tree-of-evol-75k

Viewer • Updated Jul 5 • 75.4k • 35

published a dataset about 2 months ago

CodeResearch/tree-of-evol-75k

Viewer • Updated Jul 5 • 75.4k • 35

published 3 models about 2 months ago

CodeResearch/tree-of-evol-14b

15B • Updated Dec 24, 2024 • 4

CodeResearch/tree-of-evol-7b

8B • Updated Dec 24, 2024 • 4

CodeResearch/tree-of-evol-1.5b

2B • Updated Dec 19, 2024 • 8

upvoted an article about 2 months ago

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

Article • Jun 21 • 67

upvoted an article 2 months ago

GRPO for GUI Grounding Done Right

Article • Jun 11 • 30

upvoted an article 3 months ago

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

Article • Jun 6 • 53

authored a paper 4 months ago

ScratchEval: Are GPT-4o Smarter than My Child? Evaluating Large Multimodal Models with Visual Programming Challenges

Paper • 2411.18932 • Published Nov 28, 2024 • 1

upvoted a paper 4 months ago

ScratchEval: Are GPT-4o Smarter than My Child? Evaluating Large Multimodal Models with Visual Programming Challenges

Paper • 2411.18932 • Published Nov 28, 2024 • 1

commented on Tiny Agents: a MCP-powered agent in 50 lines of code 4 months ago

🔥🔥🔥

upvoted an article 4 months ago

Tiny Agents: a MCP-powered agent in 50 lines of code

Article • Apr 25 • 295
