14 36 48

XuHao Hu

Foreshhh

AI & ML interests

NLP MM

Recent Activity

liked a dataset 1 day ago

wanlilll/WeaveBench

upvoted a paper 4 days ago

PhoneWorld: Scaling Phone-Use Agent Environments

upvoted a paper 15 days ago

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

View all activity

Organizations

New activity in mPLUG/ToolCUA-8B 29 days ago

Create README.md

#1 opened 29 days ago by

Foreshhh

commented a paper 8 months ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published Oct 9, 2025 • 23 •

New activity in OpenSafetyLab/MD-Judge-v0_2-internlm2_7b 10 months ago

Unable to download the model

#2 opened over 1 year ago by

sriharshasurineni

New activity in Foreshhh/vlsbench 10 months ago

[bot] Conversion to Parquet

#1 opened over 1 year ago by

parquet-converter

Question about image files - no images found when loading dataset

#2 opened 10 months ago by

leo1200213

New activity in OpenSafetyLab/t2i_safety_dataset 10 months ago

Improve dataset card for T2ISafety benchmark

#1 opened 11 months ago by

nielsr

commented a paper over 1 year ago

VLSBench: Unveiling Visual Leakage in Multimodal Safety

Paper • 2411.19939 • Published Nov 29, 2024 • 10 •

New activity in OpenSafetyLab/Salad-Data over 1 year ago

Paraphrased questions from the base set in a new dataset

👍 2

#2 opened over 1 year ago by

skirdey-inflection

New activity in OpenSafetyLab/MD-Judge-v0_2-internlm2_7b almost 2 years ago

It's a great model, and I have a few questions.

#1 opened almost 2 years ago by

chelcy

New activity in OpenSafetyLab/MD-Judge-v0.1 about 2 years ago

Batch inference via huggingface.

#2 opened about 2 years ago by

anki08

New activity in OpenSafetyLab/Salad-Bench-Leaderboard about 2 years ago

What is `dommension`?

#1 opened about 2 years ago by

zhiminy