Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wang Chengyao's picture
3 10 15

Wang Chengyao

wcy1122
Nakamotosatoshi's profile picture Norod78's profile picture sudanenator's profile picture
·
https://wcy1122.github.io/
  • wcy1122
  • wcy1122

AI & ML interests

Multimodal Intelligence

Recent Activity

updated a Space 3 days ago
wcy1122/MGM-Omni
new activity 4 days ago
wcy1122/MGM-Omni:Thanks a Million for This HF Space!
reacted to their post with 🚀 4 days ago
🚀 Introducing MGM-Omni, an omni-chatbot capable of processing text, image, video, and speech inputs, and can generate both text and speech responses. 👂 MGM-Omni support hour-level audio understanding. 🗣️ MGM-Omni support 10-minute speech generation and voice cloning. For more details, please check: 📝 Blog: https://mgm-omni.notion.site/MGM-Omni-An-Open-source-Omni-Chatbot-2395728e0b0180149ac9f24683fc9907 🌟 Code: https://github.com/dvlab-research/MGM-Omni 🤖 Model: https://huggingface.co/collections/wcy1122/mgm-omni-6896075e97317a88825032e1 🎮 Demo: https://huggingface.co/spaces/wcy1122/MGM-Omni
View all activity

Organizations

ZeroGPU Explorers's profile picture

New activity in wcy1122/MGM-Omni 4 days ago

Thanks a Million for This HF Space!

1
#1 opened 4 days ago by
9voltfan2009
commented a paper 8 months ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 49 •
3
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略