🚀 Introducing MGM-Omni, an omni-chatbot that processes text, image, video, and speech inputs and generates both text and speech responses.
👂 MGM-Omni supports hour-level audio understanding.
🗣️ MGM-Omni supports 10-minute speech generation and voice cloning.
For more details, please check:
📝 Blog: https://mgm-omni.notion.site/MGM-Omni-An-Open-source-Omni-Chatbot-2395728e0b0180149ac9f24683fc9907
🌟 Code: https://github.com/dvlab-research/MGM-Omni
🤖 Model: wcy1122/mgm-omni-6896075e97317a88825032e1
🎮 Demo: wcy1122/MGM-Omni
MGM-Omni Collection: An open-source Omni Chatbot for Long Audio and Voice Clone • 12 items