Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Guilherme34
/
Samantha-omni
like
1
Any-to-Any
Transformers
Safetensors
openbmb/RLAIF-V-Dataset
multilingual
minicpmo
feature-extraction
minicpm-o
omni
vision
ocr
multi-image
video
custom_code
audio
speech
voice cloning
live Streaming
realtime speech conversation
asr
tts
arxiv:
2405.17220
arxiv:
2408.01800
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Samantha-omni
/
assets
/
input_examples
Ctrl+K
Ctrl+K
1 contributor
History:
11 commits
Guilherme34
Upload assets/input_examples/indian-accent.wav with huggingface_hub
471cf33
verified
2 days ago
Trump_WEF_2018_10s.mp3
Safe
161 kB
LFS
Upload assets/input_examples/Trump_WEF_2018_10s.mp3 with huggingface_hub
2 days ago
assistant_default_female_voice.wav
Safe
224 kB
LFS
Upload assets/input_examples/assistant_default_female_voice.wav with huggingface_hub
2 days ago
assistant_female_voice.wav
Safe
235 kB
LFS
Upload assets/input_examples/assistant_female_voice.wav with huggingface_hub
2 days ago
assistant_male_voice.wav
Safe
144 kB
LFS
Upload assets/input_examples/assistant_male_voice.wav with huggingface_hub
2 days ago
audio_understanding.mp3
Safe
321 kB
LFS
Upload assets/input_examples/audio_understanding.mp3 with huggingface_hub
2 days ago
chi-english-1.wav
Safe
492 kB
LFS
Upload assets/input_examples/chi-english-1.wav with huggingface_hub
2 days ago
cxk_original.wav
Safe
384 kB
LFS
Upload assets/input_examples/cxk_original.wav with huggingface_hub
2 days ago
exciting-emotion.wav
Safe
696 kB
LFS
Upload assets/input_examples/exciting-emotion.wav with huggingface_hub
2 days ago
fast-pace.wav
Safe
986 kB
LFS
Upload assets/input_examples/fast-pace.wav with huggingface_hub
2 days ago
icl_20.wav
Safe
619 kB
LFS
Upload assets/input_examples/icl_20.wav with huggingface_hub
2 days ago
indian-accent.wav
Safe
1.41 MB
LFS
Upload assets/input_examples/indian-accent.wav with huggingface_hub
2 days ago