skit-ai
/

speechllm-2B

Feature Extraction

speech-language

Model card Files Files and versions

shangeth commited on Jun 4, 2024

Commit

183a43c

·

verified ·

1 Parent(s): 1b27420

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -57,6 +57,14 @@ model-index:
 # SpeechLLM
 ## Usage
 ```python
 # Load model directly from huggingface

 # SpeechLLM
+SpeechLLM is a multi-modal LLM trained to predict the metadata of the speaker's turn in a conversation. SpeechLLM model is based on HubertX acoustic encoder and TinyLlama LLM. The model predicts the following:
+1. Speech Activity
+2. ASR Transcript
+3. Gender of the speaker
+4. Age of the speaker
+5. Accent of the speaker
+6. Emotion of the speaker
 ## Usage
 ```python
 # Load model directly from huggingface