Update README.md
README.md
CHANGED
@@ -27,6 +27,36 @@ base_model:
 - nvidia/parakeet-tdt-0.6b-v2
 ---
 
-Parakeet TDT 0.6B V2
 
-
+# Parakeet TDT 0.6B V2 - CoreML
 
+This is a CoreML-optimized version of NVIDIA's Parakeet TDT 0.6B V2 model, designed for high-performance automatic speech recognition on Apple platforms.
+
+## Model Description
+
+Models will continue to evolve as we optimize performance and accuracy. This model has been converted to CoreML format for efficient on-device inference on Apple Silicon and iOS devices, enabling real-time speech recognition with
+a minimal memory footprint.
+
+## Usage in Swift
+
+See the [FluidAudio repository](https://github.com/FluidInference/FluidAudioSwift) for instructions.
+
+## Performance
+
+- Real-time factor: < 0.3x on M1 Pro
+- Memory usage: ~800 MB peak
+- Supported platforms: macOS 14+, iOS 17+
+- Optimized for: Apple Silicon
+
+## Model Details
+
+- Architecture: FastConformer-TDT
+- Parameters: 0.6B
+- Sample rate: 16 kHz
+
+## License
+
+This model is released under the CC-BY-4.0 license. See the LICENSE file for details.
+
+## Acknowledgments
+
+Based on NVIDIA's Parakeet TDT model. CoreML conversion and Swift integration by the FluidInference team.
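
Note on the Performance figures in this change: assuming the usual definition, the real-time factor (RTF) is processing time divided by audio duration, so an RTF below 0.3x would mean a 60-second clip takes under 18 seconds to transcribe. The sketch below only illustrates that arithmetic. The `transcribe` callable is a hypothetical stand-in for whatever inference call is used; it is not part of FluidAudio or CoreML, and the 16 kHz constant is taken from the Model Details list.

```python
import time
from typing import Callable, Sequence

SAMPLE_RATE_HZ = 16_000  # sample rate stated under Model Details


def audio_duration_seconds(num_samples: int, sample_rate_hz: int = SAMPLE_RATE_HZ) -> float:
    """Duration of a mono clip given its sample count."""
    return num_samples / sample_rate_hz


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """RTF = processing time / audio duration; below 1.0 is faster than real time."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def measure_rtf(transcribe: Callable[[Sequence[float]], str], samples: Sequence[float]) -> float:
    """Time a hypothetical `transcribe` callable and return its RTF."""
    start = time.perf_counter()
    transcribe(samples)  # assumed to run inference synchronously
    elapsed = time.perf_counter() - start
    return real_time_factor(elapsed, audio_duration_seconds(len(samples)))
```

When comparing against the stated figure, it is worth measuring on a clip of at least several seconds and after a warm-up run, since a first call may include one-time model loading cost that would inflate the ratio.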