bweng committed on
Commit a95894b · verified · 1 Parent(s): 0cedc4b

Update README.md

  1. README.md +32 -2
README.md CHANGED
@@ -27,6 +27,36 @@ base_model:
  - nvidia/parakeet-tdt-0.6b-v2
  ---
 
- Parakeet TDT 0.6B V2 (En)
 
- Work in progress, follow this [repo](https://github.com/FluidInference/FluidAudio) for updates, models will continue to change as we tune them
+ # Parakeet TDT 0.6B V2 - CoreML
 
+ This is a CoreML-optimized version of NVIDIA's Parakeet TDT 0.6B V2 model, designed for high-performance automatic speech recognition on Apple platforms.
+
+ ## Model Description
+
+ Models will continue to evolve as we optimize performance and accuracy. This model has been converted to CoreML format for efficient on-device inference on Apple Silicon and iOS devices, enabling real-time speech recognition with
+ minimal memory footprint.
+
+ ## Usage in Swift
+
+ See the [FluidAudio repository](https://github.com/FluidInference/FluidAudioSwift) for instructions.
+
+ ## Performance
+
+ - Real-time factor: < 0.3x on M1 Pro
+ - Memory usage: ~800MB peak
+ - Supported platforms: macOS 14+, iOS 17+
+ - Optimized for: Apple Silicon
+
+ ## Model Details
+
+ - Architecture: FastConformer-TDT
+ - Parameters: 0.6B
+ - Sample rate: 16kHz
+
+ ## License
+
+ This model is released under the CC-BY-4.0 license. See the LICENSE file for details.
+
+ ## Acknowledgments
+
+ Based on NVIDIA's Parakeet TDT model. CoreML conversion and Swift integration by the FluidInference team.
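
Note on the Performance figures in this diff: the README does not define "real-time factor". A common convention, assumed here, is processing time divided by audio duration, so values below 1.0 mean faster than real time. The sketch below only illustrates that arithmetic and the sample count implied by the 16 kHz sample rate. The function names and the example timings are hypothetical and are not taken from the model or from the FluidAudio code.

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Processing time divided by audio duration (assumed convention; lower is faster)."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def samples_for(audio_seconds: float, sample_rate_hz: int = 16_000) -> int:
    """Number of mono samples in a clip at the given sample rate."""
    return round(audio_seconds * sample_rate_hz)


# Hypothetical measurement: a 60 s clip transcribed in 15 s.
rtf = real_time_factor(processing_seconds=15.0, audio_seconds=60.0)
print(f"RTF = {rtf:.2f}")   # 15 / 60
print(samples_for(60.0))    # 60 s of audio at 16 kHz
```

Under this convention, a figure of "< 0.3x" would mean a 60-second clip finishes in under 18 seconds. If the project instead reports a speed-up factor (audio duration divided by processing time), the ratio is simply inverted.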