jacktol commited on
Commit
a5f1c1b
·
verified ·
1 Parent(s): 8e04c74

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -9
README.md CHANGED
@@ -33,6 +33,12 @@ model-index:
33
 
34
  # Whisper Medium EN Fine-Tuned for Air Traffic Control (ATC)
35
 
 
 
 
 
 
 
36
  ## Model Overview
37
 
38
  This model is a fine-tuned version of OpenAI's Whisper Medium EN model, specifically trained on **Air Traffic Control (ATC)** communication datasets. The fine-tuning process significantly improves transcription accuracy on domain-specific aviation communications, reducing the **Word Error Rate (WER) by 84%**, compared to the original pretrained model. The model is particularly effective at handling accent variations and ambiguous phrasing often encountered in ATC communications.
@@ -63,15 +69,6 @@ The fine-tuned Whisper model is designed for:
63
 
64
  You can test the model online using the [ATC Transcription Assistant](https://huggingface.co/spaces/jacktol/ATC-Transcription-Assistant), which lets you upload audio files and generate transcriptions.
65
 
66
- ## Model Description
67
-
68
- Whisper Medium EN fine-tuned for ATC is optimized to handle short, distinct transmissions between pilots and air traffic controllers. It is fine-tuned using data from the **[ATC Dataset](https://huggingface.co/datasets/jacktol/atc-dataset)**, a combined and cleaned dataset sourced from the following:
69
-
70
- - **[ATCO2 corpus](https://huggingface.co/datasets/Jzuluaga/atco2_corpus_1h)** (1-hour test subset)
71
- - **[UWB-ATCC corpus](https://huggingface.co/datasets/Jzuluaga/uwb_atcc)**
72
-
73
- The **ATC Dataset** merges these two original sources, filtering and refining the data to enhance transcription accuracy for domain-specific ATC communications.
74
-
75
  ## Training Procedure
76
 
77
  - **Hardware**: Fine-tuning was conducted on two A100 GPUs with 80GB memory.
 
33
 
34
  # Whisper Medium EN Fine-Tuned for Air Traffic Control (ATC)
35
 
36
+ ## Deprecation Notice (August 24, 2025)
37
+
38
+ This model is now **deprecated**. A newer and **larger, better-performing model** is available, achieving a **6.5% word error rate**, a significant improvement over the previous version (≈15.08% WER).
39
+
40
+ [View the updated model on Hugging Face](https://huggingface.co/jacktol/whisper_large_v3_finetuned_6.5_eval_wer)
41
+
42
  ## Model Overview
43
 
44
  This model is a fine-tuned version of OpenAI's Whisper Medium EN model, specifically trained on **Air Traffic Control (ATC)** communication datasets. The fine-tuning process significantly improves transcription accuracy on domain-specific aviation communications, reducing the **Word Error Rate (WER) by 84%**, compared to the original pretrained model. The model is particularly effective at handling accent variations and ambiguous phrasing often encountered in ATC communications.
 
69
 
70
  You can test the model online using the [ATC Transcription Assistant](https://huggingface.co/spaces/jacktol/ATC-Transcription-Assistant), which lets you upload audio files and generate transcriptions.
71
 
 
 
 
 
 
 
 
 
 
72
  ## Training Procedure
73
 
74
  - **Hardware**: Fine-tuning was conducted on two A100 GPUs with 80GB memory.