Reza2kn commited on
Commit
2eeee5e
·
verified ·
1 Parent(s): 74c46f8

Add staged smart-init model card

Browse files
Files changed (1) hide show
  1. README.md +44 -0
README.md ADDED
@@ -0,0 +1,44 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-4.0
3
+ tags:
4
+ - automatic-speech-recognition
5
+ - persian
6
+ - farsi
7
+ - nemo
8
+ - canary
9
+ language:
10
+ - fa
11
+ - en
12
+ base_model: nvidia/canary-180m-flash
13
+ ---
14
+
15
+ # Persian-heavy Canary 180M staged smart-init ASR
16
+
17
+ Experimental ASR-only adaptation of `nvidia/canary-180m-flash` for Persian-first bilingual ASR.
18
+
19
+ This checkpoint uses the newer staged adaptation path:
20
+
21
+ - Fresh Persian-heavy bilingual SentencePiece tokenizer.
22
+ - Smart vocabulary/embedding initialization from the original Canary tokenizer where possible.
23
+ - Stage 1: decoder/head adaptation with the encoder frozen.
24
+ - Stage 2: full-parameter continuation with a short initial encoder freeze.
25
+
26
+ Data mix:
27
+
28
+ - Persian: `Reza2kn/persian-asr-semi-clean-31h-awq-wer` selected/cleaned audio+text only.
29
+ - English: small FLEURS retention slice.
30
+ - Train split: 46,006 rows, about 31.742 hours.
31
+ - Validation split: 938 rows, about 0.652 hours.
32
+
33
+ Validation on the internal portable held-out split:
34
+
35
+ - Rows: 938
36
+ - WER: 0.341208 (34.12%)
37
+ - CER: 0.195946 (19.59%)
38
+
39
+ Artifact:
40
+
41
+ - `canary_180m_persian_semiclean31_staged_smart_gpu1_bd700.nemo`
42
+ - SHA256: `77fe2c46c30a507440b7129bb2efbb8e9b0e18622346509c7c46e99af16adb49`
43
+
44
+ This is still a research checkpoint, not yet an Android/CoreML/ONNX export. The earlier non-smart-init semiclean31 checkpoint was much worse, around 102% WER; this staged smart-init checkpoint is the first run where the adaptation is clearly learning.