lmrescore failure and missing Gr.fst when run the training/run.sh #1668

dyustc · 2024-11-28T11:25:27Z

Hi, dear authors,

I followed the training recipe in the vosk-api/training folder, I did get a trained model, but the performance is not so good.
Also I found the following errors and some mismatch in model structures from the pretrained models. I wondered if there is something I did wrong.
I used the recent official kaldi repo and installed it with cuda on successfully.

in the decode stage, steps/lmrescore_const_arpa.sh would fail, and here is the log(maybe there needs to be a specfic version of kaldi?) But there is a WER result at last. I guess just the rescored version failed.
I intended to get a model structure similar to "vosk-model-en-us-0.22-lgraph", but there is some difference. this is my exp/chain/tdnn folder.

First, compared to the pretrained models, I got a HCLG.fst, but not a HCLr.fst and Gr.fst, supposed I need a runtime graph.
Secondly, I don't find the model.conf file, I tried to collect all the params during training, but maybe not enough. So I just copied the one from "vosk-model-en-us-0.22-lgraph", it worked, but not sure it fits right into my own trained model.
I map the exp/chain/extractor folder to ivector folder, not sure it works, but the files are similar.

From the results I got, I run the python script, test_simple.py, all the output words are upper case also not very precise(since I just run the demo run.sh, the training data couldn't be sufficient, so maybe this is possible, I can attach the audio if necessary, it's a good quality speech with decent pronunciation ), and I got a warning, runtime graphs are not supported, as I mentioned above.

So could you help with this? Am I missing some steps in training or there is some twist I should do after training?
Many thanks~

nshmyrev · 2024-11-29T15:11:52Z

in the decode stage, steps/lmrescore_const_arpa.sh would fail, and here is the log(maybe there needs to be a specfic version of kaldi?) But there is a WER result at last. I guess just the rescored version failed.

bad option --project_output means you have openfst version mismatch. We recommend to use our branches for training, they have version mismatch fixes.

First, compared to the pretrained models, I got a HCLG.fst, but not a HCLr.fst and Gr.fst, supposed I need a runtime graph.

You run mkgraph_lookahead.sh script to make dynamic graph instead of static

Secondly, I don't find the model.conf file, I tried to collect all the params during training, but maybe not enough. So I just copied the one from "vosk-model-en-us-0.22-lgraph", it worked, but not sure it fits right into my own trained model.

It is ok, you can copy existing one

all the output words are upper case also not very precise

Accurate model requires a lot of training data, not sure how much did you use and what language was it

dyustc · 2024-12-10T09:34:26Z

hi, @nshmyrev , thanks for the quick response, but I get stucked in a cuda mismatch problem when I tried to install kaldi. I tried either 'main' or 'vosk' branches in
alphakaldi, there is a cuda bug in setup for kaldi/src folder. I am using NVCC version 12.6,

Cuda compilation tools, release 12.6, V12.6.68
Build cuda_12.6.r12.6/compiler.34714021_0

It crashes in makefiles setup all the time, while the latest official kaldi would pass, so this stands in the way of me taking the steps you suggested above.
is this another tools version mismatch problem

nshmyrev · 2024-12-10T09:37:16Z

Hm, something with new cuda. I need to update the codebase then.

dyustc changed the title ~~lmrescore failure and HCLG.fst~~ lmrescore failure and missing Gr.fst when run the training/run.sh Nov 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lmrescore failure and missing Gr.fst when run the training/run.sh #1668

lmrescore failure and missing Gr.fst when run the training/run.sh #1668

dyustc commented Nov 28, 2024

nshmyrev commented Nov 29, 2024

dyustc commented Dec 10, 2024

nshmyrev commented Dec 10, 2024

lmrescore failure and missing Gr.fst when run the training/run.sh #1668

lmrescore failure and missing Gr.fst when run the training/run.sh #1668

Comments

dyustc commented Nov 28, 2024

nshmyrev commented Nov 29, 2024

dyustc commented Dec 10, 2024

nshmyrev commented Dec 10, 2024