Step 1:
kaldi-asr –align –language fi –audio-file input.wav –textfile input.txt -output-format txt|eaf|srt –out aligned text
textfile: only text, no speaker info
- Ansible script to install kaldi-asr
- first install kaldi
- copy/link bin steps utils
- models from Juho in zip file: kaldi-version as part of filename
- add small test after install to check installation
- When installation via Ansible works:
- add ”–align”: kaldi-asr needs work, maybe implement as Ansible template
- Order of languages:
- fin
- Then: swe,eng,est
- Later: saami, komi, etc
Timeline: new models ~18.6.
Later:
kaldi-asr –align –language sv –number-of-speakers N –audio-file input.wav –textfile input.txt -output-format txt|eaf|srt –out aligned text