Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Have used VOSK a bit recently. The out-of-the-box experience was great compared to earlier projects (looking at you Kaldi and Sphinx...). Word-level audio segmentation was one usecase, https://stackoverflow.com/a/65370463/1967571


Vosk is built on Kaldi.


Kdenlive supports automatic subtitles created with VOSK now btw. This makes it a lot more accessible for non-tech folks.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: