On 7/5/21, Karl Semich <0xloem@gmail.com> wrote:
https://github.com/mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
If this thing is already well trained on English, needing not much more than reading maybe ten pages or so worth of a training book that it already knows and that comes with the docs, to get it 90+% tuned to a specific voice... it could be very useful for people needing handsfree dictation, thought capture, transcription, voice message conversion, etc. What other tools are out there in freeware to do speech to text? Did Google ever release its voicemail convertor? People have said that service sucks due to nature of needing to be trained for all callers, not to just one. Doubt any of them would be able to pull out whispers without it being trained on whispers for which there isn't much public corpus, and further tuned by a specific whisperer which is even rarer.