I’m creating this topic just out of curiosity, to compare experiences.
After reading a lot about STT, I understand that doing it locally with DeepSpeech would be possible but complicated and, more importantly, would require powerful hardware, especially a good graphics card. My local server couldn’t do the job.
So honestly, I’m grateful to have a cloud STT service that is not named G or A or similar… And for free, on top of that (I’m not counting the $20 I paid to become a supporter; that’s nothing, and I wasn’t even required to do so). However, I have noticed delays in Mycroft’s response that seem to be due to the time it takes for the text to come back from the STT server. I can think of two possible causes:
(a) My limited bandwidth (1MB/s up on a good day)
(b) Delays on the STT server itself
… Or both.
I am talking about delays ranging from 1-2 seconds (perfectly acceptable) up to 10-12 seconds for the same command (e.g. simple things like “turn off office wall”).
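For what it’s worth, here is a rough back-of-the-envelope check of cause (a). It is only a sketch: I am assuming the recording is uploaded as uncompressed 16 kHz, 16-bit mono audio (I have not verified what Mycroft actually sends), and it ignores connection setup, round-trip latency and packet loss. The function name `upload_seconds` and its parameters are made up for illustration. Since “1MB/s” could be read as megabytes or megabits, it tries both.

```python
def upload_seconds(duration_s, uplink_bytes_per_s,
                   sample_rate_hz=16000, bytes_per_sample=2, channels=1):
    """Estimate the time to upload a raw PCM recording.

    The audio format defaults are assumptions, not confirmed Mycroft settings.
    """
    # Size of the uncompressed recording in bytes.
    audio_bytes = duration_s * sample_rate_hz * bytes_per_sample * channels
    # Pure transfer time; ignores latency, handshakes and retransmissions.
    return audio_bytes / uplink_bytes_per_s


if __name__ == "__main__":
    command_length_s = 3  # assumed length of a short spoken command
    uplinks = (("1 megabyte/s", 1_000_000), ("1 megabit/s", 125_000))
    for label, bytes_per_s in uplinks:
        t = upload_seconds(command_length_s, bytes_per_s)
        print(f"{label}: about {t:.2f} s to upload")
```

If those assumptions are anywhere near right, a 3-second command should upload in well under a second in either case, so raw bandwidth alone would probably not explain a 10-12 second wait. A flaky or high-latency connection could still change that picture.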
So… What’s your experience?