-
-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
rpi client with extern sepia (stt) server cannot work without internet #228
Comments
Wake-word is always fully offline and should not be affected by the choice of the
I often use a gain of 5-10, depending on the microphone. The quality of your STT results depends heavily on the microphone and your distance to the device unfortunately. That is usually why it works better on your phone. Googles servers seem to use some unknown black-magic to get crazy good results, idk how they do it ^^, but I'm always working on improving the open-source systems.
Only for the |
hmm... but why are the rate for right sentences with native much better than with sepia. i didnt changed anything else. and also chrome should use the same stt engine from the server. |
you mean the actual transcription? Or the wake-word? For the wake-word I can't see any logical reason right now, for the transcription the Google service is just better :-/. They probably trained their system on thousands of hours of audio from a very large number of different devices and microphone setups (presumably a large part of their data is "illegally" recorded without user consent or knowledge).
Yes, Google can directly access the audio interface and optimize everything, SEPIA has to work with the official browser APIs. There is still room for improvements though, because currently I destroy and recreate the audio interface after the wake-word trigger for compatibility reasons. |
yes, the transcription is bad on "sepia engine". the wakeword is working.
so the stt with native-engine is done via google/chromium?
ah ok, so i will get maybe a faster response when i use a rpi4. but just for a client the rpi4 is too overpowerd 😄 at least for me. |
The short answer: On the Raspberry Pi DIY client "native" is the Chromium implementation of Web Speech API that will access Google servers, but: Currently the consensus in open-source speech recognition is that you either need a lot of compute power or have to work with limited vocabulary and language models to get a good accuracy. This is especially true for all non-english languages, including German :-/. My goal for 2023 is to do 1-2 larger SEPIA STT server updates with specialized language models and new engines for more powerful devices 🤞🙂.
Yes. I have plans for smaller clients aswell that will work on even smaller chips, but right now the priority is improving open-source STT. |
hey florian,
last week i had no internet cause they renew the street in my city.
so i thought thats the perfect case to test my smart home / voice assistant, since it should work complete offline.
my sepia server with stt is on a proxmox vm, also my home assistant instance. everthing is reachable within my network.
sepia on my phones worked as accpected but not my rpi client. i allways get this info in the client-connection remote terminal:
Broadcaster event: {"broadcast":{"client":"raspi_chrome_app_v0.25.0","deviceId":"raspi","sepia-speech":{"type":"asr_error","msg":"no connection to server"}}}
so i looked up my settings.js file and there i have this config for the asr:
and i looked at the wiki, i changed my config to:
this worked. no internet and a working voice assistant. but now my wakeword rate is very poor and the stt rating is horrible.
i have to speak commands more than 2 times till it get what i mean.
the usability with asr-engine at "native" worked so much better as with "sepia".
but with "native" i cannot use it without internet.
on my smartphones i have this config:
do you know what happens here?
does need the chromium version a internet connection?
The text was updated successfully, but these errors were encountered: