Any tips for better speech recognition in linux? #1672

Open
40476 opened this issue Dec 2, 2024 · 3 comments

Comments

@40476

40476 commented Dec 2, 2024

I have a few problems I am looking to remedy, mainly speech accuracy when other voices are being played. In a quiet environment Vosk performs wonderfully, but when there is noise or someone else is talking it is absolutely unusable for my purpose, as I am using it for realtime STTS in voice chats with friends.

@nshmyrev
Collaborator

nshmyrev commented Dec 2, 2024

What is your language/accent? You could try something more modern like Whisper. It depends on many details, such as vocabulary. It is better to separate channels to avoid speech overlap. If the noise source is in your room, there are ways to isolate it. And so on.
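
One possible way to act on the isolation advice, assuming the other voices reach the microphone through speakers and the system runs PulseAudio (or PipeWire's PulseAudio layer), is to load `module-echo-cancel`, which exposes a virtual source with speaker playback filtered out. This is a sketch under those assumptions, not something proposed in the thread; the names `ec_mic` and `ec_out` are made up for illustration.

```shell
#!/bin/bash
# Sketch only: echo cancellation as one way to keep played-back voices out of the mic.
# Assumes PulseAudio or pipewire-pulse; ec_mic and ec_out are hypothetical names.
EC_SOURCE="ec_mic"
EC_SINK="ec_out"

# Print the module name and arguments passed to `pactl load-module`.
ec_module_args() {
    printf 'module-echo-cancel aec_method=webrtc source_name=%s sink_name=%s\n' "$1" "$2"
}

# Load the module and make the filtered source the default capture device.
setup_echo_cancel() {
    # Unquoted on purpose so the arguments are split into separate words.
    pactl load-module $(ec_module_args "$EC_SOURCE" "$EC_SINK") &&
        pactl set-default-source "$EC_SOURCE"
}

# Nothing is changed unless the script is called with --apply.
if [ "${1:-}" = "--apply" ]; then
    setup_echo_cancel
fi
```

For this to help, the recorder would have to read from `ec_mic` rather than the raw ALSA device, and the voice chat would typically have to play through `ec_out` so the canceller has a reference signal.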

@40476
Author

40476 commented Dec 2, 2024

I would say about Midwestern. I am using sprec, which uses Vosk and outputs to the terminal. Here is my script:

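#!/bin/bash
notify-send "please wait"
# Record 16 kHz mono 16-bit audio, transcribe it with sprec, keep only the text of
# final results, then speak it on the TTS_voice device and the default output and log it.
arecord -q --device front:CARD=U0x46d0x825,DEV=0 -fS16_LE -c1 -r16000 | sprec | grep -oP "final 1: \K.*" | tee >(espeak-ng -d TTS_voice) >(espeak-ng) >(systemd-cat -t eon-speak) &
sleep 3
notify-send "start speaking!"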

I am using one of the GigaSpeech models on my system, since processing overhead is not an issue for me.
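
The filtering step in the script above can be tried in isolation. The sketch below wraps the same `grep -oP` pattern in a function and adds GNU grep's `--line-buffered` flag, since grep may hold output back when writing to a pipe, which matters for realtime use. The function name and the sample input lines are invented for illustration; only the `final 1: ` prefix comes from the script.

```shell
#!/bin/bash
# Sketch: the script's filter as a reusable function (requires GNU grep with -P).
# extract_final is a hypothetical name; the pattern is the one from the script.
extract_final() {
    # \K drops the "final 1: " prefix from the match, so only the text is printed.
    # --line-buffered flushes each result immediately instead of waiting for a full buffer.
    grep -oP --line-buffered 'final 1: \K.*'
}

# Made-up sample input: only the second line carries the prefix the pattern expects.
printf 'partial 1: hello\nfinal 1: hello world\n' | extract_final
```

With the made-up input, only `hello world` passes the filter.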

@nshmyrev
Collaborator

nshmyrev commented Dec 2, 2024

If you installed a small model as the sprec README suggests, you can also try a bigger model. You can also try Whisper; it is much more accurate than Vosk for English. Vosk has very specific use cases these days.
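
As a rough sketch of what trying Whisper could look like, the loop below records short chunks with `arecord` and transcribes each one with the `whisper` command from the openai-whisper package. It assumes that package is installed, that the default capture device is the right microphone, and that the installed version names the transcript after the input file without its extension. The model, chunk length, and working directory are arbitrary choices, and recording pauses while each chunk is transcribed, so this is not a true streaming setup.

```shell
#!/bin/bash
# Sketch only: chunked transcription with the openai-whisper CLI (assumed installed).
MODEL="small.en"                          # assumption: any available model name works here
CHUNK_SECONDS=5                           # assumption: arbitrary chunk length
WORKDIR="${TMPDIR:-/tmp}/whisper-chunks"  # hypothetical scratch directory

# Print where the transcript for a given WAV file is expected to be written.
# Assumes the output is named <basename>.txt; older whisper versions may differ.
transcript_path() {
    printf '%s/%s.txt\n' "$WORKDIR" "$(basename "$1" .wav)"
}

# Record a chunk, transcribe it, print the text, repeat until arecord fails.
transcribe_loop() {
    mkdir -p "$WORKDIR"
    n=0
    while :; do
        wav="$WORKDIR/chunk-$n.wav"
        arecord -q -d "$CHUNK_SECONDS" -f S16_LE -c1 -r16000 "$wav" || break
        whisper "$wav" --model "$MODEL" --language en \
            --output_format txt --output_dir "$WORKDIR" >/dev/null 2>&1
        cat "$(transcript_path "$wav")"
        n=$((n + 1))
    done
}

# Nothing is recorded unless the script is called with --run.
if [ "${1:-}" = "--run" ]; then
    transcribe_loop
fi
```

The printed text could be piped into the same `tee` and `espeak-ng` stage the original script uses.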
