You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
i have a few problems i am looking to remedy. mainly my problem with speech accuracy when other voices are being played. in a quiet environment vosk performs wonderfully, but when there is noise or someone else talking it is absolutely unusable for my purpose, as i am using it for realtime STTS in voice chats with friends.
The text was updated successfully, but these errors were encountered:
What is your language/accent? You can probably try something modern like Whisper. It depends on many details - vocabulary, etc. It is better to separate channels to avoid speech overlap. If noise source is in your room, there are ways to isolate that. And so on.
If you installed small model like sprec readme suggest, you can also try bigger model. Also you can try whisper, it is much more accurate than Vosk for English. Vosk has very specific usecases these days.
i have a few problems i am looking to remedy. mainly my problem with speech accuracy when other voices are being played. in a quiet environment vosk performs wonderfully, but when there is noise or someone else talking it is absolutely unusable for my purpose, as i am using it for realtime STTS in voice chats with friends.
The text was updated successfully, but these errors were encountered: