GitHub - 200ok-ch/ok-whisper: Voice transcription for Emacs using OpenAI's Whisper API.

Voice transcription for Emacs using OpenAI’s Whisper API.

Requirements

Emacs 27.1 or later
ffmpeg
PulseAudio (currently only supports Linux systems with PulseAudio)
OpenAI API key

Installation

MELPA

Coming soon.

Manual

Clone this repository and add to your load-path:

(add-to-list 'load-path "/path/to/ok-whisper")
(require 'ok-whisper)

Configuration

Set your OpenAI API key:

(setq ok-whisper--openai-token "your-api-key")

Usage

M-x ok-whisper--record-dwim: Start/stop recording (press once to start, again to stop)
M-x ok-whisper--transcribe-file: Transcribe an existing audio file

The transcription result will be inserted at point.

Define a shortcut:

(define-key global-map (kbd "C-x R") 'ok-whisper--record-dwim)

Limitations

Currently only works on Linux systems with PulseAudio
Maximum recording length is 180 seconds by default (but can be configured with ok-whisper--max-recording-seconds
Requires an active internet connection
Uses OpenAI’s API (costs apply)

License

AGPL-3.0. See LICENSE.

Credits

Inspired by Simon Safar’s blog post.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.org		README.org
ok-whisper.el		ok-whisper.el

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Requirements

Installation

MELPA

Manual

Configuration

Usage

Limitations

License

Credits

About

Releases

Packages

Languages

License

200ok-ch/ok-whisper

Folders and files

Latest commit

History

Repository files navigation

Requirements

Installation

MELPA

Manual

Configuration

Usage

Limitations

License

Credits

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages