Voice transcription for Emacs using OpenAI’s Whisper API.

Requirements

Installation

Coming soon.

Clone this repository and add to your load-path:

(add-to-list 'load-path "/path/to/ok-whisper")
(require 'ok-whisper)

Set your OpenAI API key:

(setq ok-whisper--openai-token "your-api-key")

M-x ok-whisper--record-dwim: Start/stop recording (press once to start, again to stop)
M-x ok-whisper--transcribe-file: Transcribe an existing audio file

The transcription result will be inserted at point.

Define a shortcut:

(define-key global-map (kbd "C-x R") 'ok-whisper--record-dwim)

Currently only works on Linux systems with PulseAudio
Maximum recording length is 180 seconds by default (but can be configured with ok-whisper--max-recording-seconds
Requires an active internet connection
Uses OpenAI’s API (costs apply)

AGPL-3.0. See LICENSE.