This demo shows how to match the current playback position of an audio element with words on a page, highlighting each word as it is spoken. The match uses a JSON representation of each word, along with its start and end times.
Try it: https://propublica.github.io/transcript-audio-sync/demo/
The transcript is represented in JSON as an array of words, where each word is an object:
{
  "word": "Please",   // The word as it appears in the transcript
  "speaker": "John",  // The person speaking
  "start": 0.55,      // Start time of the word, in seconds
  "end": 0.91,        // End time of the word, in seconds
  "id": 0             // Unique id matching the word's HTML element
}
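As a rough sketch (not the demo's actual code), the transcript array can be fetched and searched for the word being spoken at a given playback time. The file name transcript.json and the function names below are placeholders:

// Sketch: load a transcript array and look up the word active at a given time.
// "transcript.json" and these function names are illustrative, not the demo's API.
async function loadTranscript(url) {
  const response = await fetch(url);
  return response.json(); // resolves to an array of word objects as described above
}

// Return the word whose [start, end] interval contains `time`,
// or null if the time falls between words.
function wordAtTime(transcript, time) {
  return transcript.find((w) => time >= w.start && time <= w.end) || null;
}

// Example: wordAtTime(transcript, 0.7) would return the "Please" object above.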
Words are represented in the HTML as a span with a unique id matching the id in the JSON:
<span class="word" word-id="0">Please</span>
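To see how the JSON and HTML tie together, here is a sketch (again, not the demo's code) that listens for the audio element's timeupdate event and moves a highlight class to the span whose word-id matches the word currently being spoken. The element id "audio", the class name "highlight", and the wordAtTime helper sketched above are all assumptions:

// Sketch: keep the highlighted span in sync with audio playback.
// The "audio" element id and "highlight" class name are assumptions.
const audio = document.getElementById("audio");
let highlighted = null;

audio.addEventListener("timeupdate", () => {
  const current = wordAtTime(transcript, audio.currentTime);
  const span = current ? document.querySelector(`[word-id="${current.id}"]`) : null;
  if (span === highlighted) return;           // no change since the last tick
  if (highlighted) highlighted.classList.remove("highlight");
  if (span) span.classList.add("highlight");
  highlighted = span;
});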
To highlight individual words, each word in the HTML is wrapped in a span. However, wrapping words in spans causes screen readers like VoiceOver to read each word individually instead of as part of a sentence. To create a good experience for all users, we use aria-hidden="true" to hide the span-wrapped transcript from screen readers and provide a screen reader-friendly version of the content below it.
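One way to produce both versions from the same transcript data is sketched below; the sr-only class (visually hidden via CSS but still read by screen readers) and the container element are assumptions rather than part of the demo:

// Sketch: render a span-wrapped transcript for highlighting and a plain-text
// copy for screen readers. The class names and container are assumptions.
function renderTranscript(transcript, container) {
  const visual = document.createElement("p");
  visual.setAttribute("aria-hidden", "true"); // hidden from screen readers
  transcript.forEach((w) => {
    const span = document.createElement("span");
    span.className = "word";
    span.setAttribute("word-id", w.id);
    span.textContent = w.word + " ";
    visual.appendChild(span);
  });

  const readable = document.createElement("p");
  readable.className = "sr-only"; // visually hidden with CSS, read as normal prose
  readable.textContent = transcript.map((w) => w.word).join(" ");

  container.appendChild(visual);
  container.appendChild(readable);
}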
- Audio file, audio.mp3: Weinberger, S. (2013). Speech accent archive. George Mason University. (https://dagshub.com/kinkusuma/speech-accent-archive/src/master/recordings/english26.mp3)