Skip to content

Latest commit

 

History

History
49 lines (26 loc) · 3.4 KB

README.md

File metadata and controls

49 lines (26 loc) · 3.4 KB

Teation — Dive into conversations

Teation is in early development. It was created in a couple days by Patrick Nomad as a hobby project. Some of its parts were quickly hacked together for an initial proof of concept. Refactoring is in the roadmap. Feel like contributing? Checkout the contribution note in the development guide.

Teation allows you to gain insight from thoughtful conversations.

Long discussions such as the ones found in podcasts are great. The participants and length of certain podcasts allow for conversations to develop into thought-provoking insights. However these insightful snippets are often hard to find — hidden within thousands of hours of conversation.

Teation uses artificial intelligence to understand conversations in order to find insightful information.

Getting Started

A hosted version of Teation can be found at teation.com.

Looking for development guides? Check out development.

How It Works

Teation uses a transformer retriever/reader pipeline in order to provide semantic search through an underlying question answering capability. When someone asks a question on the Teation website, the following events take place:

  • Retriever: Podcast transcriptions are searched using BM25. Top results are forwarded to the reader step. (in the future this will be switched for an embedded vector-based search).
  • Reader: Top results are then submitted to a language model for inference with the original question. The language model returns answers to the question. Each answer has a score, and the transcript source. The current language model is based on RoBERTa fine-tuned with SQuAD2.0 for question answering.

Top results from the reader undergoes normalization:

  • Alignment: Results are processed with a few adjustments required by this early proof of concept. Some of these adjustments include timing tuning, and adding/removing segments.
  • Response: The resulting answers are then sorted by score, combined by video source, and the segments are returned for frontend consumption.

The frontend is developed with SveleteKit and Tailwind CSS.

Caddy is used to proxy requests to SvelteKit, and Cloudflare as CDN.

Roadmap

So many possible changes! Check out the roadmap.

Documentation

Documentation is still a work in progress. A bit of refactoring is still required before writing getting started instructions for development. Adding more documentation is in the roadmap.

For now, some useful information can be found in the development and architecture pages.

Contributions

Feel like helping out? Check out the contribution notes.

Questions?

Feel free to message Patrick Nomad on Twitter, or open an issue on GitHub.