Introduction

Veronese is an audio-to-text platform for creators. You bring the audio — Veronese turns it into an editable transcript you can polish and publish anywhere.

What Veronese does

Ingest — Send audio to Veronese. Upload a file, paste a YouTube URL, forward a podcast feed, send via email, or drop a voice note from Telegram.
Transcribe — Veronese normalizes the audio with FFmpeg and transcribes it using Whisper or Fireworks AI.
Edit — The transcript lands in a rich-text editor. You own every word.
Export — Export as Markdown, plain text, or copy directly to your writing tool of choice.

Key concepts

Episodes

An episode is the core unit of work in Veronese. Each episode holds one audio recording and its full lifecycle — ingestion state, transcript, and editable content.

Series

Episodes are grouped into series (think: a podcast, a notebook, a project). Every episode must belong to a series. Series keep your work organized and make bulk exports easy.

Transcripts

When Veronese finishes transcribing an episode, it stores two versions:

raw_text — the immutable machine output, never overwritten after first generation.
content — the user-editable version, seeded from raw_text once. Edit freely.

Credits

Transcription is metered in seconds of audio. Your account has a free credit pool (granted at signup) and an optional paid pool (purchased via top-up packages).

Ingestion channels

Channel	How it works
Web upload	Drag and drop an audio file in the browser
YouTube / URL	Paste a YouTube, X (Twitter), or podcast URL
Email	Send an audio attachment to your personal upload address
Telegram	Drop a voice note to the Veronese bot

Next step

Create your first episode →