Introduction
Veronese is an audio-to-text platform for creators. You bring the audio — Veronese turns it into an editable transcript you can polish and publish anywhere.
What Veronese does
Section titled “What Veronese does”- Ingest — Send audio to Veronese. Upload a file, paste a YouTube URL, forward a podcast feed, send via email, or drop a voice note from Telegram.
- Transcribe — Veronese normalizes the audio with FFmpeg and transcribes it using Whisper or Fireworks AI.
- Edit — The transcript lands in a rich-text editor. You own every word.
- Export — Export as Markdown, plain text, or copy directly to your writing tool of choice.
Key concepts
Section titled “Key concepts”Episodes
Section titled “Episodes”An episode is the core unit of work in Veronese. Each episode holds one audio recording and its full lifecycle — ingestion state, transcript, and editable content.
Series
Section titled “Series”Episodes are grouped into series (think: a podcast, a notebook, a project). Every episode must belong to a series. Series keep your work organized and make bulk exports easy.
Transcripts
Section titled “Transcripts”When Veronese finishes transcribing an episode, it stores two versions:
raw_text— the immutable machine output, never overwritten after first generation.content— the user-editable version, seeded fromraw_textonce. Edit freely.
Credits
Section titled “Credits”Transcription is metered in seconds of audio. Your account has a free credit pool (granted at signup) and an optional paid pool (purchased via top-up packages).
Ingestion channels
Section titled “Ingestion channels”| Channel | How it works |
|---|---|
| Web upload | Drag and drop an audio file in the browser |
| YouTube / URL | Paste a YouTube, X (Twitter), or podcast URL |
| Send an audio attachment to your personal upload address | |
| Telegram | Drop a voice note to the Veronese bot |