Skip to main content
This guide explains how to create a high-quality professional voice clone. Follow each step carefully to ensure optimal results.

Recording Requirements

Recording Length60–90 minutes of continuous speech
ContentNatural, live monologue about any topic + reading script and characters
SpeakerOnly one speaker — no interruptions or overlapping voices
Audio QualityClear, consistent, no background noise or echo. Any setup including newer smartphones works.
EnvironmentQuiet space with a microphone
File Format.mp3 or .wav
SubmissionUpload to your dedicated Slack channel. If you don’t have one, contact the team.
1

Recording Environment

Your recording environment has a significant impact on voice clone quality. Choose your setup carefully.
Recommended setups:
  • Phone booth
  • Meeting room
  • Sound-treated space
2

Minimum Checklist

Before you start recording, confirm the following:
  • At least 60 minutes of recording
  • Only one speaker throughout
  • Consistent audio quality
  • File format is .mp3 or .wav
3

Conversational Recording

Duration: ~50 minutesRecord natural, unscripted speech. It doesn’t matter what you say — just keep it natural and tell stories. Do not read from a script.Suggested topics:
  • Vacation experiences
  • Childhood and upbringing
  • Food and preferences
  • Hobbies
  • Daily routines
4

Script Recording

This section ensures pronunciation consistency and structured speech coverage.General instructions:
  • Speak only the agent’s lines
  • Pause ~1 second after each line
  • Read customer text silently — do not speak it aloud
  • Maintain a friendly, professional tone
1
Agent: “Good day, this is Anna Weber from Müller Immobilien GmbH. Am I speaking with Mr. Mustermann?”(Pause)
2
Agent: “Exactly. A few days ago, you showed interest via our online listing for the 3-room apartment in Prenzlauer Berg. I wanted to ask if you would have time to schedule an appointment?”(Pause)
3
Agent: “Great. I still have appointments this week on Wednesday at 4 p.m. or on Friday at 10 a.m. Would either of those times work for you?”(Pause)
4
Agent: “Friday at 10 a.m. is currently best, because there are more viewings scheduled afterward. If you prefer, we can move the appointment to 11 a.m. – would that be okay?”(Pause)
5
Agent: “Great, then the appointment is set for Friday, June 6 at 11 a.m. The address is Schönhauser Allee 45, Prenzlauer Berg. I’ll send you a confirmation by email shortly. Do you have any other questions about the apartment?”(Pause)
6
Agent: “The monthly utility costs are approximately 250 euros, including heating and water supply. There isn’t a private parking space in the building, but there’s a parking garage on the street, where we can help you arrange a spot if you like. Is that okay for you?”(Pause)
7
Agent: “The deposit amounts to one month’s rent. Move-in could be as early as July 1st, if everything works out and you decide to proceed. I’ll summarize all of this in the email. If anything is still unclear afterward, you’re welcome to call me anytime.”(Pause)
8
Agent: “You’re very welcome. See you Friday, Mr. Mustermann. Have a nice day!”(Pause)
Alphabet
  • “A … B … C”
  • “A as in Alpha … Z as in Zulu”
Tone: Friendly
Number Sequences
  • 0 to 30
  • “1,234,567”
  • “4,999.99 €”
Say once slowly and clearly, once naturally.
Date & Time
  • “May 28, 2025, 4:30 p.m.”
Say once slowly and clearly, once naturally.
Phone & ZIP Codes
  • “+49 30 8899 1122”
  • “0800 123 45 67”
  • “10115”
Say once slowly and clearly, once naturally.
Special Characters
SymbolSay
@at
#hashtag
/slash
%percent
euro
&and
-dash
_underscore
5

Submission

Checklist before upload:
  • Audio is clear and consistent
  • No interruptions or background noise
  • Full duration completed
  • File is exported as .mp3 or .wav
Upload: Submit the file to your dedicated channel of communication.