- Sabrina Ramonov
100% Automated AI Clone Videos with Professional Voice!
How to improve your AI Clone's voice with ElevenLabs' professional voice cloning
This automation combines Make, Perplexity, ChatGPT, HeyGen, ElevenLabs, and Blotato to research, write, create, and distribute talking head AI clone videos, with professional voice quality, to every social media platform every single day. 100% automated.
This updated tutorial shows you how to improve your AI Clone's voice by modifying HeyGen's API parameters and/or integrating your ElevenLabs professional voice clone.
I've also simplified the Make scenario by removing JSON data structures, so the only things you need to set up in Make are:
connect your accounts (Perplexity, HeyGen, optional ChatGPT, optional ElevenLabs)
add your Blotato.com API key and account IDs (in the node "Setup Social Accounts")
Grab the updated Make blueprint at the bottom.
Here's the YouTube version of this post, which I highly recommend you watch to hear the differences in voice quality:
Overview
Here's an overview of the full workflow, which I covered in detail in the last post.
1. Research and Scripting
Use Perplexity AI to gather real-time information on your niche or topic, like "practical cybersecurity tips for families."
Generate a monologue video script
Use ChatGPT to create a video caption. You can also use Perplexity or Claude, instead of ChatGPT.
2. Video Avatar Creation
Use HeyGen's API to create your avatar video.
Make sure you haven't run out of HeyGen API credits (only 10 videos per month)
Incorporate delays while your automation waits for the avatar video to be generated. Generally, a 5-10 minute delay is enough for videos around 1 minute long. For longer videos, you may need to add more delays.
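If you ever rebuild this step outside of Make, the "wait, check, wait again" logic can be sketched in a few lines of Python. This is only a sketch, not how the Make scenario works internally: `fetch_video_url` is a hypothetical function you would write yourself to ask HeyGen for the finished video, and the retry delay and check limit are made-up defaults.

```python
import time
from typing import Callable, Optional


def wait_for_video_url(
    fetch_video_url: Callable[[], Optional[str]],  # hypothetical: you supply this
    first_delay_s: float = 300.0,   # 5 minutes, the low end of the range above
    retry_delay_s: float = 120.0,   # assumption: extra wait between checks
    max_checks: int = 5,            # assumption: cap so the loop always ends
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[str]:
    """Wait, then check for the finished video a limited number of times.

    Returns the video URL, or None if it never showed up.
    """
    sleep(first_delay_s)
    for attempt in range(max_checks):
        url = fetch_video_url()
        if url:  # an empty string or None means "not ready yet"
            return url
        if attempt < max_checks - 1:
            sleep(retry_delay_s)
    return None
```

If it returns None, the video either needs more time or you are out of API credits, so check your HeyGen account before raising the limits.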
3. Distribute to Socials
Fetch the completed avatar video using its video ID. If the returned video_url is empty, then either:
HeyGen hasn't finished making your video, so add a longer delay.
Or, you've run out of HeyGen API credits.
Connect your social media accounts in Blotato.com
Obtain your Blotato API key and social account IDs
In the Make automation, there is an HTTP node that uploads the video to Blotato via API, then distributes it to all your social platforms.
Approaches to Improving Voice Quality
I'll cover 3 ways to improve your avatar's voice quality:
Improve Training Data
Tweak HeyGen Voice Settings
ElevenLabs Voice Integration
The neat thing: you can combine approaches!
In the accompanying YouTube video, I show 3 avatar examples, using option 2, option 3, and combining options 2 and 3.
1. Improve Training Data
"Garbage in, garbage out" is a data science axiom...
You can significantly improve your avatar's voice simply by recording high-quality audio:
quiet environment
no background noise
use a professional external microphone, such as a Blue Yeti USB Mic (NOT your iPhone or AirPods)
upload 5 full minutes of you talking (NOT just the minimum duration)
exaggerate your talking style and intonation (because HeyGen tends to produce a monotone voice)
These tips always apply, whether you use HeyGen, ElevenLabs, or some other service.
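If you record to WAV, you can sanity-check the "5 full minutes" tip before uploading. Here's a minimal sketch using only Python's standard `wave` module. It assumes an uncompressed WAV file (it won't read MP3 or M4A), and the function names are just for illustration.

```python
import wave

MIN_SECONDS = 5 * 60  # the "5 full minutes" recommended above


def wav_duration_seconds(path: str) -> float:
    """Return the length of an uncompressed WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_long_enough(path: str, min_seconds: float = MIN_SECONDS) -> bool:
    """True if the recording meets the minimum length."""
    return wav_duration_seconds(path) >= min_seconds
```

Length is the only thing this checks. A quiet room and a good microphone still matter more than hitting the number.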
2. Tweak HeyGen Voice Settings
If you don't want to pay for YET ANOTHER SUBSCRIPTION...
Playing around with HeyGen's voice settings may be sufficient.
In Make, open the node "Create Avatar Video" and scroll down to these settings:
From HeyGen's API documentation:
Speed: Voice speed, value between 0.5 and 1.5. Default is 1.
Pitch: Voice pitch, value between -50 and 50. Default is 0.
Emotion: Voice emotion, if voice support emotion. value are ['Excited','Friendly','Serious','Soothing','Broadcaster']
I recommend playing around with these values to improve your avatar's voice, making it sound less monotone.
I liked these settings, but it's personal preference:
Speed: 1.1
Pitch: 50
Emotion: Excited
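If you end up setting these values in code rather than in the Make node, it helps to validate them against the documented ranges first. Here's a rough sketch. The ranges and emotion list come from the documentation quoted above, but the function name and the dict keys are my assumptions, so check HeyGen's API docs for the exact field names your request needs.

```python
from typing import Any, Dict, Optional

ALLOWED_EMOTIONS = ("Excited", "Friendly", "Serious", "Soothing", "Broadcaster")


def build_voice_settings(
    speed: float = 1.0,
    pitch: int = 0,
    emotion: Optional[str] = None,
) -> Dict[str, Any]:
    """Validate the three voice settings and return them as a dict.

    The dict keys are assumptions for illustration only.
    """
    if not 0.5 <= speed <= 1.5:
        raise ValueError(f"speed must be between 0.5 and 1.5, got {speed}")
    if not -50 <= pitch <= 50:
        raise ValueError(f"pitch must be between -50 and 50, got {pitch}")
    if emotion is not None and emotion not in ALLOWED_EMOTIONS:
        raise ValueError(f"emotion must be one of {ALLOWED_EMOTIONS}, got {emotion!r}")

    settings: Dict[str, Any] = {"speed": speed, "pitch": pitch}
    if emotion is not None:
        settings["emotion"] = emotion  # only some voices support emotion
    return settings


# The settings I liked, from the list above
print(build_voice_settings(speed=1.1, pitch=50, emotion="Excited"))
```

Catching an out-of-range value here is cheaper than finding out after a video has already used up one of your monthly API credits.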
3. ElevenLabs Voice Cloning
HeyGen's default voice cloning is nice, but ElevenLabs offers a significantly more sophisticated voice cloning service.
ElevenLabs Setup
Sign up at https://elevenlabs.io/pricing
Select the "Creator" plan if you want to make a very high-quality Professional Voice Clone using hours of audio upload.
Otherwise, choose the "Starter" plan and you'll be able to make an Instant Voice Clone.
Create ElevenLabs Instant Voice Clone
For this tutorial, I walk through creating an Instant Voice Clone, which is much faster than creating a Professional Voice Clone. It's the exact same process, but the latter requires a lot more data (I uploaded 4.5 hours of audio for mine) and takes much longer for the voice to be ready (~6 hours).
In contrast, an Instant Voice Clone is done in a few minutes:
Go to ElevenLabs app
Click "Voices" in left sidebar
Click "Add a new voice"
Select "Instant Voice Clone"
Fill out the form - name your voice, add up to 25 audio samples (up to 10MB each), add labels, add a description, etc.
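Before you start uploading, you can check your audio files against the two limits from the form (up to 25 samples, up to 10MB each). This is a small sketch with a made-up helper name. It reads "10MB" as 10,000,000 bytes, which is my assumption and the stricter of the two common interpretations.

```python
from typing import Dict, List

MAX_SAMPLES = 25
MAX_SAMPLE_BYTES = 10 * 1000 * 1000  # assumption: "10MB" read as 10,000,000 bytes


def check_samples(sizes_in_bytes: Dict[str, int]) -> List[str]:
    """Return a list of problems; an empty list means the samples fit the limits."""
    problems = []
    if len(sizes_in_bytes) > MAX_SAMPLES:
        problems.append(
            f"{len(sizes_in_bytes)} samples selected, limit is {MAX_SAMPLES}"
        )
    for name, size in sizes_in_bytes.items():
        if size > MAX_SAMPLE_BYTES:
            problems.append(f"{name} is over the per-file size limit")
    return problems


# Example with made-up file names and sizes
print(check_samples({"intro.mp3": 4_200_000, "long_take.wav": 31_000_000}))
```

In practice you would build the dict from real files, for example with `os.path.getsize`, and split any oversized recording into shorter clips.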
To view all your ElevenLabs voices, go to My Voices > Personal:
My instant voice clone and my professional voice clone
HeyGen and ElevenLabs Integration
Grab your ElevenLabs API key via your account settings
Go to HeyGen > AI Voice > Integrate 3rd party voice (top right corner)
Select "ElevenLabs" and input your API key
Click "Manage" to import your ElevenLabs voices into HeyGen
Find your instant voice clone and toggle "Import to HeyGen"
Awesome, you're all set!
Now you can access your ElevenLabs voices within HeyGen and Make.
Final step:
Go back to your Make automation, open the "Create Avatar Video" node, click "Voice ID", and select your newly made instant voice clone:
Now, your automation creates a HeyGen avatar video using your ElevenLabs voice clone!
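In Make you pick the voice from a dropdown, but if you script this step yourself you will need the voice's ID instead of its name. Here's a tiny sketch of that lookup. The record shape (a list of dicts with "name" and "voice_id" keys) and the IDs in the example are hypothetical, so adjust them to whatever your voice list actually returns.

```python
from typing import Dict, List, Optional


def find_voice_id(voices: List[Dict[str, str]], name: str) -> Optional[str]:
    """Return the ID of the first voice whose name matches, ignoring case."""
    wanted = name.strip().lower()
    for voice in voices:
        if voice.get("name", "").strip().lower() == wanted:
            return voice.get("voice_id")
    return None


# Made-up records for illustration only
my_voices = [
    {"name": "Sabrina Instant", "voice_id": "voice-001"},
    {"name": "Sabrina Professional", "voice_id": "voice-002"},
]
print(find_voice_id(my_voices, "sabrina professional"))
```

Matching on the name is convenient but fragile if you rename a voice later, so store the ID once you have it.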
Closing Thoughts
Personally, I'm super impressed with the quality of my AI avatar after connecting my ElevenLabs professional voice clone. I plan to experiment with short-form videos to promote Blotato using this automation and, as always, I'll share what I learn.
There's still a difficult balance between automation and quality, so please don't assume this automation makes content creation "easy." The algorithms are very good at NOT showing low-quality content, whether AI or human generated.
Here's the plug-and-play Make blueprint that you can import into your account: