Issue 09 – The Sound of You, Cloned

Multimodal, Eleven Labs, Voice Cloning

🧠 AI Term of The Week: Multimodal

Multimodal AI is a model that can understand and generate across multiple types of input like text, images, audio, and video all at once.

Instead of just reading or writing words, a multimodal model can do things like:

  • Look at an image and explain it

  • Watch a video and summarize it

  • Hear a sound and respond

  • Read a chart and answer questions about it

  • Or even combine all of the above

Why it matters:
Multimodal AI unlocks more human-like interaction. It sees, listens, reads, and speaks just like we do. Tools like GPT-4o, Gemini, and Claude Opus are already showing us what's possible.

For creators:
This means smarter workflows, faster idea generation, and way more creative potential, especially when you combine visuals, voice, and text in your builds.

💬 Quote of the Week:

“Opportunity is missed by most people because it's dressed in overalls and looks like work.”

— Henry Ford

🧰 AI Tool of The Week: Eleven Labs

ElevenLabs is no longer just a voice generator — it’s becoming a full audio studio powered by AI.

You can now:

  • Clone your voice with scary accuracy

  • Generate new voices with emotions, accents, and style

  • Create sound effects with text prompts (using the new Sound Effects tool)

  • Use their new Projects feature to assemble complete voiceovers with timing and narration tracks

It's a perfect match for a multimodal workflow:
Use ChatGPT for the script, Sora for visuals, and ElevenLabs for voice and sound. Just like that, you’ve built a video — no microphone or recording audio, no camers, no problem.

🛠 What I Made:
Eleven Labs: How To Clone Your Voice Using AI

This tutorial will show you how to use Eleven Labs to clone your own voice so you can generate audio recordings from text prompts that sound like you.

Here’s what you’ll learn:
✅ How to create account and upgrade on Eleven Labs
✅ How to open App and access Voice Cloning
✅ How to upload audio and clone voice
✅ How to Text to Speech and select Voice Clone

Tools used:

💬 Thank You!

– Mike Murphy
Learning AI.
Building digital products.
Make a living as a content creator.