Friday, April 3, 2026

The Daily Scroll

Where Every Story Has a Voice

Featured image: ElevenLabs Wants to Make You a Music Producer Now
Tech

ElevenLabs Wants to Make You a Music Producer Now

The AI voice company just entered the music game. Here's what it can actually do.

Can an app that's never heard a guitar solo understand what makes a song feel like something? That's the question ElevenLabs is quietly betting you won't ask — because today, the company best known for eerily realistic AI voice cloning just launched a new music-generation app, and the tech press is already losing its mind about it.

Here's what's actually happening: ElevenLabs, the startup valued at $3.3 billion as of its January 2024 funding round, has expanded beyond voice into full music generation. The new app lets users create original tracks from text prompts, adjust vocals, and layer instrumentals — all without touching an instrument, hiring a producer, or knowing what a BPM is.

Whether that's exciting or alarming depends entirely on where you sit in the music industry. Spoiler: most people who make their living in it are sitting somewhere between uncomfortable and furious.

Enjoying this? Get stories like this delivered daily.

Article photo 1

What ElevenLabs Actually Announced

ElevenLabs is calling this a standalone music app — not a plugin, not a buried feature — built to generate complete songs from text descriptions. You type something like "upbeat lo-fi hip hop with melancholy piano and rain sounds," and the app produces a track. (The company calls this "AI-powered music creation." What it actually does is pattern-match against an enormous dataset of existing music and reassemble sonic elements into something new-ish.)

The app also integrates ElevenLabs' existing voice synthesis technology, meaning generated tracks can include AI vocals — sung in styles ranging from pop to R&B — without a human singer in the room. That's the part that's genuinely new here. Most music-gen tools, like Suno or Udio, already do text-to-song. ElevenLabs is betting its voice quality is the differentiator.

And honestly? On voice quality alone, they have a case. ElevenLabs' voice synthesis has been the industry benchmark for realism since at least 2023. If that fidelity carries over to singing, this is a different product than what Suno v3 or Udio 1.5 are offering right now.

Article photo 2

Who Is ElevenLabs, and Why Are They Doing This?

If you haven't been following the AI audio space, here's the quick version: ElevenLabs launched in 2022, founded by Piotr Dabkowski and Mati Staniszewski, two former Google and Palantir employees. They built the most convincing real-time voice cloning tool on the market. By early 2024, they'd raised $80 million in a Series B led by Andreessen Horowitz and hit that $3.3 billion valuation.

Their core product lets you clone a voice from a short audio sample and generate speech in that voice. Publishers use it for audiobooks. Podcasters use it for multilingual dubbing. Bad actors have used it for scam calls, which is a whole other conversation — one ElevenLabs has repeatedly said it's working to address through watermarking and detection tools.

Music is the logical next territory. Audio is audio. The company already has the infrastructure, the compute relationships, and the brand recognition. Moving into music generation isn't a pivot; it's an expansion. If you've already built the most realistic voice engine in the industry, why wouldn't you try to sing with it?

Article photo 3

How Does It Stack Up Against Suno and Udio?

This is the question everyone in the music-tech space is actually asking today. Suno, which raised $125 million in 2024 at a reported $500 million valuation, and Udio, backed by a16z, are the two dominant text-to-music platforms right now. Both generate full songs — vocals, instrumentals, lyrics — from a text prompt in under a minute.

Suno's v3 model, released in early 2024, was genuinely impressive for pop and folk structures. Udio has been stronger on genre diversity and audio fidelity. Neither has ElevenLabs' specific advantage in voice realism.

Is ElevenLabs' entry going to immediately leapfrog both? Probably not on day one. Music generation is harder than speech synthesis because it requires harmonic coherence over time — a voice can be convincing in isolation, but a song has to stay in key, maintain rhythm, and resolve melodically. Those are different engineering problems.

Article photo 4

But ElevenLabs doesn't need to win on launch day. They need to be good enough to acquire users and iterate. And with $80 million in the bank and an existing user base that's already comfortable with their platform, they have runway to get there.

What This Means for Actual Musicians

Let's not dress this up: tools like this one are a real economic pressure on working musicians. Not the headliners. Not the artists with label deals and streaming minimums. The session musicians, the jingle writers, the composers who score YouTube videos and indie games and corporate training videos.

That market — what the industry calls sync licensing and production music — was already under pressure from stock music libraries like Epidemic Sound and Artlist. AI music generation is the next wave of that compression. A marketing team that used to pay $500 for a custom jingle can now generate fifty options in an afternoon for the cost of a subscription.

Article photo 5

I covered Silicon Valley's record-breaking Q1 funding spree earlier this year, and the through-line was exactly this: capital is flowing toward tools that automate creative and knowledge work, and the creative class is absorbing the economic shock of that in real time. ElevenLabs' music app is one more data point in that trend.

Is this a problem? Depends on who you ask. If you're a brand manager with a $2,000 content budget, it's a solution. If you're a composer who spent five years building a client roster, it's a threat. Both of those things are true simultaneously.

The Legal Situation Is Still a Mess

Here's the part that doesn't get enough attention in the launch-day coverage: the training data question is completely unresolved.

Article photo 6

Suno and Udio are currently facing a lawsuit filed in June 2024 by the Recording Industry Association of America (RIAA) on behalf of Universal Music Group, Sony Music, and Warner Music Group. The core claim is that these models were trained on copyrighted recordings without license or compensation. The case is still working through the courts.

ElevenLabs hasn't disclosed what its music model was trained on. That's not unusual — almost no generative AI company has been fully transparent about training data — but it means the same legal exposure that's hanging over Suno and Udio is almost certainly hanging over ElevenLabs' new product too.

The company has been relatively careful about compliance in its voice products, building in features like consent verification for voice cloning. Whether that same rigor extended to the music training pipeline is something we don't know yet. And in the current legal environment, that's not a small question.

Article photo 7

What You Should Actually Do With This Information

If you're a creator, marketer, or just someone who makes things for the internet, here's the practical read: ElevenLabs' music app is worth watching, but probably not worth switching to immediately if you're already using Suno or Udio for quick content generation.

The voice quality differentiation is real, but it only matters for tracks where vocals are the centerpiece. For background music, ambient sound design, or instrumental content, the existing tools are already good enough and more battle-tested.

If you're a musician or audio professional, the more important move right now is to understand where AI-generated music legally cannot be used. Many sync licensing platforms, broadcast networks, and gaming companies have explicit policies against AI-generated content — policies that are getting stricter, not looser, as the RIAA lawsuit moves forward. Knowing those boundaries is where your competitive advantage lives right now.

Article photo 8

And if you're just curious — if you want to type a prompt and hear what comes out — the app is available now. Go try it. Just don't mistake novelty for quality, and don't mistake quality for ownership. Those are two very different things, and the music industry's lawyers are currently very focused on the second one.

The Bottom Line

ElevenLabs entering music generation is a significant move from a company with real technical credibility and serious financial backing. It's not a toy project. It's a strategic expansion into a market that's already contested, legally murky, and economically consequential for working musicians.

The voice quality could genuinely be the differentiator — if the singing holds up to the same standard as their speech synthesis, they have something neither Suno nor Udio can easily replicate. That's a meaningful if.

Article photo 9

What this isn't is a revolution. (I know, I know.) It's a well-funded company doing what well-funded companies do: finding an adjacent market, deploying capital, and seeing what sticks. The music industry has survived drum machines, Auto-Tune, and Spotify's royalty structure. It'll survive this too — just not without some painful economic reshuffling along the way.

The one thing I'd tell you to actually do today: if you're in any kind of audio-adjacent creative work, go use the app for five minutes. Not because it will replace you. Because understanding your competition is always better than ignoring it.

Some links in this article may earn us a small commission — at no extra cost to you.