CaraComp
Log inGet Started
CaraComp
Forensic-Grade AI Face Recognition for:
Get Started7-day refund guarantee**
biometrics

Big Tech Stole Their Voices to Train AI — Now Illinois Law Could Cost Billions

Big Tech Stole Their Voices to Train AI — Now Illinois Law Could Cost Billions

Nine journalists and audiobook narrators walked into court with a surprisingly simple argument: you took our voices without asking, you built a product that replaces us, and Illinois law says that's illegal. If they're right — and there's mounting evidence they are — the entire AI training data economy just got a lot more expensive.

TL;DR

Class-action lawsuits filed under Illinois' Biometric Information Privacy Act argue that AI companies capturing voiceprints for model training are violating the same biometric consent law that has already cost the tech industry billions — and if courts agree, voiceprints will finally get the same legal protection as faceprints and fingerprints.

The lawsuit, reported by Common Dreams, targets nine major technology companies across nine separate class-action filings. The plaintiffs aren't fringe activists. They're working journalists and professional audiobook narrators — people whose voices are their livelihoods — and they're using a 17-year-old Illinois privacy statute to argue that AI companies committed the biometric equivalent of identity theft at industrial scale.

This isn't a copyright case. It's not a fair use argument. It's a biometric privacy claim, and that distinction matters more than most coverage of this story has bothered to explain.

What BIPA Actually Says — And Why It Hits Differently Here

Illinois passed the Biometric Information Privacy Act in 2008, back when "biometric data" mostly meant fingerprint scanners at factories and retinal scans in sci-fi movies. But the legislators who drafted it were surprisingly forward-thinking. BIPA defines biometric identifiers to include voiceprints alongside fingerprints, retina scans, and facial geometry. That language sat mostly dormant for years until plaintiff attorneys realized it was a loaded weapon pointed directly at the tech industry.

The mechanism is worth understanding. A voiceprint isn't a recording — it's a mathematical model of the acoustic properties that make your voice uniquely yours. Pitch, timbre, resonance, the subtle physiological characteristics of your vocal tract. According to Biometric Update, this representation is specific enough to identify a speaker the same way a fingerprint identifies a person — and that's exactly what makes it irreplaceable. You can change a password. You can freeze a credit card. You cannot change your voice. This article is part of a series — start with Federal Judges Just Gutted The Its Real Defense And Investig.

That irreversibility is what makes courts take this seriously. When AI companies ingest hours of someone's narration, extract the voiceprint, and encode it into a text-to-speech model, the harm isn't just commercial. It's permanent. The biometric data is out there, embedded in a system that can now synthesize speech in your voice without you, forever.

$1.375B
Google's settlement with Texas over unlawful collection of voiceprints via Google Photos and Google Assistant
Source: Biometric Update / Capitol News Illinois

The Market Displacement Nobody Wants to Talk About

Here's the thing that makes this lawsuit different from the usual privacy-violation hand-wringing: the plaintiffs can show direct economic harm. Google Text-to-Speech is already being used by audiobook publishers instead of human narrators. Google's NotebookLM generates podcast-style audio overviews in synthetic voices that bear an uncanny resemblance to real human speech. The technology that was allegedly trained on these narrators' voices now competes directly with those narrators for the same contracts.

That's not hypothetical future harm. That's a story a judge can follow without a technology PhD.

As detailed in Capitol News Illinois, the defendants include some of the biggest names in AI development, and the complaints allege something damning: these companies knew exactly how to build consent systems that comply with BIPA — they built them in other contexts — and chose not to apply those systems to their voice training pipelines. That's not an oversight. That's a business decision.

"A stolen fingerprint or voice scan has no remedy, unlike a stolen Social Security number — you can get a new Social Security number, but you cannot get a new voice or new fingerprints." — Legal argument framing cited in Courthouse News Service

That framing — voiced in the legal complaints themselves, as reported by Courthouse News Service — is the one that will resonate with judges. It moves this out of the abstract and into something viscerally comprehensible. You didn't just take something. You took something that cannot be returned. Previously in this series: The 3 Forensic Checks That Expose A Deepfake Your Eyes Will .

Trusted by Investigators Worldwide
Run Forensic-Grade Comparisons in Seconds
Court-ready facial comparison reports. Results in seconds.
Get Started
7-day refund guarantee**

BIPA's Track Record Makes This Very Real

Skeptics will say these lawsuits are long shots. They're wrong, and the receipts are already in the system. BIPA has generated over 1,400 class action lawsuits since it took effect, with settlements ranging from nuisance-level payouts to genuinely staggering figures. Whole Foods paid $300,000 over voiceprint collection. Facebook settled a facial recognition case for $650 million. And in November, Google settled a case with the state of Texas for $1.375 billion, specifically over the unlawful collection of voiceprints through Google Assistant and facial geometry data through Google Photos.

Read that number again. $1.375 billion. That's not a theoretical risk. That's a line item on a balance sheet.

The legal strategy outlined by the plaintiffs' firm, as explained at Loevy + Loevy, is built on a specific advantage that biometric law offers over copyright claims: BIPA sidesteps the fair use defense entirely. Copyright cases get bogged down in arguments about transformative use, research exemptions, and the murky status of training data under existing law. BIPA doesn't care about any of that. It asks one question: did you collect a biometric identifier without written consent? Yes or no.

Why This Matters for Biometrics Broadly

  • Consent chains become non-negotiable — Auditable records of biometric consent aren't a best practice anymore; they're the difference between operating legally and facing nine-figure liability.
  • 📊 Voice joins the protected tier — If courts rule voiceprints fall under BIPA's core protections, every platform capturing user audio for any AI purpose needs to reassess its data pipeline immediately.
  • 🔮 Training data economics change permanently — Consent-compliant voice data costs more to acquire upfront. But BIPA settlements cost billions. The math is finally starting to favor doing it right the first time.
  • ⚖️ The fragmentation risk is real — Texas has already signaled it may carve out AI training data from biometric restrictions. If states diverge, companies will face a compliance patchwork that makes GDPR look simple.

The Counterargument — And Why It Probably Won't Hold

AI companies have a defense prepared, and it's not stupid. They'll argue that voiceprints embedded in model weights don't constitute "collection" of biometric data under BIPA, that feature vectors extracted during training aren't speaker-specific identifiers in the legal sense, and that the statute wasn't designed to regulate this use case. As Bloomberg Law notes, the question of whether voiceprints can "uniquely identify" an individual the way fingerprints can will be a central battleground — and courts haven't definitively answered it yet.

That's a legitimate legal uncertainty. But here's why it probably doesn't save them. The technology defendants are selling — text-to-speech synthesis that replicates specific voices — is direct evidence that their own systems treat voiceprints as unique identifiers. You can't simultaneously argue your product can clone a journalist's voice with uncanny accuracy AND that voice data doesn't uniquely identify people. That's not a legal argument. That's a contradiction. Up next: Biometric Data Legislation Investigator Compliance Risk.

The bigger risk for defendants is what happens during discovery. If internal communications show product teams discussing BIPA compliance and choosing to route around it — which is what the complaints allege — the litigation calculus shifts from "will we lose" to "how much will we pay."


For the facial recognition industry, this story is more than adjacent — it's a preview. The same logic that makes voiceprints protected biometric identifiers applies directly to faceprints. Every platform that captures facial geometry for any purpose, whether for authentication, analysis, or AI training, is operating in a legal environment that is getting stricter quarter by quarter. Courts are developing an increasingly sophisticated understanding of what makes biometric data dangerous: permanence, uniqueness, and the inability to revoke it once it's out. Those properties don't belong only to faces. They belong to voices, gaits, and whatever biometric modality AI decides to target next.

Key Takeaway

Voiceprints meet every legal and technical definition of a biometric identifier — they're permanent, unique, and physiologically determined — and the companies that treated them as free training material are now discovering that BIPA doesn't care how novel your product is. If you captured biometrics without consent, you owe an explanation to a judge, not just a privacy policy update.

If a faceprint requires explicit written consent under Illinois law — and it does — there is no principled argument for why a voiceprint should get different treatment. The lawsuits filed by these journalists and narrators aren't just about their careers. They're stress-testing a legal architecture that the entire industry will have to live inside. And right now, the architecture is holding.

The question worth sitting with isn't whether these companies will eventually pay. Given BIPA's track record, the Texas settlement, and the strength of the legal theory, some version of payment is almost certainly coming. The real question is simpler and more uncomfortable: if a single journalist's voice is biometric data the moment it enters a training pipeline, what exactly did these companies think they were collecting?

Ready for forensic-grade facial comparison?

2 free comparisons with full forensic reports. Results in seconds.

Run My First Search