The Prosthetic Voice

When Roger Ebert lost his lower jaw—and, thus, his voice—to cancer, the text-to-speech company CereProc created a synthetic voice that would be custom-made for the film critic. The computerized voice, a fusion of the words Ebert had recorded in his long career, would not sound fully natural; it would, however, sound distinctive. It was meant to help Ebert regain something he had lost with the removal of his vocal cords: a voice of his own.

Most people are not so lucky. Those who have had strokes—or who live with ailments like Parkinson's or cerebral palsy—often rely on versions of synthetic voices that are completely generic in their delivery. (Think of Stephen Hawking's computerized monotone. Or of Alex, the voice of Apple's VoiceOver software.) The good news is that these people are able to be heard; the bad news is that they have still been robbed of one of the most powerful things a voice can give us: a unique, and audible, identity.

Up in Boston, Rupal Patel is hoping to change that. She and her collaborator, Dr. Tim Bunnell of Nemours AI DuPont Hospital for Children, have for several years been developing algorithms that build voices for those unable to speak—without computer assistance. The voices aren't just natural-sounding; they're also unique. They're vocal prosthetics, essentially, tailored to the existing voices (and, more generally, the identities) of their users.

They're premised on the idea, Patel told me, that technology now allows us to think about the voice "just like we think about fonts for written text."

It works like this: Volunteers come to a studio and read through several thousand sample sentences (sourced from books like White Fang, The Velveteen Rabbit, and The Wonderful Wizard of Oz). Patel, Bunnell, and their team then take recordings of a recipient's own voice, if possible, to get a sense of its pitch and tone. (If the recipient has no voice at all, they select for thing like gender, age, and regional origin.) Then, the team strips down the voice recordings into tiny units of speech—so that, for example, a single vowel could exist as multiple units. Then, using a software tool called ModelTalker, they blend the two voice samples together to create a new, lab-engineered voice: a collection of words that are at the disposal of a person who needs them to communicate.

This is, needless to say, a painstaking process. Creating a voice that is simply usable, New Scientist notes, requires a donor to read at least (at least!) 800 sentences. And coming up with a voice that sounds relatively natural requires 3,000 sentences to be read. Plus, the current system—human recording combined with algorithmic remixing—requires the physical presence of voice donors. "Right now," Patel told me, "our process is to call people into the laboratory—and that doesn't scale."

Despite all those impediments, though, people seem to be interested in lending their voices to those in need. Patel and Bunnell are now developing the Human Voicebank Initiative, a project that aims to create a repository of human voices that can be donated to people who don't have voices of their own—and the initiative currently has more than 10,000 people registered as voice donors, Patel says. She and her team are building up the project's tech infrastructure, developing tools like a web client and an iPhone app that will allow donors to do their own recordings in their own time.

It's an appropriate use, perhaps, of the devices that will increasingly call on human voices for their commands. "When we're thinking about technologies that you and I use and rely on, we're now going to use speech much more," Patel says. "We talk to our phones, and our phones talk to us."

The Prosthetic Voice

Trending Articles

Scuffham Amps - S-GEAR 2.6.0 VST, AAX, STANDALONE x86 x64 (R2R NO iLok2, +NO...

Practice Sheet of Right form of verbs for HSC Students

VHSE First (1st) Allotment 2025 - vhscap.kerala.gov.in

UNIVERSE LEAGUE – UNIVERSE LEAGUE – WAR (We Are Ready) – EP [iTunes Plus M4A]

City Hunter Teledrama – Episode 18 – 07th May 2016

Comment on Proposed Criteria for Identifying Predatory Conferences by Luke...

Bureau of Internal Revenue: Regional Offices (Directory)

Kendrick Lamar – Not Like Us (2024) [24Bit-88.2kHz] [PMEDIA] ⭐️

Inception 2010 Hindi Dual Audio 650MB BRRip 720p ESubs HEVC

East Hull MD admits sexual assaults after another victim comes forward

Download: FK ft Shenky – Nakuyewa ”Prod by: Shenky”

R. v. Sargeant, 2023 ONSC 6406 (CanLII)

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

Who’s been sentenced at Northampton Magistrates’ Court

मतलबी दोस्त स्टेट्स | Matlabi Dost Status in Hindi – Selfish Friends Status

Family cries out as traditional ruler allegedly abducts brother, extorts N2.5m

Long-Running Conflict In Springfield (MA) Gangland Sphere Has Manzi Family &...

Wondershare Filmora X v10.1.20.16 x64

Man arrested after fracas in flat

Man charged in ongoing Sexual Assault Investigation Derek Nyilas, 46, Faces...