Categories: Gadgets360

ElevenLabs Releases AI-Powered Voice Design API and X to Voice Features

ElevenLabs, a New York-based artificial intelligence (AI) firm, released an application programming interface (API) for its Voice Design feature, which recently made its debut. The announcement came last week, and alongside, the company also introduced an open-source project dubbed X to Voice, which can generate a unique voice for an X (formerly known as Twitter) profile based on the posts of the user. The feature also shows a text prompt which is auto-generated based on the analysis of the profile.

ElevenLabs Releases New AI Tools

In a blog post, ElevenLabs detailed the two new AI tools. The first is an API version of the Voice Design tool, which was recently introduced. Voice Design is a new capability developed by the company which can generate unique AI voices based on text prompts. These voices are based on the description shared by the user, including the pitch, timbre, delivery pace, intonation, and more.

Now, this feature is being made available via the company’s API. This means developers can use this capability to build apps and software. Voice Design can either be offered by developers to develop voices for their AI characters or to users so that they can generate new voices for themselves.

The company has offered two endpoints. First allows developers to generate three unique voice previews based on a text prompt. The second allows them to save the voice previews to their library for local use. ElevenLabs did not highlight the price of the API or the cost per request of the AI model. Details about the AI model are also not known.

The second tool is the company’s open-source project dubbed X to Voice. It is an extension of the feature available to test on a web client here. Users can add an X username and the AI will automatically analyse the profile including the bio and posts. Once analysed, it generates a text prompt on the basis of the analysis.

The text prompt is then fed to Voice Design automatically to generate a unique voice for the profile. Gadgets 360 tested out the feature and found that it takes between 30 seconds to a minute to generate voice previews for a profile. In total, three voice previews are generated. The AI voice speaks a line which is also based on the analysis of the profile.

Alongside the three voice previews, the page also displays the text prompt it used to generate the AI voice. We also found that the feature animates the profile pictures of users who have added a close of their face and syncs lip and mouth movements to match the words that are being spoken.

Recent Posts

Beyoncé’s NFL Christmas Halftime Show Now Streaming on Netflix: Everything You Need to Know

Beyoncé's much-anticipated halftime performance, part of Netflix's NFL Christmas Gameday event, is set to release…

10 months ago

Scientists Predict Under Sea Volcano Eruption Near Oregon Coast in 2025

An undersea volcano situated roughly 470 kilometers off Oregon's coastline, Axial Seamount, is showing signs…

10 months ago

Organic Molecules in Space: A Key to Understanding Life’s Cosmic Origins

As researchers delve into the cosmos, organic molecules—the building blocks of life—emerge as a recurring…

10 months ago

The Secret of the Shiledars OTT Release Date Announced: What You Need to Know

Director Aditya Sarpotdar, following his successful venture "Munjya," has announced the release of his treasure…

10 months ago

Anne Hathaway’s Mothers’ Instinct Now Streaming on Lionsgate Play

The psychological thriller Mothers' Instinct, featuring Anne Hathaway, Jessica Chastain, and Kelly Carmichael, delves into…

10 months ago

All We Imagine As Light OTT Release Date: When and Where to Watch it Online?

Payal Kapadia's award-winning film, All We Imagine As Light, will soon be available for streaming,…

10 months ago