Siliconindia Magazine

News

IIT graduates-led 'Smallest.ai' unveils World's fastest real-time Text to Speech model

By Team Startupcity | Tuesday, 05 November 2024, 12:43 IST

The world's fastest real-time text-to-speech model, lightning, has been launched by Smallest.ai, a top developer of multi-modal AI based models with its headquarters located in San Francisco, California. Lightning can produce up to 10 seconds of audio in just 100 milliseconds. This significantly streamlines the integration process while allowing voicebot providers worldwide to create incredibly lifelike bots with sub-second latency.

With prices starting at 0.02 USD/min (1.6 Rs/min), Lightning is also significantly less expensive than its Western rivals, which is another factor that makes it revolutionary. This makes it possible to employ voicebots on a population scale for less than 1 rupee a minute.

Real-time text-to-speech models typically need streaming, which creates a web socket connection. This raises the server's processing burden and makes scaling AI-based tools and voicebots more challenging and costly. Lightning makes it feasible to obtain audio using a basic REST API in about 100ms, which allows bot providers to grow more quickly while significantly reducing API expenses.

Several Hindi and English accents are presently supported by Lightning. In the upcoming months, Smallest.ai intends to integrate additional Asian, European, and Indian languages.

Voicebot platforms that had early access to Lightning saw a significant improvement in speech quality and an eight-fold reduction in their cost per minute.

Although Lightning is designed for real-time applications, it can also be used to provide voiceovers for reels on YouTube, Instagram, and several other social media sites, as well as audiobooks. Lightning is accessible to non-developers via the Waves Speech platform, which offers beta versions of services like accent conversion and voice cloning.

Sudarshan Kamath, Founder of Smallest.ai says, "Why are 1B humans not speaking to AI voices everyday despite incredible advancements in Voice AI? This is the problem we are trying to solve."

Swiggy's Initial Public Offering to open on November 06, 2024

Siliconindia Magazine

IIT graduates-led 'Smallest.ai' unveils World's fastest real-time Text to Speech model

CURRENT ISSUE

BEST STARTUPS TO WORK FOR

Green Is The Way To Sustainable Shipping Solutions

Innovatively Crafting Delightful Treats With Quality Intact

Empowering Businesses With Legal Finesse