Groq Launches Distil-Whisper: A Faster API for Speech Recognition

Groq has launched Distil-Whisper (distil-whisper-large-v3-en), a faster and more affordable API for speech recognition, on the GroqCloud™ Developer Console. This compressed version of OpenAI’s Whisper model is designed to provide faster (164x) and more efficient English speech recognition while maintaining high accuracy.

What is Distil-Whisper?

Distil-Whisper is a streamlined version of the Whisper Large V3 model, engineered to balance speed and accuracy. It is 51% smaller, with 756 million parameters compared to Whisper Large V3’s 1.55 billion, and runs at a 240x real-time speed factor. In other words, it transcribes audio 240 times faster than the audio plays back, so a one-hour recording takes roughly 15 seconds, which makes it a strong choice for real-time applications. Despite its reduced size, Distil-Whisper maintains a high level of accuracy, achieving a Word Error Rate (WER) within 2.4% on short-form transcriptions.
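The size and speed figures above are simple ratios, and a few lines of Python make them easy to check. This is only an illustrative sketch: the function names are hypothetical, and the estimate assumes the speed factor holds constant and ignores network and queueing time.

```python
def processing_seconds(audio_seconds: float, speed_factor: float = 240.0) -> float:
    """Estimate transcription time for a clip at a given real-time speed factor."""
    if speed_factor <= 0:
        raise ValueError("speed_factor must be positive")
    return audio_seconds / speed_factor


def size_reduction(small_params: float, large_params: float) -> float:
    """Fraction by which the smaller model shrinks relative to the larger one."""
    return 1.0 - small_params / large_params


# One hour of audio at the 240x speed factor quoted in the article.
print(f"{processing_seconds(3600):.1f} s")        # 15.0 s

# 756 million parameters versus 1.55 billion.
print(f"{size_reduction(756e6, 1.55e9):.1%}")     # 51.2%
```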

Key Features and Benefits

  • Speed and Efficiency: Distil-Whisper’s 240x real-time speed factor allows for incredibly fast processing, making it ideal for applications requiring quick responses, such as real-time customer service chatbots and voice-controlled interfaces.
  • Reduced Model Size: The model’s 51% smaller size compared to Whisper Large V3 translates into lower computational costs and faster deployment times, without a significant loss in accuracy.
  • Robustness to Noise: The model is designed to perform well in noisy environments, with 1.3x fewer hallucination errors than Whisper Large V3 and insertion errors reduced by 2.1%.
  • Compatibility: Distil-Whisper is compatible with popular Whisper libraries, making it easy to integrate into existing workflows and applications.
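For readers unfamiliar with the accuracy terms used here, Word Error Rate counts the substitutions, deletions, and insertions needed to turn the reference transcript into the model's output, divided by the number of reference words. The sketch below is a generic word-level edit-distance calculation, not the evaluation script behind the article's figures; the function name and the sample sentences are made up for illustration.

```python
def wer_breakdown(reference: str, hypothesis: str) -> dict:
    """Word Error Rate plus substitution, deletion, and insertion counts."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # dp[i][j] = (total_edits, substitutions, deletions, insertions)
    # needed to turn ref[:i] into hyp[:j].
    dp = [[(0, 0, 0, 0)] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(1, len(ref) + 1):
        dp[i][0] = (i, 0, i, 0)   # only deletions
    for j in range(1, len(hyp) + 1):
        dp[0][j] = (j, 0, 0, j)   # only insertions

    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            if ref[i - 1] == hyp[j - 1]:
                dp[i][j] = dp[i - 1][j - 1]   # words match, no edit
                continue
            t, s, d, n = dp[i - 1][j - 1]
            substitution = (t + 1, s + 1, d, n)
            t, s, d, n = dp[i - 1][j]
            deletion = (t + 1, s, d + 1, n)
            t, s, d, n = dp[i][j - 1]
            insertion = (t + 1, s, d, n + 1)
            # Tuples compare by total edits first, so min() keeps the cheapest path.
            dp[i][j] = min(substitution, deletion, insertion)

    total, subs, dels, ins = dp[len(ref)][len(hyp)]
    return {
        "wer": total / len(ref),
        "substitutions": subs,
        "deletions": dels,
        "insertions": ins,
    }


# A repeated word (an insertion) and one wrong word (a substitution)
# against a five-word reference.
print(wer_breakdown("turn on the kitchen lights",
                    "turn on the the kitchen light"))
```

An insertion is a word the model emitted that was never spoken, which is why repeated or hallucinated words show up in that count.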

Use Cases

Distil-Whisper can be used across a wide range of industries to enhance AI applications, such as:

  • Real-Time Customer Service: Quickly transcribe and respond to customer inquiries in real time, improving customer satisfaction and efficiency.
  • Automated Speech-to-Text: Ideal for industries like healthcare, finance, and education, where accurate and fast transcription is critical.
  • Voice-Controlled Interfaces: Enhance user experiences in smart homes, cars, and other devices with accurate speech recognition.
  • Media Transcriptions: Simplify the process of transcribing audio and video recordings, allowing media professionals to focus on content creation and analysis.
  • Meeting Transcriptions: Integrate with LLMs to transcribe and summarize meeting recordings, generating actionable insights and decisions.
  • Insurance Claims Processing: Improve service by transcribing interviews and calls, streamlining the claims process.

Pricing

Distil-Whisper is available at a competitive price of $0.02 per hour of audio processed. This makes it an affordable option for developers and enterprises looking to enhance their speech recognition capabilities without incurring significant costs.

From October 1, 2024, the price for Whisper Large V3 will increase to $0.111 per hour for on-demand GroqCloud™ users. Additionally, there will be a minimum charge for requests under 10 seconds, equating to $0.01 per 18,000 requests on Distil-Whisper.

Performance Metrics

  • Speed Factor: Distil-Whisper offers a speed factor of 240x real-time, making it the fastest implementation of Whisper models.
  • Cost Efficiency: At $0.333 per 1,000 minutes of audio, which is the same rate as $0.02 per hour, Distil-Whisper provides an affordable solution for high-volume transcription needs.
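The per-hour and per-1,000-minute prices are the same rate in different units, and a quick calculation shows how they relate. This is a minimal sketch using only the prices quoted in the article; the function and constant names are illustrative, and it does not model the minimum charge for short requests.

```python
DISTIL_WHISPER_USD_PER_HOUR = 0.02     # price quoted in the article
WHISPER_LARGE_V3_USD_PER_HOUR = 0.111  # announced price from October 1, 2024


def transcription_cost(audio_minutes: float, usd_per_hour: float) -> float:
    """Cost in USD of transcribing the given number of audio minutes."""
    return audio_minutes / 60.0 * usd_per_hour


# 1,000 minutes of audio on each model.
print(f"${transcription_cost(1000, DISTIL_WHISPER_USD_PER_HOUR):.3f}")    # $0.333
print(f"${transcription_cost(1000, WHISPER_LARGE_V3_USD_PER_HOUR):.3f}")  # $1.850
```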

Getting Started

Developers can access Distil-Whisper today through the GroqCloud™ Developer Console. Whether you are building new applications or enhancing existing ones, Distil-Whisper offers a powerful and efficient tool for all your speech recognition needs. Start building today and see how Distil-Whisper can transform your AI applications.
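As a rough idea of what a first request might look like, the sketch below builds a multipart upload using only the Python standard library. It rests on several assumptions that are not stated in this article: that GroqCloud exposes an OpenAI-compatible transcription endpoint at the URL shown, that the API key is read from a GROQ_API_KEY environment variable, and that the JSON response carries the transcript in a "text" field. The file name meeting.wav is a placeholder. Check the GroqCloud documentation before relying on any of these details; the official SDK is likely more convenient.

```python
import json
import os
import urllib.request
import uuid

# Assumed OpenAI-compatible endpoint; verify against the GroqCloud docs.
GROQ_TRANSCRIPTIONS_URL = "https://api.groq.com/openai/v1/audio/transcriptions"


def build_transcription_request(audio_bytes: bytes, filename: str, api_key: str,
                                model: str = "distil-whisper-large-v3-en"):
    """Build (but do not send) a multipart/form-data transcription request."""
    boundary = uuid.uuid4().hex
    parts = []
    # Plain form fields: model ID from the article, JSON response assumed.
    for name, value in (("model", model), ("response_format", "json")):
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode()
        )
    # The audio file itself.
    parts.append(
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f'Content-Type: application/octet-stream\r\n\r\n'.encode()
    )
    parts.append(audio_bytes)
    parts.append(f"\r\n--{boundary}--\r\n".encode())

    return urllib.request.Request(
        GROQ_TRANSCRIPTIONS_URL,
        data=b"".join(parts),
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


if __name__ == "__main__":
    api_key = os.environ.get("GROQ_API_KEY")   # assumed variable name
    audio_path = "meeting.wav"                 # placeholder file
    # Only send the request when both a key and an audio file are present.
    if api_key and os.path.exists(audio_path):
        with open(audio_path, "rb") as f:
            request = build_transcription_request(f.read(), audio_path, api_key)
        with urllib.request.urlopen(request) as response:
            print(json.load(response)["text"])
```

Keeping request construction separate from sending makes it possible to inspect the headers and body without spending API credits.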

For more information or assistance in integrating Distil-Whisper into your projects, feel free to reach out at hello@deskinvestor.com.
